Department of Biological Science, Northern Arizona
University, Flagstaff, Arizona 86011-56401;
Department of Epidemiology and Community Health, School of
Veterinary Medicine, Louisiana State University, Baton Rouge,
Louisiana 708032; and Environmental
Molecular Biology Group, Los Alamos National Laboratory, Los
Alamos, New Mexico 875453
 |
INTRODUCTION |
Bacillus anthracis is the
causative organism of the potentially fatal disease anthrax. Virulent
forms of B. anthracis carry two large plasmids, pX01 (ca.
174 kb) and pX02 (ca. 95 kb). Virulence factors include toxin and
capsule production, encoded on pX01 and pX02, respectively. The anthrax
toxin is composed of three proteinaceous subunits: (i) lethal factor
(LF), the toxin component thought to kill host cells by disrupting the
mitogen-activated protein kinase pathway (2); (ii) edema
factor (EF), an adenylyl cyclase that causes skin edema in the infected
host (6); and (iii) protective antigen (PA), which binds to
eucaryotic cell surface proteins, forms homoheptamers, and then binds
to and internalizes EF and LF.
The structure and function of PA have been well described. The entire
PA gene (pag) sequence has been published and is available in GenBank (accession no. M22589) (12). The
three-dimensional structure has also been solved and is available in
the NCBI Entrez 3D database (MMDB no. 6980) (10). Finally,
antibody-binding experiments have been used to define regions of the PA
protein critical to cell surface attachment as well as LF binding
(8). Missing from the literature until now was a population
study of pag from diverse strains of B. anthracis
to define the natural variation in this important gene.
In past studies, plasmid-specific genetic variation in B. anthracis has been largely ignored. A recent population study,
based on chromosomal markers, demonstrated that B. anthracis
is one of the most monomorphic bacterial species known (5).
This chromosomal amplified fragment length polymorphism study examined
ca. 6.3% of the B. anthracis genome for length variations
and ca. 0.36% for point mutations. However, due to ambiguities arising
from the absence of one or both of the plasmids, plasmidal data were omitted from the final results. Studies of pX01 diversity and especially of pag are essential to understanding evolution
of pathogenesis in B. anthracis. Likewise, comparative
studies of plasmid-based versus chromosomal variation can provide
insight into the frequency of horizontal plasmid transfer in natural
B. anthracis populations.
In this study we have sequenced the entire pag gene from 26 of the most diverse strains of B. anthracis (5).
These sequences were aligned and analyzed for point mutations then
studied phylogenetically to determine if the pag data are
consistent with chromosomal diversity groups. Additionally, we
sequenced a 307-bp variable region of pag from 10 Sverdlovsk
anthrax victim necropsy samples (4) in order to identify
novel pag sequences.
 |
MATERIALS AND METHODS |
B. anthracis DNA.
Culture conditions, DNA
isolation methods, and diversity groups are described in reference
5. Necropsy tissue DNA was isolated as described by
Jackson et al. (4).
PCR amplification of DNA.
Table
1 contains the sequences for all primers
used for this project. These were designed from the published
pag sequence (GenBank accession no. M22589) and synthesized
by Gibco/BRL, Bethesda, Md. All primer positions cited throughout this
report are based on this GenBank sequence. Two DNA fragments, together totaling 2,531 bp of sequence, were initially amplified to provide a
pag sequencing template from the 26 B. anthracis
strains. PA-1F and PA-1R were used to amplify a 1,191-bp fragment
containing the 5' portion of PA. This included 131 bp of upstream
flanking sequence. PA-2F and PA-2R were used to amplify a 1,449-bp
fragment containing the 3' portion of PA. This included 106 bp of
downstream flanking sequence. The two fragments contained 109 bp of
overlapping sequence near the middle of the gene. Fifty-microliter PCR
mixtures contained 1× PCR buffer (20 mM Tris [pH 8.4], 50 mM KCl;
Gibco/BRL), 0.10 mM deoxynucleoside triphosphates, 4 mM
MgCl2, ~0.2 ng of template DNA per µl, 0.04 U of
Taq DNA polymerase (Gibco/BRL) per µl, and 0.4 µM
forward and reverse primers, adjusted to 50 µl with filtered
(0.2-µm-pore-size filter) 17.8 mohm E-pure water. Reactions were
heated to 94°C for 5 min and then subjected to 35 cycles, each
consisting of 30 s at 94°C, 30 s at 62°C, and 1.5 min at
72°C. This was followed by heating to 72°C for 5 min to complete
primer extension. PCR products were purified through Qiaquick
purification minicolumns (Qiagen Inc., Valencia, Calif.) and then
quantified on ethidium bromide-stained 1.25% agarose-Tris-acetate-EDTA gels. These purified fragments were then used in subsequent sequencing reactions. PCR amplification of necropsy sample DNA was performed as
described by Jackson et al. (4), using primers PA-5F, PA-5R, PA-5Fnest, and PA-5Rnest (Table 1).
DNA sequencing.
PCR products were sequenced on an ABI model
377 fluorescence sequencer using a PRISM Ready Reaction BigDye
terminator cycle sequencing kit (both from Perkin-Elmer/Applied
Biosystems Inc., Foster City, Calif.). Sequences were aligned and
analyzed with Sequence Navigator software (Perkin-Elmer/Applied Biosystems).
Cladistic analysis.
Cladistic analysis was performed on the
pag sequences by using maximum parsimony with PAUP 3.1.1 software (developed by David L. Swofford, Illinois Natural History
Survey) and manual examinations of sequence polymorphisms.
Three-dimensional analysis.
The PA structure has been solved
and is available on the NCBI Entrez 3D database (MMDB no. 6980)
(10). Amino acid residues shown to vary among strains were
identified on the three-dimensional structure, and then physical
distances from the putative LF binding region of PA domains 3 and 4 were estimated by using MAGE 4.5 software (developed by David
Richardson, Biochemistry Department, Duke University, Durham, N.C.).
 |
RESULTS |
Sequence alignment of the entire PA gene from 26 strains
representative of the five B. anthracis diversity groups
(5) (Table 2) revealed five
point mutations, three synonymous and two missense, shown in Table
3. All five mutations are transitions.
Two of the synonymous mutations occur only once. However, the other
differences are present with frequencies ranging from 3/26 to 20/26.
The two missense mutations are located adjacent to a highly antigenic region crossing the junction between PA domains 3 and 4 shown to be
critical to LF binding (Fig. 1) (8,
10). The different mutational combinations observed in this study
give rise to six PA genotypes and three PA phenotypes (Table
4).

View larger version (8K):
[in this window]
[in a new window]
|
FIG. 1.
Model of pag from B. anthracis. S,
region of gene that codes for cleaved signal region; NP-F and NP-R,
forward and reverse nested primers used to amplify variable regions
from the Sverdlovsk tissue samples; black vertical arrows, missense
mutations; grey vertical arrows, synonymous mutations; HAR, highly
antigenic region important to LF binding (8, 10). Dom.,
domain.
|
|
Cladistic analysis of the 26 pag sequences was performed by
the maximum parsimony method to produce a gene tree (Fig.
2A). The 26 strains grouped into four
clades of 3, 3, 6, and 14 individuals. These groups were defined by
three synapomorphic (informative) differences. In addition, we
identified two apomorphic (uninformative) nucleotide differences
(mutations 2 and 6) that separated two strains (28 and 33) from others
in their clades. These mutations are identified on the respective
branches but were not used to isolate these strains from their groups.
The clades and topology identified by this tree were mostly congruent
with those generated from chromosomal markers (Fig. 2B) (5).
The only aberrations are the following. (i) Chromosomal data from
strain A24 indicate that it is of the Southern Africa lineage
(5), but the pag data place this strain with the
Western North America (WNA) diversity group (one mutational step away);
(ii) chromosomal data from strain F-1 indicate that it is of the Vollum
lineage, but the pag data place this strain with the WNA
diversity group (again, one mutational step away); and (iii)
chromosomal markers indicate that the Kruger samples, although very
similar, are genetically distinct from the WNA lineage. However, the
pag gene tree did not resolve these two distinct groups. It
should be noted that chromosomal markers indicate that Vollum and WNA
are sister groups and, likewise, that Kruger and WNA are closely
related. Only with strain A24 do the pag data suggest that
strains from two distantly related groups (based on chromosomal
markers) are closely related.

View larger version (28K):
[in this window]
[in a new window]
|
FIG. 2.
Cladistic analysis of the 26 diverse strains. (A)
Unrooted, maximum parsimony gene tree based on pag data
developed in this study; (B) strain diversity groups based on
chromosomal AFLP data described by Keim et al. (5). Branch
mutations are numbered as described for Table 3.
|
|
To determine the pag genotypes and phenotypes of the
strain(s) involved in the Sverdlovsk incident, nested PCR primers
(Table 1) were designed to amplify and sequence a 307-bp region of
pag. This region spans the junction between PA domains 3 and
4 where much of the variation was observed. This analysis uncovered two additional transition mutations (3 and 7 in Table 3). One was synonymous, while the other was a novel missense mutation resulting in
a phenylalanine
leucine change. These changes resulted in two additional genotypes and one new phenotype (Table 4). The amino acid
change was, again, immediately adjacent to the highly antigenic region
of PA domains 3 and 4 (Fig. 1). Repetitive sequencing of these tissues
uncovered multiple PA genotypes within some of the individual necropsy
samples. Together, five different PA genotypes were observed in the
Sverdlovsk samples, with some samples showing evidence of infection by
multiple strains (Table 5). This finding is consistent with the results of Jackson et al. (4).
Figure 3 is an unrooted phylogenetic tree
demonstrating the five mutational steps leading to the six PA genotypes
and three PA phenotypes identified in this study. Additionally, the
putative positions of the Sverdlovsk samples are shown. However,
because the Sverdlovsk identifications were based on just the 307-bp
region around the antigenic portion of PA domains 3 and 4, these
placements are only tentative.

View larger version (29K):
[in this window]
[in a new window]
|
FIG. 3.
Unrooted phylogenetic tree of PA genotypes. Open boxes
show the three PA phenotypes identified; shaded boxes show the possible
positions of the Sverdlovsk genotypes, VIISvd and
VIIISvd. Synonymous mutations are shown in open circles,
and missense mutations are shown in closed circles. Each mutation is
described in Table 3 and the phenotypes are described in Table 4.
|
|
Three-dimensional analysis of all the amino acid changes observed in
this study (mutations 3, 4, and 5 in Table 3) indicated that these
changes are not only close sequentially but also very close in
three-dimensional space to the antigenic region important for LF
binding. Mutation 3 (Phe to Leu), is ca. 11.2 Å, mutation 4 (Pro to
Ser) is ca. 20.3 Å, and mutation 5 (Ala to Val) is ca. 19.0 Å from
the central portion of this region. These spatial distances were
estimated solely on peptide backbone-to-peptide backbone relationships.
However, when the three-dimensional spaces occupied by the side chains
of the amino acids were considered, changes were found to affect
residues as close as 6.9 Å from the central amino acids of this
critical antigenic region.
 |
DISCUSSION |
The protective antigen protein is central to the virulence
associated with anthrax toxin. Elucidation of PA variation and its
encoding gene could lead to a better understanding of B. anthracis virulence and evolution. Until now, pag had
been sequenced in its entirety only from a single B. anthracis strain (12). In this study, a detailed
analysis of the entire pag sequence (2,294 bp) from 26 diverse B. anthracis strains revealed only five point mutations, corroborating the high degree of genetic monomorphism found
by Keim et al. (5).
Among these mutations, there is a disproportionate number of missense
(two) to synonymous (three) changes. A common ratio of missense to
synonymous mutations is approximately 1:5; here we see a ratio more
than threefold greater (7). These missense mutations are
located near a highly antigenic region, critical to LF binding. In
monoclonal antibody studies, Little et al. demonstrated that by
blocking an epitope between amino acids Ile-581 and Asn-601 (Fig. 1),
they could effectively block LF binding to PA (8). Three-dimensional analysis indicated that the missense mutations identified in our study are very close in three-dimensional space to
this antigenic region. While none of the three missense mutations were
dramatic, such as a change from an extremely hydrophobic to a
hydrophilic amino acid, the proline-to-serine change has the potential
to make important three-dimensional alterations, since proline
isomerization is known to play a critical role in protein folding.
Because of their close proximity, these amino acid changes have the
potential to effect LF binding, either directly or indirectly, within
an infected host. The grouping of these missense mutations near this
antigenic region and the disproportionate number of missense to
synonymous mutations suggests adaptive variation. One of the two new
mutations identified in the Sverdlovsk victims' tissues was found to
be a novel missense mutation located, sequentially and three
dimensionally, near the highly antigenic region of the junction between
PA domains 3 and 4. When these mutations are included with those
identified in the 26-sample survey, the ratio of missense to synonymous
mutation is increased to 3.8:5.
The amplification and sequencing of the 307-bp pag fragment
from the Sverdlovsk tissue samples suggested that at least five different strains of B. anthracis were present in the
samples and that some of the individual victims had been infected with multiple strains. These data corroborate earlier work with the vrrA locus that suggested that multiple strains of anthrax
had been released during the 1979 incident (1, 4). Besides
the Russian vaccine strain STI-1, included in this study, these tissue samples are a rare glimpse at the different strains of B. anthracis that are thought to be endemic in the vast region of the
former Soviet Union. The fact that two previously unobserved mutations were found in the Sverdlovsk samples stresses the importance of collecting and analyzing B. anthracis strains from areas
where anthrax is endemic but largely uncharacterized by molecular
genetic analysis.
Independent cladistic analysis of pX01 by using the pag
sequence has enabled us to estimate the likelihood of horizontal
transfer of this plasmid between different B. anthracis
strains in natural populations. Although horizontal transfer in
Bacillus spp. is possible under laboratory conditions, the
similarity of the cladistic grouping from the pag data to
that of the chromosomal markers suggests that the differences in
pag arose from evolution within particular strain lineages
and were not a result of horizontal pX01 transfer. The single possible
exception is associated with the A24 sample, which chromosomally is
related to the Southern Africa strains, while the pag data
for this strain are consistent with Kruger-WNA. This is either a result
of convergent evolution or evidence of horizontal pX01 transfer.
Further, it should be noted that the data presented in this report do
not rule out the potential for horizontal transfer of plasmid pX01
between closely related strains within an infected host.
The unrooted phylogenetic tree (Fig. 3) is a useful tool for
demonstrating the relationships between the different PA genotypes. However, it is not meant to infer an evolution toward a particular form
of PA. Although distant homologues from other gram-positive bacteria
are cited (3, 9, 11), none of these is close enough to root
a B. anthracis PA phylogenetic tree. Without an ancestral PA
sequence, one is unable to determine which PA phenotypes are ancestral
and which are derived.
We thank James M. Schupp, Kimothy L. Smith, Debra M. Adair, and
Karen K. Hill for technical support.
| 1.
|
Andersen, G. L.,
J. M. Simchock, and K. H. Wilson.
1996.
Identification of a region of genetic variability among Bacillus anthracis strains and related species.
J. Bacteriol.
178:377-384[Abstract/Free Full Text].
|
| 2.
|
Duesbery, N. S.,
C. P. Webb,
S. H. Leppla,
V. M. Gordon,
K. R. Klimpel,
T. D. Copeland,
N. G. Ahn,
M. K. Oskarsson,
K. Fukasawa,
K. D. Paull, and G. F. Vande Woude.
1998.
Proteolytic inactivation of MAP-kinase-kinase by anthrax lethal factor.
Science
280:734-737[Abstract/Free Full Text].
|
| 3.
|
Gibert, M.,
S. Perelle,
G. Daube, and M. R. Popoff.
1997.
Clostridium spiroforme toxin genes are related to C. perfringens iota toxin genes but have a different genomic localization.
Syst. Appl. Microbiol.
20:337-347.
|
| 4.
|
Jackson, P. J.,
M. E. Hugh-Jones,
D. M. Adair,
G. Green,
K. K. Hill,
C. R. Kuske,
L. M. Grinberg,
F. A. Abramova, and P. Keim.
1998.
PCR analysis of tissue samples from the 1979 Sverdlovsk anthrax victims: the presence of multiple Bacillus anthracis strains in different victims.
Proc. Natl. Acad. Sci. USA
95:1224-1229[Abstract/Free Full Text].
|
| 5.
|
Keim, P.,
A. Kalif,
J. Schupp,
K. Hill,
S. E. Travis,
K. Richmond,
D. M. Adair,
M. Hugh-Jones,
C. R. Kuske, and P. Jackson.
1997.
Molecular evolution and diversity in Bacillus anthracis as detected by amplified fragment length polymorphism markers.
J. Bacteriol.
179:818-824[Abstract/Free Full Text].
|
| 6.
|
Leppla, S. H.
1982.
Anthrax toxin edema factor: a bacterial adenylate cyclase that increases cyclic AMP concentrations of eukaryotic cells.
Proc. Natl. Acad. Sci. USA
79:3162-3166[Abstract/Free Full Text].
|
| 7.
|
Li, W., and D. Graur.
1991.
Fundamentals of molecular evolution.
Sinauer Associates, Inc., Sunderland, Mass.
|
| 8.
|
Little, S. F.,
J. M. Novak,
J. R. Lowe,
S. H. Leppla,
Y. Singh,
K. R. Klimpel,
B. C. Ligerding, and A. M. Friedlander.
1996.
Characterization of lethal factor binding and cell receptor binding domains of protective antigen of Bacillus anthracis using monoclonal antibodies.
Microbiology
142:707-715[Abstract].
|
| 9.
|
Perelle, S.,
M. Gibert,
P. Boquet, and M. R. Popoff.
1993.
Characterization of Clostridium perfringens iota-toxin genes and expression in Escherichia coli.
Infect. Immun.
61:5147-5156[Abstract/Free Full Text].
|
| 10.
|
Petosa, C.,
R. J. Collier,
K. R. Klimpel,
S. H. Leppla, and R. C. Liddington.
1997.
Crystal structure of the anthrax toxin protective antigen.
Nature
385:833-838[Medline].
|
| 11.
| Selvapandiyan, A. 1998. Direct submission. Genbank
accession no. Y17158.
|
| 12.
|
Welkos, S. L.,
J. R. Lowe,
F. Eden-McCutchan,
M. Vodkin,
S. H. Leppla, and J. J. Schmidt.
1988.
Sequence and analysis of the DNA encoding protective antigen of Bacillus anthracis.
Gene
69:287-300[Medline].
|