Previous Article | Next Article ![]()
Journal of Bacteriology, January 2005, p. 619-628, Vol. 187, No. 2
0021-9193/05/$08.00+0 doi:10.1128/JB.187.2.619-628.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Microbial Evolution Laboratory, National Food Safety and Toxicology Center,1 Infectious Disease Unit, Department of Internal Medicine, Michigan State University, East Lansing, Michigan,4 Microbial Diseases Laboratory, Division of Communicable Disease Control, California Department of Health Services, Richmond, California,2 Centers for Disease Control and Prevention, Atlanta, Georgia3
Received 30 August 2004/ Accepted 8 October 2004
|
|
|---|
7.4% of the nucleotide sites from pathogenic E. coli strains and at 15.7% from Salmonella enterica serotype Typhimurium. Interestingly, E. albertii strains were found to be closely related to strains of Shigella boydii serotype 13 (Shigella B13), a distant relative of E. coli representing a divergent lineage in the genus Escherichia. Analysis of homologues of intimin (eae) revealed that the central conserved domains are similar in E. albertii and Shigella B13 and distinct from those of eae variants found in pathogenic E. coli. Sequence analysis of the cytolethal distending toxin gene cluster (cdt) also disclosed three allelic groups corresponding to E. albertii, Shigella B13, and a nontypeable isolate serologically related to S. boydii serotype 7. Based on the synonymous substitution rate, the E. albertii-Shigella B13 lineage is estimated to have split from an E. coli-like ancestor
28 million years ago and formed a distinct evolutionary branch of enteric pathogens that has radiated into groups with distinct virulence properties. |
|
|---|
Further investigation led to the recognition of these H. alvei-like strains as a new Escherichia species, Escherichia albertii, based on the results of DNA-DNA hybridization and 16S rRNA gene sequence data (18). DNA relatedness, assessed by DNA-DNA hybridization, was highest (ranging from 54 to 64%) among E. albertii, E. coli, and Shigella flexneri, lowest (9 to 17%) between E. albertii and H. alvei, and intermediate (16 to 45%) among E. albertii and the four Escherichia species E. blattae, E. fergusonii, E. hermanii, and E. vulneris (18). Analysis of 16S rRNA gene sequences reinforced the conclusion that E. albertii does not belong to H. alvei but is most closely related to Escherichia and Shigella (18). Additionally, the structure of the O chain of the lipopolysaccharide of E. albertii has been described and does not resemble that of any known lipopolysaccharide O chain from E. coli or H. alvei; however, based on the types of sugar residues and acidity, it could fit into either group (13).
Based on genotypic analysis, another bacterium, Shigella boydii serotype 13 (Shigella B13), has also been shown to be a distant relative of E. coli representing a divergent lineage in the genus Escherichia (5, 26, 40). E. coli and Shigella have been described as one genetic species based on DNA-DNA hybridization patterns, and it has been demonstrated that although the O antigens of Shigella B13 exhibit a reciprocal cross-reaction with E. coli O28 antigens, they are genetically distinct (5). Additionally, Shigella B13 is the only member of the E. coli-Shigella group that can be reliably separated by bulk DNA-DNA hybridization techniques (5).
Nucleotide sequencing of multiple conserved genes has supported the hypothesis that E. coli and Shigella are the same genetic species and has also provided evidence that Shigella B13 belongs to a divergent branch outside the diversity of E. coli and Shigella. Furthermore, three main groups of Shigella strains are apparent, each including serotypes from the traditional Shigella species: S. boydii, S. flexneri, S. sonnei, and S. dysenteriae (26, 40). An independent multilocus sequence typing (MLST) analysis of a variety of serotypes of Shigella and enteroinvasive E. coli (EIEC) strains confirmed the presence of these three groups and supported the placement of Shigella serotypes into distinct groups within the diversity of the E. coli population; this study also placed nontypeable Shigella strains into these three groups (6).
Attachment and effacement appear to be critical components of the virulence of E. albertii (2, 3, 19). In Citrobacter rodentium and diarrheagenic E. coli, attaching-effacing lesion formation is mediated by intimin, an outer membrane protein encoded by the eae gene, and other components of the locus of enterocyte effacement pathogenicity island (34). Intimin is a highly polymorphic protein with more than 20 distinct subtypes and has been discovered in clinical isolates and animal sources (1, 25, 41, 54). The C-terminal region of the protein is the surface-exposed, immunogenic domain, which functions in binding to epithelial cells via the translocated bacterial receptor, Tir (30). In contrast, the central region is a highly conserved transmembrane domain with homology to the invasins of pathogenic Yersinia (53). Sequence comparisons have shown that intimin subtypes are encoded by distinct eae alleles with mosaic structures that presumably have arisen through lateral gene transfer and recombination (33, 48).
A second putative virulence factor discovered in E. albertii is cytolethal distending toxin (CDT) (38). In addition to strains of pathogenic E. coli and Shigella (22, 24), CDT is dispersed phylogenetically and has been found in Campylobacter spp. (23), Haemophilus ducreyi (8), Actinobacillus actinomycetemcomitans (46), and Helicobacter spp. (7, 50, 52). Cultured mammalian cell lines exposed to CDT become arrested in the G1 or G2 phase of the cell cycle, in association with cell distension and eventually death (9, 27, 28, 36). In all bacterial species in which CDT activity has been demonstrated, three closely linked genes (cdtA, cdtB, and cdtC) have been associated with cytotoxic activity (28, 36, 37, 44). Although genetic studies have indicated that all three genes are necessary to transfer full activity to a noncytotoxic E. coli strain (9, 28, 36, 37), cdtB has been implicated as encoding the enzymatically active subunit (27, 28). The CdtB subunit is homologous to a class of phosphodiesterases that includes nucleases, and it has been demonstrated that CdtB damages host cell DNA through DNase I-like activity in vitro (12, 17, 27, 32). This activity triggers the DNA damage checkpoint response (9), which results in cell cycle arrest (9, 12, 27, 28). The CdtA and CdtC subunits function together to mediate the binding of the AB2 holotoxin to eukaryotic cells and the subsequent delivery of CdtB to the cytoplasm (10, 29, 35).
The purpose of the study reported here was to elucidate the phylogenetic relationships of E. albertii and its relatives that express Shigella B13- and S. boydii serotype 7 (Shigella B7)-related antigens. In addition, we characterize the genes encoding the putative virulence factors intimin and CDT, whose role, if any, in the pathogenesis of E. albertii and relatives is not known. We propose a model, based on MLST analysis and supported by virulence gene sequences, in which the divergence of the E. albertii lineage preceded the radiation of pathogenic E. coli and Shigella. We also demonstrate that the E. albertii lineage includes other unusual strains, such as divergent Shigella B13 (5) and two nontypeable clinical isolates.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Sources of E. albertii and Shigella strains
|
MLST analysis. The nucleotide sequences of internal fragments of the following genes were obtained for MLST analysis: arcA, aroE, aspC, clpX, cyaA, dnaG, fadD, grpE, icdA, lysP, mdh, mtlD, mutS, rpoS, and uidA. The analysis included 34 E. coli and Shigella strains and 2 Salmonella strains for comparison. Information on primers and protocols for MLST can be found at website http://www.shigatox.net/mlst. Sequence analysis was performed with MEGA2 software (version 2.0; Pennsylvania State University, University Park).
16S rRNA gene sequencing. The forward primer S-D-Bact-0008-a-S-20 and the reverse primer S-*-Univ-1492-b-A-21 (45) were used to amplify bacterial 16S rRNA genes by PCR under the following conditions: 94°C for 10 min; 35 cycles of 92°C for 1 min, 58°C for 1 min, and 72°C for 1 min; and 72°C for 5 min. Additional internal sequencing primers included 16SintF1 (5'-CGG AGG GTG CAA GCG TTA AT-3'), 16SintF2 (5'-ACC CTT ATC CTT TGT TGC-3'), 16SintR1 (5'-CCA GGT AAG GTT CTT CGC GT-3'), and 16SintR2 (5'-AGT ACT TTA CAA CCC GAA GG-3').
Diagnostic multiplex PCR. The following primers were designed and used for diagnostic multiplex PCR: clpX_28F (5'-TGG GCT CGA GTT GGG CA-3'), clpX_411R (5'-TCC TGC TGC GGA TGT TTA CG-3'), lysP_107F (5'-GGG CGC TGC TTT CAT ATA TTC TT-3'), lysP_358R (5'-TCC AGA TCC AAC CGG GAG TAT CAG GA-3'), mdh_50F (5'-CTG GAAGGC GCA GAT GTG GTA CTG ATT-3'), and mdh_164R (5'-CTT GCT GAA CCA GAT TCT TCA CAA TAC CG-3').
eae sequencing. The entire eae gene, which encodes intimin, was sequenced for representative strains 19982, K-694, C-425, 3555-77, 3097-02, and K-1. PCR and sequencing primers included cesT-F9 (5'-TCA GGG AAT AAC ATT AGA AA-3'), eae-F1 (5'-ACT CCG ATT CCT CTG GTG AC-3'), eae-R3 (5'-TCT TGT GCG CTT TGG CTT-3'), and escD-R1 (5'-GTA TCA ACA TCT CCC GCC CA-3'). Internal sequencing primers included eae-F01s (5'-AAG CAA CAT GAC CGA TGA CA-3'), eae-F02s (5'-GGC GAA TAC TGG CGA GAC TA-3'), eae-F05s (5'-AGT TAA ATG CAA ATG GCG GT-3'), eae-F06s (5'-GTA ACG GAC TTT ACG GCT GAT A-3'), eae-F07s (5'-GCC TAA TAG CAT AGT GAG CGT T-3'), eae-F08s (5'-GTG ATA ATC AAA CCG CAA CT-3'), eae-F09s (5'-CGA CCT TAG GGA CGT TAA GT-3'), eae-F10s (5'-GCT TGT TAG TGC CCG AGT TC-3'), eae-R02s (5'-GGG CGC TCA TCA TAG TCT TT-3'), eae-R05s (5'-GGT AAT AAT GCC TGT ATT AG-3'), eae-R06s (5'-AGT GAG CGA GAC TGA AGT TG-3'), eae-R08s (5'-GTC GCG GTA TAA GTA AT-3'), eae-R09s (5'-TTC CTG GGA ATA ATG TTG AT-3'), eae-R10s (5'-TGG CAT GAG CTT ACA GCA TC-3'), eae-R11s (5'-CGG ATA ACG GCT TAC CAT CT-3'), and eae-R12s (5'-GCA TTA AGC GCT GAA GTC AT-3').
cdt sequencing. Nucleotide sequences were obtained for an internal fragment of putative virulence locus cdtB by using the primers VAT2 and WMI1 (39). Sequences for the cdtABC operon were determined for the four representative E. coli strains 19982, K-694, 3097-02, and K-1. The following additional primers were used for PCR and sequencing: cdt_5_flank_F (5'-AGG AGA TTT GCC AAT GTG AT-3'), cdt_1101_B_R (5'-GTA TCT ACA GCC GTT GAT GG-3'), cdt_1248_B_R (5'-GGC AGG CCG ATG AGG TG-3'), cdt_2057_R (5'-CGT TTG TTG AAG GTA TGA GAG-3'), and cdt_1815_F (5'-TCT ATT CAC ATT CCG CA-3'). The following primers were used for internal sequencing: cdt_584_R (5'-CCC GGG AAG CTT GTC ATA-3') and cdt_564_F (5'-CTT TCT ACT GCA CCT TGC CA-3'). Primer randR1 (5'-TGC CGG GCA A-3') was used for PCR.
CDT activity assays.
CDT activity assays were performed by the method of Young et al. (51, 52). Bacterial strains were grown overnight with shaking at 37°C in Luria-Bertani broth. Total bacterial proteins were solubilized with a nonionic detergent (bacterial protein extraction reagent; Pierce Chemical Co., Rockford, Ill.) according to the recommendations of the manufacturer. Bacteria were pelleted by centrifugation at 2,040 x g for 10 min, suspended in 300 µl of bacterial protein extraction reagent, and vortexed. Insoluble material was removed by centrifugation at 13,800 x g for 5 min, and soluble proteins contained in the supernatant were collected and stored at 80°C. HeLa cells were used to seed 13-mm-diameter circular glass coverslips in 24-well tissue culture plates at a density of
2 x 103 per well. Ten microliters of bacterial protein extract was added to each well, and the plates were incubated in 5% CO2 at 37°C. At appropriate time points, coverslips were washed with phosphate-buffered saline and then fixed and stained with a Hema 3 stain set (Fisher Scientific, Hampton, N.H.). CDT-positive E. coli 493/89 (22) and CDT-negative E. coli K-12 were used as positive and negative controls, respectively.
To confirm the CDT-positive results, CDT activity assays were also performed with Shiga toxin-insensitive CHO cells by the method of Scott and Kaper (44). CHO cells were maintained at 37°C in Ham's F-12 medium plus glutamine and supplemented with 10% fetal bovine serum, penicillin, and streptomycin. Cells were trypsinized and suspended at 2 x 104 per ml in Ham's F-12 medium with 1% fetal bovine serum, penicillin, and streptomycin. The cell suspension was added to 96-well plates at 150 µl per well. Bacterial cultures were grown with shaking overnight in Trypticase soy broth with 0.6% yeast extract. Aliquots (15 µl per well) of each test supernatant were added in triplicate to freshly plated CHO cells. Assay mixtures were incubated for 5 days at 37°C in 5% CO2 and then fixed and stained with the Hema 3 stain set.
Virulence gene profiling. Multilocus virulence gene profiling was performed by using a Beckman (Fullerton, Calif.) CEQ8000 genetic analysis system as described previously (47) with the following modification: additional labeled primer sets were used for more extensive profiling of strains to give a total of 30 genes for fragment analysis, including an mdh control. Several primer sets were obtained from IDT (Coralville, Iowa), Proligo (Boulder, Colo.), and Research Genetics (Huntsville, Ala.). Cycling conditions were modified to include an annealing temperature of 50°C and 40 amplification cycles. Additional primer pairs can be found at website http://www.shigatox.net/mlst.
Nucleotide sequence accession numbers. The nucleotide sequences determined in this study have been submitted to GenBank under accession numbers AY696659 to AY697031. These include accession numbers AY696752 to AY696755 for sequences for the cdtABC operon from E. coli strains 19982, K-694, 3097-02, and K-1, respectively.
|
|
|---|
1.3-kb) insertion sequence was present in the region of rpoS used for MLST analysis. The insertion sequence was removed for analysis. In Shigella B13 strain 616, rpoS could not be amplified with E. coli PCR primers.
The results of PCR and sequencing of housekeeping genes allowed for a comparison of 2,484 homologous codon positions in 14 genes (uidA not included). E. albertii strains were highly divergent from E. coli strains, differing on average at
7.4% of the nucleotide sites from pathogenic E. coli strains (7.3% for EPEC strain E2348/69 and 7.5% for E. coli O157:H7) and 15.7% from Salmonella enterica serotype Typhimurium LT2. The E. albertii strains were most closely related to a Shigella B13 strain (ATCC 12032) (6), differing at only 0.72% of the nucleotide sites.
To assess the generality of these findings, an additional 16 Shigella B13 strains, 1 nontypeable strain, and 1 Shigella B7-related strain were examined. Five of the Shigella B13 strains (5216-70, 3053-94, 3052-94, 3556-77, and 3557-77) were phylogenetically more closely related to members of the E. coli-Shigella group than to other Shigella B13 strains (Fig. 1). These Shigella B13 strains were the same strains in which uidA could be amplified with E. coli uidA PCR primers. Strain 5216-70 (Fig. 1B) was most closely related to a group of EIEC strains (EIEC group 2) (6), whereas strains 3053-94, 3052-94, 3556-77, and 3557-77 were phylogenetically most closely related to each other and formed a distinct cluster (Fig. 1C); this cluster appears to have descended from an ancestor that also gave rise to all Shigella and EIEC groups. For clarification, these five strains are hereafter referred to as atypical Shigella B13 strains. The remaining Shigella B13 strains, the nontypeable strain, and the Shigella B7-related strain were most closely related to the E. albertii strains and are hereafter collectively referred to as the E. albertii lineage (Fig. 1A).
![]() View larger version (45K): [in a new window] |
FIG. 1. Phylogenetic relationships of Shigella B13 strains and the E. albertii lineage. A neighbor-joining tree (MEGA2) was constructed by using the Tamura-Nei model with gamma correction for the distribution of rates based on 14 housekeeping genes. The E. albertii lineage (A) includes E. albertii, Shigella B13, and Shigella B7. On the basis of the rate of synonymous substitutions for E. coli and S. enterica (4.7 x 109 per site per year), the E. albertii lineage shared a common ancestor with E. coli and Shigella pathogenic groups 28 million years ago (mya). Atypical S. boydii strains express the Shigella B13 antigen and do not cluster with the E. albertii lineage but are more similar in their MLST profile to E. coli and Shigella groups. Shigella B13 strain 5216-70 belongs to EIEC group 2 (B), and four atypical Shigella B13 strains belong to their own cluster within the diversity of E. coli and Shigella (C). The number of strains examined is in parentheses. NT, nontypeable; EHEC, enterohemorrhagic E. coli.
|
1.1% (82 of 7,470) of nucleotide sites and
0.2% (6 of 2,490) of amino acid sites are variable. Synonymous substitution rates per 100 nucleotide sites in housekeeping genes between E. albertii and E. coli-Shigella range from a low of 10.97 for icdA to a high of 37.99 for cyaA and average 25.34, compared to an average of 5.21 ± 0.26 (mean and standard deviation) within E. coli-Shigella. Because there is substantial divergence at every locus, there is no evidence for recent recombination or gene exchange events between the E. albertii lineage and the E. coli population. Nonsynonymous substitution rates are much lower in each comparison, indicating conservative selection of housekeeping loci and allowing us to measure sequence divergence independently of adaptive evolution (Table 2). It is noteworthy that the average divergence between E. albertii and E. coli represents about 40% of the evolutionary distance from E. coli to S. enterica. |
View this table: [in a new window] |
TABLE 2. Pairwise sequence divergence among strains of the E. albertii-Shigella B13 lineage (Ea), E. coli (Ec), and S. enterica (Se)a
|
0.04% (3 of 7,470) of nucleotide sites and
0.08% (2 of 2,490) of amino acid sites. The remaining two sequence types each represent their own branch; the E. albertii strains all belong to one, and Shigella B7-related strain K-1 belongs to the other. In pairwise comparisons, the E. albertii group varies from the Shigella B13 group at
0.6% of nucleotide sites and
0.01% of amino acid sites and from the Shigella B7-related strain at
0.7% of nucleotide sites and
0.08% of amino acid sites. The Shigella B7-related strain differs from the Shigella B13 group by
0.7% of nucleotide sites and
0.1% of amino acid sites (Fig. 1A). 16S rRNA gene sequencing. To assess further the genetic relatedness of the E. albertii and Shigella B13 strains, we sequenced and analyzed part of the 16S rRNA gene of all members of the E. albertii lineage and atypical Shigella B13 strains. There were 1.8% (27 of 1,464) variable sites within the lineage and five distinct sequence types divided into groups corresponding to MLST results: two groups of atypical Shigella B13 strains, one group of E. albertii strains, one group of Shigella B13 strains, and one group including the Shigella B7-related strain. In pairwise comparisons within the E. albertii lineage, the Shigella B13 and B7-related strains were the most similar (99.86%), and the Shigella B7-related and atypical Shigella B13 strains were the least similar (98.32%).
eae sequencing. The close evolutionary relationships between E. albertii and Shigella B13 strains predicted that these bacteria should share additional chromosomal genes. We tested this prediction by attempting to detect, amplify, and sequence eae homologues in Shigella B13 and related Shigella strains of the E. albertii lineage. All strains tested were amplified, and an analysis of intimin (eae) sequences in six strains revealed three distinct eae variants, one in E. albertii strains, one in the Shigella B13 subcluster, and one in the Shigella B7-related strain. The central conserved domain of eae in six strains of the E. albertii lineage were most similar to each other, with the iota2 allele (GenBank accession number AF530553) being the most closely related E. coli allele. The C-terminal region of the intimin gene (Int280 region), in contrast to the central conserved region, which varied at 1.2% of nucleotide sites and 0.9% of amino acid sites within this lineage, differed at 37.6% of nucleotide sites and 42.3% of amino acid sites (Table 3). The Int280 region in strains of the Shigella B13 subcluster all have the same sequence, which was most similar to the iota2 allele sequence; E. albertii and Shigella B7-related strains, although possessing different alleles, are most closely related to each other, with the alpha and zeta alleles being the next most similar alleles (Fig. 2).
|
View this table: [in a new window] |
TABLE 3. Sequence divergence and properties of the domain and subunits of the intimin alleles (eae) and CDT gene (cdt) of the E. albertii-Shigella B13 lineagea
|
![]() View larger version (51K): [in a new window] |
FIG. 2. Intimin variants of the E. albertii-Shigella B13 lineage. (A) A neighbor-joining tree (MEGA2) based on nucleotide sequences of the eae central conserved region of intimins of E. coli shows that the sequences of the intimin gene homologues of E.albertii-Shigella B13 strains are most similar to the sequence of the E. coli iota2 allele. (B) Haplotype plots of the nucleotide differences in the 3' part of the eae gene encoding the Int280 region. The sequences of E. albertii and Shigella B7 strains are similar to the sequence of the iota2 allele of E. coli in the central conserved region but are most similar to the sequences of the alpha and zeta alleles of E. coli in the Int280 region. (C) Percent amino acid similarities among representative intimin variants. The lower triangular matrix shows pairwise similarity values for the central conserved region, and the upper triangular matrix shows values for the Int280 region. The shading highlights close similarities to the intimin domains of the E. albertii-Shigella B13 lineage (tan). C.rod, C. rodentium.
|
![]() View larger version (46K): [in a new window] |
FIG. 3. Multiple sequence alignment of CdtBs from representatives of the five described E. coli alleles, CDT I (44), CDT II (37), CDT III (36), CDT IV (49), and CDT V (4, 22), and representatives of three distinct alleles of the E. albertii lineage (B13, E.alb., and B7). At amino acid position 41, shown here in a box, Shigella B13 strains are predicted to translate a stop codon rather than a tryptophan codon due to a single G-to-A transition at the second codon position. Conserved residues thought to be important for enzymatic activity (11, 35) are highlighted in gray. Arrowheads indicate residues that were mutated and shown to completely or partially abolish the cell cycle arrest activity of CDT (11, 35).
|
![]() View larger version (129K): [in a new window] |
FIG. 4. Cytopathic effects of bacterial cell extracts on HeLa cell monolayers. Cytoplasmic distension and nuclear abnormalities were apparent 72 h after HeLa cells were exposed to E. albertii and Shigella B7-related strains (A), whereas Shigella B13 strains did not appear to produce active toxin (B). Included for comparison are CDT-negative E. coli K-12 (C) and CDT-positive E. coli 493/89 (D). Magnification, x40.
|
|
View this table: [in a new window] |
TABLE 4. Presence or absence of E. coli and Shigella virulence genes in the E. albertii-Shigella B13 lineage as detected by multilocus virulence gene profilinga
|
|
|
|---|
It has been recognized for nearly 50 years that Shigella B13 is highly divergent from other shigellae and has been classified incorrectly (5, 14, 31, 40). For example, in a DNA relatedness study, Shigella B13 was the only known group within the E. coli-Shigella complex that could be separated with certainty (5). Later studies based on sequence analysis of multiple housekeeping genes confirmed that Shigella B13 is highly divergent from E. coli and other Shigella serotypes (40). The key characteristic of typical Shigella strains is invasiveness due to the presence of a large invasion plasmid, pINV (16). Although Shigella B13 strains were originally considered to be Shigella, there has been no evidence that they are invasive, nor do they carry the known virulence genes present on pINV. Our analyses also failed to PCR amplify pINV genes in any of the Shigella B13 strains or E. albertii strains examined by multilocus virulence gene profiling.
We conclude, based on evolutionary analyses of multiple housekeeping genes, the 16S rRNA gene, and putative virulence genes, that E. albertii and Shigella B13 strains are monophyletic and represent a distinct lineage of enteric pathogens that separated from an ancient Escherichia ancestor after E. coli and S. enterica had diverged from a common ancestor. On the basis of the rate of synonymous substitutions for E. coli and S. enterica (4.7 x 109 per site per year) (42), the lineage shared a common ancestor with E. coli
28 million years ago. The molecular evolutionary analysis supports the proposal that certain strains, such as nontypeable strain 3097-02 and Shigella B7-related strain K-1, should also be included in the new species, E. albertii.
Although most Shigella B13 strains belong to the E. albertii lineage, certain E. coli-Shigella-like strains (atypical Shigella B13 strains) also express the Shigella B13 antigen, suggesting there has been either horizontal transfer of genes responsible for this phenotype or convergence to a similar antigenic phenotype. Atypical Shigella B13 strains 3556-77 and 3557-77 were also detected as intermediate between E. coli and Shigella B13 strains by DNA relatedness, with levels similar to those found between E. coli and Shigella B13 strain but lower than those found within other Shigella B13 strains (5). The chemical and genetic organization of the O antigen of Shigella B13 was recently described; based on nucleotide sequence data, the authors concluded that the Shigella B13 rfb gene cluster was recently assembled and has also undergone recent evolutionary change (15). The inclusion of nontypeable and Shigella B7-related strains in the E. albertii lineage and the evidence for change in the genes responsible for antigenic variations suggest that the lineage may be very diverse, but that diversity has probably been seriously underestimated because of sampling bias.
To explore putative virulence factors present in the E. albertii lineage, we described two genes that E. albertii strains are known to express, the intimin (eae) (3) and CDT (cdt) (38) genes, and screened the strains for the presence of other known E. coli and Shigella virulence factors. Phylogenetic analyses of virulence genes revealed three subclusters within the E. albertii lineage, consistent with phylogenetic analyses of multiple housekeeping genes, and supported the conclusion that these strains belong to an evolutionary branch of enteric pathogens that have further diverged into three distinct groups.
The intimin gene (eae), which was found to be present in all members of the E. albertii lineage by virulence gene profiling, was sequenced in representatives of the E. albertii subclusters. Phylogenetic analyses indicated that the sequences of Shigella B13 strains, which are identical, are similar to the sequence of the iota2 allele of pathogenic E. coli. The sequences of strains 19982 (E. albertii) and K-1 (Shigella B7 related) are similar to the sequence of the iota2 allele in the central conserved region but have different alleles most similar to the alpha and zeta alleles of E. coli in the Int280 region.
CDT activity does not appear to be required for the virulence of E. albertii and its relatives. In two Shigella B13 strains (3553-77 and 3555-77), originally isolated from human patients, the cdtB gene was not amplified by PCR with degenerate primers; the remainder of the Shigella B13 strains contained a putative stop codon and lacked CDT activity. The silencing of CDT expression suggests that it does not have a required role in the virulence of these organisms. The short evolutionary distance between the active E. albertii cdt alleles and the inactive Shigella B13 cdt alleles suggests that the inactivation was a recent evolutionary event.
Both virulence genes examined have a lower GC content and a lower codon adaptation index, as measured by the rate of amelioration of GC content at the third codon positions, than do housekeeping genes (Table 3). The low codon adaptation index suggests recent acquisition of the virulence genes by lateral gene transfer. In particular, cdt sequences have a very low codon adaptation index. Levels of polymorphism within the E. albertii lineage are similar between the virulence and housekeeping genes, with the exception of the 3' Int280 region of eae. The sequence diversity of this extracellular region in E. coli strains is thought to be under positive selection (33), and the sequence diversity in the E. albertii lineage supports the hypothesis that in the E. albertii lineage, as in E. coli, positive selection drives sequence divergence in this region.
To facilitate the economical screening of isolates, a diagnostic multiplex PCR was designed to identify bacteria of the E. albertii lineage based on sequence variations in housekeeping genes. This method of classification is independent of biochemical and antigenic phenotypes and therefore may provide a useful starting point for discovering genetic and antigenic diversity within the lineage. Infection by members of the E. albertii lineage may contribute to unexplained diarrhea, and molecular markers for the rapid detection of the new lineage will be crucial to further elucidating the role of these novel organisms in enteric disease.
This project was funded in part with an American Society for Microbiology undergraduate research fellowship to K.E.H. and in part with federal funds from the NIAID, NIH, and DHHS under NIH research contract N01-AI-30058.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»