| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Meiying Yan,1,
Zhigang Cui,1
Xiaofen Ye,2
Baowei Diao,1
Yonghong Ren,2
Shouyi Gao,1
Liang Zhang,2 and
Biao Kan1*
State Key Laboratory for Infectious Disease Prevention and Control, National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, P.O. Box 5, Changping, Beijing 102206, People's Republic of China,1 National Engineering Research Center for Beijing Biochip Technology, 18 Life Science Parkway, Beijing 102206, People's Republic of China2
Received 25 December 2006/ Accepted 21 April 2007
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Of the more than 200 O-antigen serogroups of V. cholerae identified, only O1 and O139 are known to cause epidemic and pandemic cholera. The fifth and sixth pandemics (caused by a classical biotype of the O1 serogroup) were confirmed etiologically, and in 1961 the seventh and present pandemic began, caused by the O1 El Tor biotype (2). In 1992, a novel serogroup, O139, emerged and caused epidemic cholera in Bangladesh and India (1, 5); the O139 serogroup remains confined to Southeast Asia (49).
Cholera toxin is the major virulence factor of toxigenic V. cholerae (31), and colonization of the intestine is the first major step in Vibrio pathogenesis. The lysogenic infection of nontoxigenic V. cholerae strains by CTX
, which carries ctxAB, is an important mechanism for the emergence of new toxigenic strains (54). The toxin-coregulated pilus (TCP) is the receptor for CTX
and plays an important role in intestinal colonization by V. cholerae. The TCP is located in the V. cholerae pathogenicity island (VPI). Other genomic islands, including Vibrio seventh-pandemic island I (VSP-I) and VSP-II, are present in toxigenic O139 and El Tor strains of the seventh pandemic (16).
V. cholerae can coexist with a variety of zooplankton and phytoplankton (29), and such survival in aquatic systems is regarded as a prerequisite for cholera pandemics. A report showed that strains with virulence potential (from among the majority of genetically diverse nonpathogenic strains) can be enriched by passage through mammalian host intestines (20). Analysis of diverse V. cholerae strains isolated from different sources has revealed that the strains that can cause diarrhea are highly clonal and that they have evolved independently from different lineages of nontoxigenic environmental isolates. Additionally, it is known that the various nontoxigenic environmental lineages are composed of heterogeneous populations (20, 32, 50). Several studies have indicated that virulence genes disperse among different serogroups of nontoxigenic strains that are present in aquatic ecosystems and that these aquatic strains can be virulence gene reservoirs for toxigenic strains (20, 21, 42).
In order to understand the origins of toxigenic strains, the investigation of genomic differences among V. cholerae strains (especially nontoxigenic strains) will provide insight. The genome contents of toxigenic and nontoxigenic strains have been studied by genome sequencing or comparative genome hybridization (CGH); one nontoxigenic O1 strain and four nontoxigenic non-O1/non-O139 strains were included in these studies (16, 17). These investigations revealed that genomic contents are quite conservative in O1 and O139 toxigenic strains (including a nontoxigenic TCP+ strain) (16), whereas four pathogenic non-O1/non-O139 strains diverge from O1 and O139 strains (17).
Serogroup transformation between O1 and non-O1 may not happen frequently in the environment (20); hence, genomic analysis among nontoxigenic serogroup O1 and O139 strains may provide further insight into the origins of the seventh-pandemic strains. Until now, only one nontoxigenic O1 strain was analyzed by CGH (16). In the present study, we constructed microarrays based on the genome sequence of El Tor strain N16961 to investigate the genomic difference of V. cholerae and we used seven nontoxigenic O1 strains and three O139 strains. We found that the nontoxigenic O1 and O139 strains are demonstrably different from toxigenic strains. We speculate that the nontoxigenic strains carrying different tcpA alleles may represent different intermediate strains in the origination of toxigenic strains which would be different from extant toxigenic clones.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
Construction of microarrays. DNA microarrays were constructed on the basis of the genomic content of N16961 (GenBank accession no. NC002505 and NC002506). An in-house high-throughput computer algorithm was used to design PCR primers for all open reading frames (ORFs). The fundamental rules of our computer algorithm dictates that all primers have a 61°C annealing temperature, PCR product lengths should be 200 to 1,000 bp, and amplicons should have minimum sequence similarity. PCR amplifications were performed in 100-µl reaction mixture volumes. A total of 3,687 ORFs were amplified successfully for N16961. Products were confirmed by agarose gel electrophoresis. PCR products were precipitated with isopropanol and redissolved in 50% dimethyl sulfoxide. For the 174 genes that were not successfully amplified by PCR, 70-mer sense oligonucleotides were synthesized. PCR products and long oligonucleotides (in 50% dimethyl sulfoxide, 20 µM) were spotted with a SmartArray Microarrayer (CapitalBio Corp., Beijing, China). Each gene was printed in triplicate to facilitate subsequent statistical analyses. After printing, the slides were baked at 80°C for 1 h and stored dry at room temperature until used. In all, the microarray contained 3,861 predicted ORFs representing 99.4% (3,861/3,885) of the genome of N16961, of which 2,758 ORFs are located on chromosome 1 and 1,103 are on chromosome 2.
Prior to hybridization, the slides were rehydrated over a 65°C water bath for 10 s and UV cross-linked at 250 mJ/cm2. The unimmobilized DNA was washed off with 0.5% sodium dodecyl sulfate (SDS) for 15 min at room temperature, and the SDS was removed by dipping the slides in anhydrous ethanol for 30 s. The slides were spin dried at 1,000 rpm for 2 min.
Hybridization. Genomic DNA was fragmented by Dpn restriction endonuclease digestion and purified with the PCR Clean-up NucleoSpin Extract kit (Macherey-Nagel, Germany). For each labeling reaction, 2 µg of digested DNA and 4 µg of a random nonamer were heated to 95°C for 3 min and snap cooled on ice. Reaction buffer (10x), deoxynucleoside triphosphates, and Cy5-dCTP or Cy3-dCTP (Amersham Pharmacia Biotech, Piscataway, NJ) at final concentrations of 120 µM each dATP, dGTP, and dTTP; 60 µM dCTP; and 40 µM Cy dye, respectively, were added. Klenow enzyme (1 µl; TaKaRa) was added, and the mixture was incubated at 37°C for 60 min. The labeled DNA was purified with the PCR Clean-up NucleoSpin Extract II kit (Macherey-Nagel) and resuspended in elution buffer. The optical density at 260 nm was read. Labeled control and test samples were quantitatively adjusted on the basis of the efficiency of Cy dye incorporation and mixed into 30 µl hybridization solution (3x SSC [1x SSC is 0.15 M NaCl plus 0.015 M sodium citrate], 0.2% SDS, 25% formamide, 5x Denhardt's solution). DNA in the hybridization solution was denatured at 95°C for 3 min prior to loading onto a microarray. The microarray was hybridized at 42°C overnight and washed at room temperature with two consecutive washing solutions (0.2% SDS-2x SSC at 42°C for 5 min and 0.2% SSC for 5 min).
Imaging and data analysis. Microarrays were scanned with a LuxScan scanner (CapitalBio Corp.), and images were analyzed with SpotData software (CapitalBio Corp.). Analysis of CGH results was also accomplished by determination of fluorescence intensity ratios via a space- and intensity-dependent normalization procedure based on a LOWESS program. For each experiment, each ORF was spotted in triplicate onto each slide.
Classification of ORFs as absent or present in test strains. For each strain, ORFs with ratios lower than 0.2 were classified as absent whereas those with a ratio greater than 0.6 were regarded as present. The presence or absence of an ORF with ratios between 0.2 and 0.6 was determined by PCR confirmation assays to validate cutoff values as follows. About 80% of such ORFs in each strain were selected for a PCR assay. The positive cutoff was the highest ratio in all PCR-negative ORFs, while the negative cutoff was the lowest ratio of the PCR-positive ORF. Between the positive and negative cutoff values, some ORFs were PCR positive and others were negative; the PCR results did not correspond exactly to the highness and lowness of ratios. Finally, every ORF with a ratio between the positive and negative cutoff values was assayed by PCR and classified according to its PCR result. The cutoff values in the various test strains differed from each other because of sequence divergence between strains. For oligonucleotide probe ORFs with ratios between positive and negative cutoff values, the absence or presence of the ORFs was decided according to the PCR results obtained with primers derived from flanking ORFs.
Comparative phylogenomic analysis of microarray data. Final absent or present calls for each gene were translated into a binary code (gene present in a given strain = 2, absent = 0.5) and analyzed by Cluster/TreeView. Microarray data were processed with a practical extraction and reporting language procedure, and trees based on the unweighted-pair group method using average linkages (UPGMA) were constructed with BioNumerics (version 3.0; Applied Maths BVBA, Belgium). The correlations between these trees were calculated with BioNumerics.
| RESULTS |
|---|
|
|
|---|
|
The large number of ORFs (112 to 599) missing from nontoxigenic strains is associated with a wide variety of diverse cellular functions. The variation in ORFs among the different V. cholerae strains that were classified with respect to functional categories is shown in Table 2. Of the 2,876 common genes, the conservation categories are protein and amino acid biosynthesis (100 and 99.9%, respectively) in toxigenic El Tor strains (Fig. 2). Genes in the category of mobile and extrachromosomal element function exhibited high diversity, followed by those for cellular processes and central intermediary metabolism function, especially in nontoxigenic strains. In the mobile and extrachromosomal element category, subcategories that contain phage- and transposon-related genes appeared to be the most variable, suggesting a role in the diversification of V. cholerae. Of the 33 ORFs in the cellular process category that exhibit differences between toxigenic and nontoxigenic strains, 29 are pathogenesis-related genes.
|
|
|
|
Considering the specific characteristics of the superintegron and its diversity, we analyzed variation in the superintegron and its impact on chromosome 2, as well as on the whole V. cholerae genome, by establishing phylogenetic trees with and without the superintegron. For this purpose, we constructed phylogenetic trees of the test strains based on the comparative hybridization data of the whole genome without the superintegron (I+IISI) (Fig. 5B), of chromosome 2 alone (Fig. 5C), of chromosome 2 without the superintegron (IISI) (Fig. 5A), and of the superintegron (SI) alone (Fig. 5D). The indexes of similarity between SI and I+IISI and between SI and IISI were found to be only 27.9 and 0%, respectively, indicating the instability of the superintegron and demonstrating its different evolutionary rate compared to the rest of genome. The similarity index also revealed frequent gene losses and gains in the superintegron, as well as recent evolution within this group of bacteria.
|
|
To further examine this phenomenon of missing ORFs, we sequenced the genome region corresponding to this missing fragment in FJ147 and ZJ0596. A new fragment was inserted between VC0495 and VC0513 (Fig. 7). BLAST analysis of this region revealed that it contained an IS element carrying a transposase with 100% identity to transposases in the N16961 genome; additionally, the flanking sequences suggest that this insertion was a random event.
|
Cluster I includes one environmental strain, 7743, that has a VPI region with the same tcpA gene as toxigenic El Tor strains. In several biochemical tests, including sensitivity to the V. cholerae typing phages, sorbitol and mannitol fermentation, and hemolysis activity, strain 7743 shows the same characteristics as toxigenic El Tor strains but differs from the nontoxigenic strains (data not shown). A total of 112 ORFs were absent from strain 7743. In comparison, on average, only 12 ORFs were absent from toxigenic El Tor strains and 273 ORFs were absent from clusterII and III nontoxigenic El Tor strains. Moreover, in the phylogenetic tree, 7743 is clustered together with toxigenic strains rather than with other nontoxigenic ones, and the cluster III strains diverge strongly from the epidemic strains (Fig. 4).
Aside from VPI, we also found sequence variations between cluster II and cluster III nontoxigenic strains in other genome regions. One gene cluster, VC0919 to VC0925, contains exopolysaccharide (EPS) biosynthesis genes. This region was absent from cluster III nontoxigenic El Tor strains but present in cluster II nontoxigenic El Tor strains by hybridization. However, amplicons with the same size as that of N16961 in cluster III nontoxigenic El Tor strains were obtained by using primers for the flanking ORFs of VC0919 to VC0925. Restriction enzyme fragment lengths of PCR products of this region revealed different patterns for cluster III nontoxigenic El Tor strains and N16961. Additionally, cluster III strains also varied among themselves. These data indicate that multiple new sequence alleles exist in this region. Since the EPS are involved in biofilm formation and motility, as well as in the secretion of hemolysin and CT, in V. cholerae (34), variance of these genes might suggest differences in EPS roles and environmental survival among nontoxigenic TCP+, TCP, and toxigenic El Tor strains.
Several other virulence-related genes, including ompU and pilA, also exhibited sequence variation within cluster I nontoxigenic strains compared to seventh-pandemic strain N16961. In the array analysis, these genes in cluster III strains were shown to be absent whereas adjacent genes were present. We sequenced the corresponding regions and found that the ompU sequences in cluster III strains diverged at the nucleotide level. Thus, the sequence identity was 66.4% (strains 19-22 and V0550, 73.7% amino acid sequence identity) and 74.1% (strain JS32, 79.5% amino acid sequence identity) with respect to N16961. OmpU is a pore-forming protein of the outer membrane of V. cholerae. Since V. cholerae lacking ompU has reduced tolerance to bile salts, it is possible that strains with variant ompU alleles differ in environmental survival ability (33). Similarly, pilA of cluster III strains was only 59.8% identical, at the nucleotide level (11 to 49% amino acid sequence identity only), to pilA of N16961. Because pilA contributes to the adherence and colonization abilities of bacteria (51, 18), such allelic variation may have significance for colonization. The DNA sequences of ompU and pilA of the cluster II strains and strain 7743 were identical to the sequence of N16961.
Another region where sequence variation was observed between cluster II and III nontoxigenic El Tor strains was the superintegron. Other differences were found. For instance, part of VPI-II (VC1773 to VC1788) is present in cluster III of nontoxigenic El Tor strains but absent from cluster II strains. Likewise, fragments of VC1748 to VC1753 and VCA1042 to VCA1044 are present in cluster II strains but absent from cluster III strains. VC1748 to VC1753 encode pqiA and five hypothetical proteins, while VCA1042 to VCA1044 encode the Ccm2-related protein and the TagE protein.
On the basis of the array constructed with the genome content of N16961, in all tested toxigenic and nontoxigenic El Tor strains, a total of 455 ORFs were missing from various El Tor strains. Of these, 31 are commonly found only in toxigenic El Tor strains. Thirty of these common genes are genomic island components, including VSP-I, CTX
, and the superintegron. The remaining ORF, VC2346, is annotated as encoding the Smp protein, but its function is unknown.
Comparison of genome contents of serogroup O139 strains. (i) Differences between the genome contents of toxigenic El Tor and O139 strains. It was likely that the serogroup O139 V. cholerae strains originated from an El Tor-like strain (53), and the results of our whole-genome content analysis support this idea (Fig. 4). Compared to toxigenic El Tor strains, 77 ORFs were absent or highly divergent in all toxigenic O139 strains and most of them are clustered in three genomic islands and in the gene cluster encoding the O antigen. The first island represents 83% of the gene cluster encoding enzymes involved in lipopolysaccharide antigen synthesis (Fig. 8). The second island spans the region from VC1761 to VC1787 and consists of two parts of VPI-II, including an ORF that encodes neuraminidase. Neuraminidase may increase host cell susceptibility to cholera toxin (23). Possible explanations for the reemergence of El Tor cholera in Bangladesh since 1994 (19) are the presence of VPI-II in the El Tor biotype and the deletion of most of VPI-II from O139 strains (30). The third island is the superintegron in which 19 ORFs are missing from toxigenic O139 strains. Herein we show that the genes missing from the superintegron of toxigenic O139 strains are almost the same as those of toxigenic El Tor strains isolated after 1993 (93284, 200106, V011506, ZJ0596, and FJ147). Moreover, the first three toxigenic El Tor strains are clustered more closely with toxigenic O139 strains in the phylogenetic tree based on the superintegron hybridization profile (Fig. 5D), suggesting a close relationship between toxigenic O139 and El Tor strains prevailing at that time.
|
(ii) Divergence of nontoxigenic O139 strains. In contrast to the conservation of toxigenic O139 strains, the nontoxigenic O139 strains showed more diversity in both the number and categories of absent ORFs (Table 2). These absent ORFs are dispersed on the two chromosomes. The toxigenic and nontoxigenic O139 strains have the same microarray hybridization patterns in the gene cluster involved in lipopolysaccharide antigen synthesis, confirming the genetic basis of the serogroup antigen of these strains (Fig. 8). Nonetheless, in the phylogenetic tree, the nontoxigenic O139 strains are scattered throughout the tree, indicative of the large genetic diversity of genomic content among these strains.
| DISCUSSION |
|---|
|
|
|---|
Conservation of chromosome 2 and impact of the superintegron on strain diversification. It is unclear why V. cholerae has two chromosomes. Sequence comparison of chromosomes 1 and 2 (strain N16961) indicates that chromosome 2 was originally a megaplasmid captured by an ancestral Vibrio species; perhaps chromosome 2 provides certain evolutionarily selective advantages for the bacteria (25). Our examination of multiple strains (including nontoxigenic O1 and O139 strains) indicates that the proportion of absent genes in chromosome 2 is higher than is the case in chromosome 1. Our analysis also shows that the majority of missing genes are located within the superintegron, a region of more than 200 ORFs. Phylogenetic trees of these strains, based on chromosome 2 but excluding the superintegron, reveal that the gene content of chromosome 2 is as conservative as that of chromosome 1. Therefore, our CGH data suggest that chromosome 2 (aside from the superintegron) is conservative in V. cholerae. The presumed reason that the phylogenetic trees for chromosomes 1 and 2 do not correspond strictly is that the superintegron provides variability in genome content.
Phylogenetic trees based on microarray hybridization results of chromosome 2 and the superintegron alone revealed evolutionary trends (shared or otherwise) between toxigenic O1 and O139 strains and between toxigenic O1 El Tor strains. The frequent gene gains and losses in the superintegron, characteristic of such integrons, represent recent genetic change and active gene transfer. It has been reported that microevolution is an ancient and perpetual event in Vibrionaceae resulting in the loss, gain, and duplication of genes (47), and our results generally support that observation. The structural diversification of the genomes of toxigenic El Tor strains spanning 45 years in this study favors this viewpoint.
The superintegron cassettes examined to date seemingly encode adaptive rather than indispensable functions (48). In the present study, the nontoxigenic strains exhibited greater diversity than the toxigenic strains in the superintegron region. The impact of integrons on bacterial evolution remains to be evaluated beyond their obvious role in the dissemination of antibiotic resistance genes. The variation of the superintegron and its impact on evolution of V. cholerae can be explained more accurately only after elucidation of the function of the genes inside the structure.
Microevolution among the seventh-pandemic toxigenic El Tor strains. The seventh cholera pandemic was caused by toxigenic El Tor strains, which were reported to be a conserved clone (16). In spite of the conservation observed in the 11 toxigenic El Tor strains, minor differences among these strains can be observed and have accumulated over time from 1961 to now. In the phylogenetic tree for chromosome 2 (Fig. 5), toxigenic strains can be clustered into different groups according to the year of isolation, reflecting ongoing but exiguous evolution.
VSP-I and VSP-II are the characteristic genome islands identified in the seventh-pandemic El Tor strains and toxigenic O139 strains (16, 44, 45). In this study, a toxigenic strain, D118, was found to lack VSP-II entirely. It is reported that two toxigenic El Tor strains isolated in the 1970s lacked both VSP-I and VSP-II or only VSP-II, and one toxigenic El Tor strain from 1937 lacked VSP-I, VSP-II, and VPI-2 (45). Such strains resemble the presumed progenitors of the seventh-pandemic El Tor strains (16, 45). Strain D118, used in this study, was isolated in 1961, the year the seventh pandemic began. In the phylogenetic tree based on the sequences of four housekeeping genes, D118 is in a lineage separate from that of other toxigenic El Tor strains (Fig. 6). Therefore, D118 may represent a strain or group of strains that were converted into the predominant clone by the acquisition of VSP-II.
However, two epidemic strains isolated in 2005, FJ147 and ZJ0596, were found to lack most of the ORFs in VSP-II, and these two strains caused epidemics spanning thousands of kilometers and lasting several months in southeastern China. Therefore, the function of VSP-II in bacterial pathogenesis, environmental survival, and epidemic instigation remained to be defined. Likely the seventh-pandemic strains are in continual evolution, and it is expected that they may evolve into new pathogenic clones through the acquisition of new virulence and/or adaptive factors.
Diversity of nontoxigenic strains and clonality of toxigenic strains in serogroups O1 and O139. Previous studies revealed high diversity in nontoxigenic strains, especially the non-O1 and non-O139 isolates, through molecular subtyping techniques including ribotyping, pulsed-field gel electrophoresis, housekeeping gene sequencing, and amplified fragment length polymorphism (20, 32, 45, 50). Herein we used microarray methods to investigate the genome contents of toxigenic and nontoxigenic O1 and O139 strains. Extensive genomic diversity among these strains was again found.
Comparative analyses of nearly 300 microbial species has demonstrated that multiple forces have shaped prokaryotic genomes during their dynamic evolution (22). The genomic diversity of toxigenic and nontoxigenic O1 and O139 V. cholerae strains reflects the same dynamism of these other prokaryote genomes.
Toxigenic O1 and O139 V. cholerae strains exhibit greater clonality than environmental non-O1/non-O139 strains (32, 45). Our study supports this observation by demonstrating that the genomic content of nontoxigenic V. cholerae strains is quite different from that of closely related toxigenic strains. It has been suggested that populations of bacteria typically consist of abundant and heterogeneous isolates that rarely cause disease but that a small number of groups of closely related strains (clones or lineages) are particularly associated with epidemics of disease (40). Serogroup O37 caused an outbreak in 1968 (58). IS1004 fingerprinting, housekeeping gene sequencing, and multilocus enzyme electrophoresis analysis revealed that epidemic O37 strains are phylogenetically clustered with O1 and O139 isolates and that they are especially close to O1 classical strains (3, 6, 45). It seems that, to date, cholera epidemics have been caused only by strains possessing an O1 serogroup core genome, despite the fact that O37 and O139 are non-O1 serogroups (36, 45).
A study in an area where cholera is endemic provided a model for understanding the selection of toxigenic strains from the genetically diverse nonpathogenic strains in the environment; such selection has been suggested to occur through enrichment in the mammalian host intestine (20). In areas where cholera is endemic, large numbers of toxigenic V. cholerae cells are present in the stool of patients which, when excreted into environmental waters, contribute in part to the expansion of cholera epidemics. A study indicated that by passing through the mammalian intestine, V. cholerae cells acquire a hyperinfectious state that may also contribute to the epidemic spread of cholera (41). We suppose that the rapid spread of toxigenic V. cholerae strains in areas where cholera is endemic makes the isolates in epidemics more conservative in clonality. In contrast, nontoxigenic strains of diverse lineages do not undergo such enrichment and exhibit extensive genetic diversity. It should also be considered whether the epidemic strains have advantages in maintenance and multiplication over nonpathogenic strains in aquatic environments.
Our CGH analysis revealed that the O139 and El Tor strains isolated in the coepidemic time period (10 more years since the emergence of O139 V. cholerae) exhibit superintegron missing-gene profiles essentially identical to those of El Tor strains isolated before the emergence of O139 strains. It has been proposed that serogroup O139 originated from an El Tor strain that obtained the O139 antigen biosynthesis and some other gene clusters (4, 52, 55). However, some portion of VPI-II is absent from all of the test toxigenic O139 strains isolated through the past years, and an additional five common missing genes in the superintegron in the toxigenic O139 strains have been documented (in comparison with all toxigenic El Tor strains). We speculate that these missing genes were lost after the replacement of the O-antigen-synthetic gene cluster.
Genomic analysis of toxigenic and environmental nontoxigenic O1, O139, and non-O1/non-O139 V. cholerae strains can provide clues to V. cholerae evolution and the origin of pathogenic strains. Strains that can trigger epidemics may acquire virulence genes successively (9, 10, 20, 45), rather than in a single horizontal gene transfer event. In this study, we classified the nontoxigenic O1 El Tor V. cholerae strains into three clusters, named TCP, TCPET, and TCPETvar strains. We speculate that the nontoxigenic TCPET and TCPETvar strains represent intermediate strains in different lineages evolving from nontoxigenic to toxigenic strains. The acquisition of ctxAB through the lysogenic conversion of CTX
may confer the ability to initiate epidemics upon those strains (54). The gene of pIIICTX (previously known as OrfU) of CTX
encodes a protein that can recognize TcpA as the receptor of phage infection (26); however, tcpA has different sequence variants (12, 24, 28, 42, 43, 46). Variable gene sequences of pIIICTX have also been documented (9, 11). The tcpA sequence of nontoxigenic strain 7743 (also known as IEM101) used in this study is identical to the sequence of seventh-pandemic El Tor strains, and the attB site for CTX
genome integration is also identical (57). Strain 7743 can be infected by CTXET
and thereby converted into a toxigenic strain (37, 38, 57). Moreover, rabbits immunized with 7743 and the live-vaccine candidates developed from 7743 can be protected against a challenge with classical or El Tor strains (37-39, 57). This protection demonstrates that strain 7743 has the same immunogenicity as toxigenic El Tor strains. Intact VPI, VPI-II, and TLC clusters are present in strain 7743, whereas VSP-I, VSP-II, and CTX
are absent. Taken together, these data suggest that it is possible that strain 7743 represents an intermediate in the origination of the seventh-pandemic clone.
It has been reported that the different TcpA alleles can serve as CTXET
and CTXcalc
receptors. However, a difference in the infection frequency of CTX
among these strains has been documented (11); perhaps there are barriers to the cross-infection with some CTX
s with different pIIICTX alleles and nontoxigenic strains with different tcpA alleles. Perhaps, in comparison with strain 7743, nontoxigenic TCPETvar strains may have a reduced ability to acquire CTXET
. Such strains may provide mechanistic clues about evolutionary events in the development new toxigenic strains. In other words, perhaps these strains (or similar strains) have the potential to become new dominant clones in future epidemics through the acquisition of CTX
with the corresponding pIIICTX allele.
Obvious divergence in virulence-related genes such as tcpA (12, 21, 24, 28, 42, 43, 46), the gene for pIIICTX (9, 11), and rstR (9, 14, 15) in CTX
has been found in different strains. Similarly, sequence variance in the ompU, pilA, and EPS cluster was also observed in nontoxigenic O1 V. cholerae in our study. It is inconvincible that this variation represents an accumulation of random mutations because the ORFs flanking those genes show little or no nucleotide variation.
V. cholerae can survive in aquatic environments and in host intestines. Different gene expression mechanisms are used (7, 27, 35, 41, 56) in these two niches. Besides the modulation of gene expression, in different environments, gene sequence variation may confer selective advantages on the strains in various environments. The hypervariability at the tcpA locus is thought to be generated through multiple horizontal transfer and recombination events, and tcpA diversity is presumed to reflect diversifying selection resulting from the host immune response, susceptibility to CTX
(11). The sequence variability in type IV pili of several other bacterial species has also been reported (8, 11). Therefore, we speculate that variation in the ompU, pilA, and EPS gene cluster may contribute to functional differences (phenotypes) in V. cholerae as the strains respond to selective pressure.
Conclusion.
Our analysis shows that nontoxigenic strains from serogroups O1 and O139 are a collection of heterogeneous clones that are part of the natural microflora of aquatic environments. Some strains carry virulence-related genes that differ in sequence from those of the seventh-pandemic El Tor strains; although these strains have no El Tor-type TCP, they may nonetheless have suitable receptors that allow infection by other types of CTX
. For example, the CTX
with pIIICTX differed from CTXET
and may infect the nontoxigenic strains with tcpA sequences distinct from that of the El Tor type. The possibility is proposed that these strains can serve as raw material for the development of (future) epidemic clones through acquisition of novel alleles of CTX
. These nontoxigenic O1 and O139 strains may serve as models to understand how environmental nontoxigenic strains become toxigenic strains via gene loss and gain. Array-based CGH provides the ability to rapidly examine the genomic contents of numerous bacterial isolates, thereby allowing determination of which genes are present and which are absent in nontoxigenic strains compared to toxigenic ones, a powerful tool. We proposed that more robust conclusions regarding the pathogenicity, environmental adaptation, and evolution of V. cholerae can be achieved by whole-genome sequencing and biological research on these nontoxigenic strains.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Published ahead of print on 27 April 2007. ![]()
B.P. and M.Y. contributed equally to this work. ![]()
| REFERENCES |
|---|
|
|
|---|
precursor and evidence for independent acquisition of distinct CTX
s by toxigenic Vibrio cholerae. J. Bacteriol. 182:5530-5538.
and the Vibrio pathogenicity island prophage in Vibrio mimicus: evidence for recent horizontal transfer between V. mimicus and V. cholerae. Infect. Immun. 68:1507-1513.
is infectious and encodes a novel repressor. J. Bacteriol. 181:6779-6787.
minor coat protein, can expand the host range of coliphage fd to include Vibrio cholerae. J. Bacteriol. 185:1037-1044.
infection. Vaccine 24:1749-1755.[CrossRef][Medline]
prophage in environmental strains of Vibrio cholerae. J. Bacteriol. 183:4737-4746.
infection. Vaccine 25:4046-4055.[CrossRef][Medline]
| |||||||