Previous Article | Next Article ![]()
Journal of Bacteriology, May 2002, p. 2620-2625, Vol. 184, No. 10
0021-9193/02/$04.00+0 DOI: 10.1128/JB.184.10.2620-2625.2002
Copyright © 2002, American Society for Microbiology. All Rights Reserved.
Sandy Huskic, Adam Cisterne, Deborah Rothemund, and Peter R. Reeves*
Department of Microbiology, The University of Sydney, Sydney, New South Wales 2006, Australia
Received 23 October 2001/ Accepted 18 February 2002
|
|
|---|
|
|
|---|
The O antigen contributes major antigenic variability to the cell surface, and on the basis of this antigenic variation, 166 O forms have been recognized in E. coli (not including Shigella strains). The surface O antigen is subject to intense selection by the host immune system, which may account for maintenance of the many different O-antigen forms within species such as E. coli. The genes specific to O-antigen synthesis in E. coli are commonly clustered adjacent to the gnd gene between the colanic acid (CA) and his operons (28).
Lateral transfer of large DNA segments is thought to have played an important role in the evolution of bacterial pathogens. The evidence is usually the presence of genes with atypical GC content and/or the distribution of pathogenicity islands or other gene clusters. We, among others, have undertaken extensive studies on O-antigen genes by sequencing and identifying the O-antigen genes, mostly in Salmonella enterica and E. coli, and found evidence for lateral gene transfer at all levels of O-antigen variation (see Reeves [27] for a review). There is evidence for gene transfer in assembly of O-antigen gene clusters (5, 45) and also for transfer of O-antigen gene clusters between clones of a species involving homologous recombination in adjacent genes (35). Finally there is evidence for interspecies transfer of the entire O-antigen gene cluster from Plesiomonas shigelloides to E. coli (31). Tarr et al. (34) showed by sequence comparison that the O157 gene cluster and adjacent gnd gene of the O157:H7 clone cotransferred into an E. coli O55:H7 organism to generate the O157:H7 clone.
It is thought that E. coli and S. enterica diverged from a common ancestor about 140 million years ago (21, 22). It is noteworthy that only three forms of O antigen are common to both species, and in E. coli all three (O55, O111, and O157) are associated with enteropathogenic (and sometimes enterohemorrhagic) E. coli strains. O55 and O111 are the only two colitose-containing O antigens in E. coli. For E. coli O111 and identical S. enterica O35, the organization and sequences of the gene clusters support their derivation from a gene cluster in the common ancestral species, but data are not available for the other cases.
To better understand the genetics of the O55 antigen and the genetic basis of the shift from O55 to O157 in the evolution of E. coli O157:H7, we sequenced the O55 antigen genes and flanking sequence. The O55 O unit is atypical in that two of five colitose biosynthesis pathway genes are located downstream of the gnd gene and a newly described UDP-GlcNAc epimerase (gne) gene is upstream of galF, suggesting formation by addition of genes adjacent to an ancestral gene cluster.
|
|
|---|
Construction of random DNase I bank for sequencing DNA fragments. Chromosomal DNA used as the template for PCR was prepared by using Wizard DNA preparation kits from Promega. Long PCR was carried out using the Expand Long Template PCR system from Boehringer, and products were subjected to DNase I digestion and cloned into pGEM-T to make banks for sequencing by the method described previously (39). Products of 12 individual PCRs were pooled to make each bank in order to limit the effect of PCR errors.
Sequencing and analysis. A total of 27,730 bp of DNA sequence, from the end of the CA gene cluster to hisG, was obtained from the O55:H7 strain (Fig. 1) in three overlapping segments. We sequenced the galF to gnd region and from gnd to the distal end of the his operon regions using random DNase I banks constructed from DNA amplified by long PCR using primer pairs 1523 (5'-ATTGTGGCTGCAGGGATCAAAGAAATC) and 1524 [tag-TC(A,G)CGCTG(A,C,T,G)GCCTG(A,G)AT(C,T)ARGTT(A,C)GC] and 3380 (5'-GATATTGCAAACCTGCTGCTTGCTCCGTATTTC) and 3378 (5'-TATCCTCACCTGCTCAAGCGTTATCTCGACCAG), respectively (bases in parentheses for 1524 indicate redundancy). The region upstream of galF was PCR amplified using primers 3667 (5'-GGATTAATCACCATATTGT) and 3432 (5'-ATAAGAGGTGTCGAAGTG) and sequenced by primer walking.
![]() View larger version (24K): [in a new window] |
FIG. 1. Maps of the E. coli O55:H7 and E. coli O157 O-antigen genes (top and bottom) and respective flanking regions, including the gne gene (center). Differences between the DNA sequences are given for each gene in the flanking regions. Proposed recombination sites for transfer of the O157 gene cluster into an O55:H7 strain are indicated by vertical arrows. A, B, and C indicate the binding sites for primers used in PCR of gne (primers at sites A and B for detection of gne were inside the primers for cloning gne).
|
Cloning of the gne gene. The gne gene was PCR amplified from strain M1685 using primers 3859 (5'-ATATAGAGCTCATGAACGATAACGTTTTGCTC) and 3860 (5'-CGGGATCCTTACTCAGACAAAAATGCTAT), which bind to the 5' and 3' ends, respectively, of the gne gene (shown as A and B in Fig. 1) and have SacI and BamHI restriction sites, respectively, incorporated at their 5' ends. The PCR product was cloned into the SacI and BamHI sites of pTRC99A (from Pharmacia) to make plasmid pPR2062. In plasmid pPR2062, the cloned gne gene is under the control of a trc promoter, which is repressed by the LacIq protein encoded by a gene in the same plasmid. We used 2.5 mM IPTG (isopropylthiogalactopyranoside) to induce expression of the cloned gene.
Deletion of gne gene from O55:H7 and O157:H7 strains. The gne genes of both strains were replaced by a chloramphenicol acetyltransferase (CAT) gene using the RED recombination system of phage lambda (6, 46). The CAT gene was PCR amplified from plasmid pKK232-8 (Pharmacia) using primers binding to the 5' and 3' ends of the gene, with each primer carrying 36 bp based on the O55:H7 DNA which flanks gne. The PCR product was transformed into M1685 and M2136 carrying pKD20, and chloramphenicol-resistant transformants were selected after induction of the RED genes according to the protocol described by Datsenko and Wanner (6). PCR using primers specific to the CAT gene and O55:H7 or O157:H7 DNA flanking the gne gene was carried out to confirm the replacement.
Assay for UDP-GlcNAc epimerase. We used the assay first described by Glaser (12) and recently used by others (4, 8) for UDP-GlcNAc epimerase. UDP-GalNAc is used as the substrate, and after removal of the UDP moiety by acid hydrolysis, the product is measured by the Morgen-Elson reaction (23), in which GlcNAc yields a threefold-higher color reading than GalNAc. Reactions were performed at 37°C for 10 min with a total volume of 0.5 ml (pH 9.0) which contains 10 mM glycine, 1 mM MgCl2, 0.1 mM EDTA, 0.1 mM UDP-GalNAc, and 50 µl of cell extract. The reaction was stopped by addition of 1 µl of 10 M HCl to bring the pH to 2.0, followed by incubation at 100°C for 20 min for hydrolysis. After neutralization with 1.25 µl of 10 M NaOH, 0.05 ml of freshly prepared 1.5% (vol/vol) acetic anhydride in acetone was added. After 5 min at room temperature, 0.15 ml of a 0.7 M potassium tetraborate solution was added, and the mixture was boiled immediately for 3 min. After cooling, 0.3 ml of DMAB reagent (30) was added without shaking, followed by addition of 2.7 ml of glacial acetic acid. After incubation at 37°C for 20 min, the A585 was recorded. The assay was done in duplicate for each sample, and standard curves, prepared by using UDP-GlcNAc and UDP-GalNAc subjected to acid hydrolysis under the same conditions, were used to estimate the concentration of UDP-GlcNAc.
Other methods. Membrane preparation, sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE), and silver staining of lipopolysaccharide (LPS) for visualizing LPS were carried out as described by Wang and Reeves (38). Preparation of cell extracts and total protein determination were carried out as described by Estrela et al. (8).
Gene nomenclature. We have used gene names based on principles described previously (28). In the case of wbdJ and wbdK, the name is expected to change when the GDP-colitose biosynthesis pathway is better understood and the function order is known. Any such change will be reported on the Bacterial Polysaccharide Genes Database (BPGD) website (http://www.angis.su.oz.au/BacPolGenes/welcome/html).
Nucleotide sequence accession numbers. The O55:H7 amn sequence and the sequence covering wcaM to hisG have been deposited in GenBank under accession numbers AF461121 and AF461122, respectively.
|
|
|---|
O55 O-antigen genes. The structure of the O55 O unit is known (Fig. 2), and we expect genes for the synthesis of GDP-colitose, an O-antigen flippase gene (wzx), an O-antigen polymerase gene (wzy), an O-antigen chain length determinant gene (wzz), and transferase genes for two galactose residues, N-acetyl-galactosamine and colitose. Note that O55 antigen synthesis is initiated by transfer of GlcNAc-1-phosphate (GlcNAc-1-P) to undecaprenyl-1-P by WecA , encoded in the enterobacterial common antigen gene cluster (1, 18).
|
View larger version (7K): [in a new window] |
FIG. 2. Structure of the E. coli O55 antigen (17). Gal, galactose; Col, colitose.
|
|
View this table: [in a new window] |
TABLE 1. Summary of O55 antigen genes
|
(1-3) linkage to GlcNAc (13), and we suggest that WbgM is the transferase for the
(1-3) galactosyl linkage to GalNAc. WbgN shows 23% identity (47% similarity) with the FUT2 protein, a human secretor blood group fucosyltransferase forming an
(1-2) linkage to ß-galactose (15). WbgN may well form the
-colitose-(1-2)-ß-galactose linkage. WbgO and WbgP show similarity with many other putative bacterial polysaccharide transferases, and they are likely to be the remaining two transferases. O-antigen processing genes. The wzz gene was easily identified, as it is 98% identical to that of E. coli K-12 (GenBank entry AE000294) and is in the usual location between gnd and hisI. A presumptive wzx gene was identified as encoding an integral inner membrane protein with 12 predicted transmembrane segments and confirmed by a motif search using the method described by Jiang et al. (14). The gene we consider to be wzy encodes a protein with 10 predicted transmembrane segments and one loop of 52 amino acid residues, a characteristic topology for O-antigen polymerases (19). No motifs were shared by this protein and other known Wzy proteins. However, given that this gene, wzx, and wzy are the only genes with inferred products having multiple predicted transmembrane segments and that it has the expected topology, we conclude that it is the wzy gene.
The gne gene. In E. coli K-12 and S. enterica LT2, wcaL, the last gene of the colanic acid gene cluster, is separated by one gene of unknown function from galF (32). In the O55:H7 strain, there is an additional gene upstream of galF, also found in O157:H7, that shows 48% identity to an Edwardsiella ictaluri gene of unknown function (GenBank AAL25633) and 22% or lower identity with a range of putative or characterized UDP-galactose 4-epimerase genes. We suspected that this gene might encode a UDP-GlcNAc 4-epimerase, responsible for conversion of UDP-GlcNAc to UDP-GalNAc, as GalNAc is present in both the O55 and O157 antigens. Deletion of the gene from either an O55 or O157 strain (to make strains M2313 and M2311, respectively) led to loss of O-antigen production, which was restored when plasmid pPR2062 was present (Fig. 3). We also showed that strain M2313 was devoid of UDP-GlcNAc 4-epimerase activity, while the parent strain (M1685) and M2313 carrying plasmid pPR2062 (M2318) both had the function (Table 2). E. coli K-12 strain P4971 was negative for the epimerase activity but positive after transfer of plasmid pPR2062 (data not shown). The gene upstream of galF is clearly a UDP-GlcNAc 4-epimerase gene essential for synthesis of O55 and O157 O antigens, and it was named gne.
![]() View larger version (95K): [in a new window] |
FIG. 3. Requirement of the gne gene for expression of the E. coli O55 and O157 antigens. Membrane extracts were run on SDS-PAGE gels and stained by silver staining. Lanes: A, M1685 (wild-type O55:H7); B, M2313 (M1685 missing the gne gene); C, M2313 carrying plasmid pPR2062; D, M2136 (wild-type O157:H7); E, M2311 (M2136 missing the gne gene); and F, M2311 carrying plasmid pPR2062.
|
|
View this table: [in a new window] |
TABLE 2. UDP-GlcNAc 4-epimerase activity in cell extracts of E. coli strains
|
In summary, four putative transferase genes, five GDP-colitose synthesis genes, a gne gene, an O-antigen polymerase gene, a flippase gene, and a chain length determinant gene were identified. They account for all the genes needed for the synthesis and processing of the O55 O unit. In addition, an H repeat and remnant gmm gene were found, as discussed below.
gne gene of E. coli O55 and O157 is present in many other E. coli strains. There are 62 E. coli O-antigen forms with reported structures, of which 22 include GalNAc. We carried out PCR on the type strains for the 62 O antigens using primers based on the E. coli O55 and O157 gne genes (5'-ACAGATTGGTGATGTTCG and 5'-ATCAAAGCAATATCCACC, indicated in Fig. 1 by arrows A and B). Fourteen of the 22 strains with GalNAc-containing structures gave a positive result, whereas only 4 of the other 40 strains were positive. PCR with one primer in gne and the other in galF (indicated by C in Fig. 1) gave a positive result for 12 of the 14 previously positive strains and for two additional strains with GalNAc-containing structures. The 16 strains positive in one or both experiments must have the gne gene found in the O55 and O157 strains, and of these 14 were confirmed to be at the same site upstream of galF. PCR with the same 22 strains and primers appropriate to the O113 gne gene previously described by Paton et al. (25) revealed no additional strains carrying this gene. It seems that the form of the gne gene found in O55 and O157 is the most common in E. coli.
The presence of gne in some strains with O antigens reported to lack GalNAc is not surprising, as a strain carrying a gne gene as part of its required O-antigen gene set would retain the gene if its O antigen were replaced by homologous recombination involving recombination within galF, even if the incoming O antigen lacked GalNAc.
Origins of the O55 gene cluster. The O55 gene cluster is atypical in that while most of its O-antigen genes are in the usual O-antigen gene cluster site between galF and gnd, two of the GDP-colitose pathway genes (wbdJ and wbdK) and the gne gene are outside of, although close to, this region.
In the E. coli O111 gene cluster, the GDP-colitose pathway genes are contiguous in the order gmd, gmm, manC, manB, wbdJ, and wbdK. Gmm, a GDP-mannose mannosyl hydrolase (11), would remove GDP-mannose from the pathway and has no obvious role. However, gmm is treated as part of the GDP-sugar pathway, as it has been found only in association with GDP-fucose, GDP-colitose, or GDP-perosamine pathway genes (37, 39, 40).
The O55 gene cluster has a 620-bp remnant gmm gene between gmd and manC (57% identity to the O157 gene at the amino acid level). The presence of only part of gmm indicates a deletion in the O55 gene cluster. The simplest hypothesis is that an ancestral gene cluster included a pathway for a GDP-sugar derived from 4-keto-6-deoxy-GDP-mannose. One can envisage such an ancestral gene cluster's gaining the ability to synthesize GDP-colitose by incorporation of genes wbdJ and wbdK by lateral transfer. If colitose were incorporated into the O antigen, the change in O antigen could have been beneficial in specific circumstances and selected.
It would not be necessary for the additional genes to be close to the main gene cluster for synthesis of colitose. However, Lawrence and Roth (16) have pointed out that close proximity of genes is important if a biosynthetic pathway is to be readily transferred by homologous recombination and proposed that selection for transferability may drive operon formation and maintenance. O-antigen gene clusters are subject to high levels of lateral transfer within E. coli and S. enterica, and the situation observed for O55 represents what one would expect as an intermediate in gene cluster assembly. The current locations of wbdJ and wbdK, probably mediated by the H repeat, as proposed by Tarr et al. (34), would suffice to enable cotransfer with the other O55 genes.
GDP-4-keto-6-deoxy-mannose, the product of Gmd action, is a branch point for synthesis of GDP-fucose, GDP-perosamine, GDP-colitose, and GDP-D-rhamnose. The post-gmd part of the original GMD-sugar pathway has been lost, presumably in the same event that deleted part of gmm and presumably also after gain of the two genes that complete the GDP-colitose pathway. The deletion event also could be driven by selection, as colitose and the original sugar would confer different antigenic specificities on the O antigen and the presence of two specificities may be undesirable. The sequence of events is speculative, but such processes are the most likely means by which the remarkable diversity of the O antigens was generated, and the O55 gene cluster has the hallmarks of an intermediate form, with all genes assembled in close proximity but not yet fully integrated. The O55 antigen is identical to the O50 antigen of S. enterica, and it is possible that assembly occurred in the common ancestor of the two species, as postulated (40) for the E. coli O111 and S. enterica O35 antigens, but this can only be assessed by sequencing the S. enterica O50 gene cluster.
wbgN is thought to be the colitose transferase because of its similarity to a transferase for the related sugar fucose. It is between manB and wzy, which are probably part of the ancestral gene cluster, so wbgN most likely evolved as the transferase using the original GDP-sugar as the substrate, but had sufficient cross specificity to function with GDP-colitose.
The gne gene is also an essential part of the O55 gene cluster, but located upstream of the traditional O-antigen locus between galF and gnd. The portion of the chromosome that transfers O55 synthesis between lineages is presumably the gne to wbdJ segment, although, as we saw for O157 of the O157:H7 clone, this can be reduced if the recipient includes a gne gene. However, we do not see it as useful to give a specific name to such an extended group of genes, as proposed by Tarr et al. (34).
Transfer of O157 gene cluster to an O55:H7 strain to generate the O157:H7 clone. It is proposed that the O157:H7 clone was derived from the O55:H7 clone by replacement of the O-antigen genes (10, 34). We sequenced DNA flanking the O-antigen genes to seek confirmation of this proposal. A convincing recombination site was found within the galF gene, but the other is more distant, between hisG and amn (about 35 kb apart in E. coli K-12 [3]). Genes located outside of the two recombination points are almost identical in the two clones (Fig. 1), gne, amn, and half of galF fitting the expectation for housekeeping genes in clones that are nearly identical by multilocus enzyme electrophoresis. The housekeeping genes between the two points are mostly from 95.1 to 98% identical in the two clones, showing a much higher level of sequence difference. Our data thus provide some detail of the recombination event proposed for the origin of the O157:H7 clone. The difference for the his, wzz, and ugd genes represents the level of divergence between the O157 donor strain and the O55:H7 clone and is within the range expected for unrelated E. coli clones (42).
The divergence for the gnd gene and the 3' half of the galF gene is higher than found for other housekeeping genes within the transferred segment (Fig. 1). The gnd gene is known to be more variable than housekeeping genes in general and this is thought to be due to proximity to the O-antigen gene cluster and maintenance of a large number of O-antigen forms (35). The galF gene is also highly variable (R. Lan, D. M. Ryan, P. BouAntoun, and P. R. Reeves, unpublished data), and the same arguments apply.
It should be noted that the genes common to the O55 and O157 gene clusters, manB, manC, gmd, wzx, wzy, and part of gmm, have substantially divergent sequences and were clearly not involved in the recent recombination event.
General conclusions. The O55 gene cluster is particularly interesting in that its origin by addition to and loss of genes from an earlier gene cluster is quite clear. The changes to give the GDP-colitose pathway presumably occurred under selection for replacement of the original O antigen by the then-novel O55 antigen. For this to occur, there is no necessity for the additional genes to be located near the other O-antigen genes, and this is most unlikely to have been the case when the two groups of genes first occurred within one cell. In addition to wbdJ and wbdK for GDP-colitose synthesis, the gne gene for UDP-GalNAc synthesis is also outside of the main gene cluster, but again very close to it. In O55 we see what appears to be an intermediate stage in bringing all the genes into a single cluster, presumably a result of selection for intraspecies transfer of the O55 antigen gene cluster
Present address: Department of Microbiology, College of Life Science, Nankai University, Tianjin 300071, People's Republic of China. ![]()
|
|
|---|
(1,2)fucosyltransferase gene (FUT2). Homozygosity for an enzyme-inactivating nonsense mutation commonly correlates with the nonsecretor phenotype. J. Biol. Chem. 270:4640-4649.
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»