| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
,
Channing Laboratory, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts
Received 27 September 2007/ Accepted 28 October 2007
| ABSTRACT |
|---|
|
|
|---|
| TEXT |
|---|
|
|
|---|
The organisms of the normal intestinal microbiota establish long-term relationships with the host, where they must peacefully coexist and successfully compete with other members of the microbiota for nutrients and space. They must also withstand attacks from a variety of sources, including phages, antibacterial products from the host, and harmful products of other bacteria. Similar to the case for pathogens, the ability of these organisms to alter their surfaces would seem an obvious survival advantage in this ecosystem. B. fragilis was the first bacterium shown to produce an extensive number of surface capsular polysaccharides: eight phenotypically confirmed molecules from a single strain (15). The expression of these polysaccharides is phase variable, dictated by DNA inversion of the promoter upstream of each polysaccharide biosynthesis locus (15). A second intestinal Bacteroides species, B. thetaiotaomicron, contains the genetic material for the synthesis of seven capsular polysaccharides, four of which have invertible promoter regions (24). The inversion of each of the B. fragilis polysaccharide promoters is mediated by a single global DNA invertase, designated Mpi (6), which is a member of the serine family of site-specific recombinases (Ssrs). In contrast, inversion of the B. thetaiotaomicron polysaccharide promoters is likely mediated by individual tyrosine family site-specific recombinases (Tsrs) encoded by separate genes present just upstream of each of the invertible polysaccharide promoters (24). Therefore, although the mechanisms of inversion differ between the two species, the phenotypic outcome is the same: extensive and rapid surface diversity.
The synthesis of such a large number of capsular polysaccharides has not been demonstrated in any bacterial species outside the Bacteroidales order, and no other bacterial capsular polysaccharides are known to undergo phase variation by inversion of promoter regions. This study was undertaken to determine if these important genetic characteristics that allow such drastic surface variability are conserved among diverse Bacteroidales species or if the synthesis of multiple phase-variable polysaccharides is limited to Bacteroidales of a particular niche or possibly to only a few Bacteroidales species.
Currently, there are a significant number of Bacteroidales species for which there is a complete or partial genomic sequence available. We began these analyses by compiling a list of all species taxonomically categorized by NCBI within the Bacteroidetes phylum and for which there is a publicly available genomic sequence. Other Bacteroidetes species, not listed by NCBI but for which there is a genomic sequence publicly available, were subsequently added to the list.
We retrieved the 16S rRNA gene sequence for each member from those available at the Ribosomal Database Project II (RDP) (release 9.49, 3 April 2007) maintained by the Center for Microbial Ecology at Michigan State University (http://rdp.cme.msu.edu/). Many of these sequences contained a significant number of uncalled or ambiguous bases and were thus replaced where possible with cleaner sequence from the same species by using the RDP sequences as queries in nucleotide-nucleotide BLAST searches against either the GenBank database or a custom database prepared locally from the appropriate genomic sequence. The Tree Builder program provided by the RDP site was then used to create a phylogenetic tree using the Eubacteria alignment model and including the sequence of the Escherichia coli K-12 MG1655 16S rRNA gene (GenBank accession number U00096, bases 4164682 through 4166223) as an outgroup. Species which were phylogenetically misplaced, such as Bacteroides capsillosus (14), were removed from the list.
Inspection of the Bacteroidetes 16S rRNA gene cladogram reveals that members of the order Bacteroidales segregate to a distinct branch (Fig. 1). We obtained complete or partial genome sequence for three families of Bacteroidales: the Bacteroidaceae (six species), the Prevotellaceae (two species), and the Porphyromonadaceae (four species). These sequenced members include three oral species (P. gingivalis, T. forsythensis, and P. intermedia), one ruminal species (Prevotella ruminicola), and eight intestinal species (B. fragilis, B. thetaiotaomicron, Bacteroides vulgatus, Bacteroides caccae, Bacteroides ovatus, Bacteroides uniformis, P. distasonis, and Parabacteoides merdae). As shown in Fig. 1, the habitats of these organisms do not strictly segregate by phylogeny, as some members of the Porphyromonadacae reside in the intestine while others are found in the oral cavity.
|
The synthesis of diverse bacterial polysaccharides of all phyla of bacteria is mediated by conserved families of enzymes (for example, those involved in the synthesis of nucleotide-linked sugars, in the transfer of the initial monosaccharide of the polysaccharide repeating unit to a lipid carrier, and in the assembly of the polysaccharide and its transport to the cell surface). In addition, the genes involved in polysaccharide production are usually clustered in the genome, often in one operon or a few adjacent operons. We took advantage of these characteristics to locate putative polysaccharide biosynthesis loci in the genomes of the 12 Bacteroidales species.
We used the protein sequences of the B. fragilis 9343 polysaccharide biosynthesis proteins WcgS (BF1909), WcfY (BF1904), WcfS (BF1377), WcgX (BF1914), and UpaY (BF1367) as queries in protein-protein BLAST searches of our locally prepared databases. WcgS and WcfY are a putative nucleotide sugar dehydratase and a UDP-glucose dehydrogenase (7), respectively, which are involved in the formation of nucleotide-activated sugars from less complex precursors. WcfS and WcgX are representatives of two distinct families of initiating glycosyltransferases (7, 8), which link the first monosaccharide of a repeat unit to undecaprenyl phosphate as the first step in the assembly of the repeat unit. UpaY is a member of the UpxY family of putative polysaccharide regulatory proteins (8). Genes encoding UpxY products are the first gene of each of the eight characterized polysaccharide biosynthesis loci of B. fragilis (15) and of each of the seven polysaccharide biosynthesis loci of B. thetaiotaomicron (24).
For each species under investigation, the appropriate local database was searched using the indicated B. fragilis 9343 protein sequences as queries. Matches to any of the query proteins with a reasonably low E value (
10–4) were collected and sorted by genome position. These similarity searches detected a minimum of 6 and up to 16 distinct genetic regions, depending on the species (Table 1). DNA encompassing
10,000 bp on both sides of the match or cluster of matches was extracted from the genome sequence and translated to amino acid ORFs as before. These ORFs were then used in protein-protein BLAST searches against GenBank and were assigned a putative function based on the annotation of similar sequences in the database. Each region was further analyzed to determine if it exhibited the genetic characteristics of a polysaccharide biosynthesis locus, and the extent of DNA sequence analyzed was adjusted to ensure coverage of the locus. The genetic characteristics we used to define a locus included the presence of at least two putative glycosyltransferases and a putative polysaccharide flippase and polymerase or, alternatively, an ABC-type transporter involved in the export and assembly of polysaccharide. Regions containing these features, coupled with the presence of at least one ortholog of the proteins used to initially detect the region, were given a polysaccharide biosynthesis locus designation (Table 1).
|
The number of polysaccharide biosynthesis loci detected for each species correlates with the specific niche of each of the organisms in the mammalian alimentary tract. For example, we detected only two polysaccharide loci in the genomes of each of the three oral species. This is in contrast to the genomes of the intestinal species, which contain at least 5 (B. caccae) and up to 13 (P. distasonis) distinct polysaccharide biosynthesis loci. In contrast to those of the oral species, many of the genome sequences of the intestinal species are incomplete, so the actual number of polysaccharide loci in these species may be even greater. For the one ruminal species, P. ruminicola, we detected four polysaccharide biosynthesis loci, a number between those detected for the oral and the intestinal species. The rumen is an intermediate alimentary tract location, distal to the oral cavity but proximal to the intestine. These data reveal that the number of polysaccharide biosynthesis loci contained in the Bacteroidales species analyzed correlates well with their habitat: without exception, those species residing in the intestine contain a greater number of polysaccharide biosynthesis loci than those in the more proximal alimentary sites. P. distasonis and P. merdae are phylogenetically closer to the oral organisms T. forsythensis and P. gingivalis than to the intestinal Bacteroides spp., yet, like the Bacteroides spp., they also have an extensive number of polysaccharide biosynthesis loci. Therefore, the synthesis of multiple polysaccharides is a general characteristic of the intestinal Bacteroidales.
This niche-specific characteristic does not hold for another glycan-containing class of molecules. The closely related organisms T. forsythensis and P. distasonis each synthesize a family of surface layer glycoproteins (12, 16). The genome of T. forsythensis contains two genes whose products belong to the same family as at least nine glycoproteins of P. distasonis. We analyzed the genomes of each of the other 10 Bacteroidales species and found that only P. merdae contains genes for the synthesis of similar S-layer glycoproteins (data not shown). P. merdae is most closely related to P. distasonis and T. forsythensis (Fig. 1); thus, the presence of genes encoding this family of glycoproteins correlates with phylogenetic relatedness rather than niche. Of note, however, is the fact that the oral organism T. forsythensis contains only 2 glycoprotein orthologs, whereas P. distasonis and P. merdae contain at least 9 and 10 orthologs, respectively. Therefore, within these three related species, the two intestinal members contain more orthologs than the oral member, paralleling the situation found with the polysaccharide biosynthesis loci.
The large number of polysaccharide biosynthesis loci present in the genomes of the intestinal species suggests extensive transfer of DNA between these species, with a selection for strains encoding multiple polysaccharides. Bacteroides spp. have a large arsenal of mobile genetic elements involved in DNA transfer (20). Oral Bacteroidales species also have genetic material involved in DNA transfer (11); however, because their association with the host is pathogenic and often intermittent rather than mutualistic and lifelong like that with the intestinal species, their environment may not be optimal for DNA transfer. Alternatively, the synthesis of multiple surface polysaccharides may not confer a selective advantage to organisms in the oral cavity.
We next set out to determine if the ability of Bacteroidales species to phase vary synthesis of their polysaccharides by DNA inversion of promoters is also a niche-specific characteristic. We previously showed that seven of the eight characterized polysaccharide biosynthesis loci of B. fragilis have promoters that undergo DNA inversion (15). These promoters are contained in 168- to 193 bp-regions that are flanked by inverted repeats (IRs) of 19 to 25 bp. To find IR regions in the 74 different Bacteroidales polysaccharide loci listed in Table 1, the EMBOSS program (18) Einverted was used to retrieve all IRs in the selected regions that reached a program threshold of 35 and were separated by no more than 500 bp. IRs detected by this analysis were further considered only if they were contained within an intergenic region, were at least 15 bp long, and contained sufficient DNA between them to contain a promoter. IR-flanked regions meeting these criteria were analyzed for the presence of a B. fragilis-like consensus promoter sequence (3) using MEME (2), ClustalW (5), and custom Perl scripts. For the 74 different polysaccharide biosynthesis loci analyzed, we found 36 that contained IR regions that met our criteria (Table 2). Thirty-five of these IR regions were present just upstream of the polysaccharide biosynthesis loci of intestinal Bacteroidales, and each intestinal species contained at least two polysaccharide loci with such an IR region. The only oral species with an IR region meeting each of the criteria was P. intermedia. The position of this IR region differs from those of the intestinal species, as it is in the middle rather than at the start of the polysaccharide locus and is present just upstream of a gene encoding a putative transposase. For two of the P. distasonis polysaccharide regions, PS9 and PS12, two sets of adjacent but nonoverlapping IR regions were detected. The IR pairs that were in closer proximity to the first gene of the polysaccharide regions had a better match with the consensus promoter sequence and were further analyzed. Although there was some variability in the amount of DNA between each of the 36 IR pairs and between the downstream IR and the first gene of the locus, these distances were never more than 321 bp and 307 bp, respectively (Table 2).
|
|
The genetic architecture of the group 4 IRs, which include the seven B. fragilis polysaccharide loci IRs and four of the six B. uniformis IRs, differs from that of the other four groups in that these IRs are not present downstream of Tsr genes. Inversion of the seven B. fragilis capsule polysaccharide promoters is mediated by a single global DNA invertase, Mpi, which is an Ssr (6). The similarity of the four B. uniformis IRs to those of B. fragilis and the lack of an upstream Tsr gene suggest that inversion of these IR regions is also mediated by a global Ssr. We found that of the 12 Bacteroidales species analyzed, B. uniformis encodes a product that is the most similar to Mpi, demonstrating 90% similarity along the lengths of the products. The next most similar product to Mpi is Gfi, encoded by P. distasonis. Gfi is another Ssr that acts as a global DNA invertase, mediating promoter inversions of glycoprotein genes (12).
Inversion of the seven B. fragilis and four B. thetaiotaomicron polysaccharide promoters has been previously described (15, 24). To verify our predictions that the other 24 promoters between each of the IR pairs upstream of the intestinal Bacteroidales polysaccharide biosynthesis loci indeed invert, we performed PCR analysis as shown in Fig. 2. Two PCRs were performed for each region, one using a forward primer upstream of the IR region with a central primer between the IRs and a second using the same central primer with a downstream reverse primer outside of the IR region (for primer sequences, see Table S1 in the supplemental material). PCR products would result from both PCRs only if the DNA between the IRs inverted. As shown in Fig. 2, PCR products were generated for both PCRs for each locus. The sizes of the corresponding products correlate exactly with the sizes predicted if the regions inverted exactly between the IRs (Table S1 in the supplemental material). Therefore, each of the polysaccharide promoters that is predicted to undergo inversion does indeed invert.
|
This study has revealed that the ability of intestinal Bacteroidales species to synthesize multiple polysaccharides and to phase vary their expression by promoter inversion is a remarkably conserved characteristic. This conservation likely highlights the importance of this process to the survival of these species in the competitive human intestinal ecosystem. The mechanism of inversion, whether mediated by local Tsrs or by a global Ssr, seems less important than the result: extensive and rapid surface variability. This feature ensures that, for a single strain, there will always be a heterogeneous population of organisms, some of which may be more resistant to an attack, whether from a phage, from other bacterial members, or from the host.
| ACKNOWLEDGMENTS |
|---|
This work was supported by grant AI44193 from the National Institutes of Health (NIAID).
| FOOTNOTES |
|---|
Published ahead of print on 9 November 2007. ![]()
Supplemental material for this article may be found at http://jb.asm.org/. ![]()
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Appl. Environ. Microbiol. | Infect. Immun. | Eukaryot. Cell |
|---|---|---|
| Mol. Cell. Biol. | J. Virol. | Microbiol. Mol. Biol. Rev. |
| ALL ASM JOURNALS |