Previous Article | Next Article ![]()
Journal of Bacteriology, February 2007, p. 1330-1341, Vol. 189, No. 4
0021-9193/07/$08.00+0 doi:10.1128/JB.01058-06
Copyright © 2007, American Society for Microbiology. All Rights Reserved.
Laboratoire de Génétique Microbienne, Institut National de Recherche Agronomique, 78352 Jouy-en-Josas Cedex, France,1 INSERM E44, Institut Cochin, Centre National de Référence des Streptocoques, Service de Bactériologie Hôpital Cochin, Faculté de Médecine Paris 5, 27 rue du faubourg Saint Jacques, 75679 Paris Cedex 14, France2
Received 19 July 2006/ Accepted 19 October 2006
|
|
|---|
|
|
|---|
The population structure of several oral streptococci has been studied previously (7, 19, 26, 38). Genetic typing of commensal or opportunistic pathogens has revealed considerable diversity, sometimes even within a single host. Data obtained in a Streptococcus mitis population analysis support the concept that the strains are transient, and this species appears to be maintained by clonal replacement of evolving strains rather than by stable strains (19, 26). Almost all isolates of Streptococcus mutans obtained either from 30 individuals or from a small group of families displayed distinctive restriction fragment length polymorphism or restriction endonuclease analysis patterns (7, 38). Fitzsimmons et al. suggested that the high degree of diversity observed in several mucosal bacteria may be a mechanism for avoiding immune elimination (19). S. mitis and Streptococcus oralis strains isolated from the blood of neutropenic cancer patients were also highly diverse, as they all had distinct fingerprint patterns (57). The population structure and genetic diversity of S. salivarius and S. vestibularis have been not extensively investigated, in spite of the fact that S. salivarius is the predominant oral species, especially during mouth colonization, and can be associated with caries (2, 35, 43).
Multilocus sequence typing (MLST), a method based on the nucleotide sequences of
500-bp internal fragments of multiple (usually about seven) housekeeping genes, has been widely used to study global epidemiology and bacterial population structure (15). MLST can reveal highly clonal populations, as well as freely recombining populations, and the latter are exemplified by Streptococcus uberis or Streptococcus pneumoniae populations (8, 16). Several reports have indicated that most streptococcal species have a highly recombinational population structure (33, 37, 58). The genetic relationships of group B, C, G, and A streptococcal isolates from asymptomatic carriers and from human infections were investigated by phylogenetic analysis of MLST data (3, 32, 33). Here we used the MLST method to investigate the genetic relationship between strains of S. salivarius and strains of S. vestibularis. Below we describe an analysis of data obtained from 27 S. salivarius strains and 9 S. vestibularis strains recovered from patients with septicemia or from the oral cavities of healthy individuals. Our results document the population structure of S. salivarius, the relationship between the two species, and the extent of gene exchange in the evolution of these oral streptococci.
(This work was presented in part at the 7th ASM Conference on Streptococcal Genetics, Saint Malo, France, June 2006.)
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Strains and MLST data
|
Multilocus sequence typing. We searched for candidate MLST loci among housekeeping genes which were used previously for MLST of other gram-positive bacteria and which could be amplified with degenerate primers (ddlA, tkt, glcK, pyrE, and sodA) or whose sequences were available in a database for closely related species (ilvC, pepO, thrS, and dnaE). Furthermore, we checked to make sure that the genes were usually found in distant locations in previously described streptococcal genomes, especially the genome of the closely related species Streptococcus thermophilus. The nucleotide sequences of internal fragments of the following nine genes that were selected were determined: ilvC (encoding ketol-acid reductoisomerase), ddlA (encoding D-alanine D-alanine ligase), glcK (encoding glucose kinase), pepO (encoding endopeptidase), thrS (encoding threonyl-tRNA synthetase), tkt (encoding transketolase), pyrE (encoding orotate phosphoribosyltransferase), dnaE (encoding DNA polymerase III), and sodA (superoxide dismutase). For the sodA gene, we used primers and an amplification procedure described previously (46). Primers used in this study are listed in Table 2. One bacterial colony, freshly grown on an agar plate, was suspended in 50 µl of TES (10 mM Tris-HCl, 1 mM EDTA, 25% sucrose), and DNA was extracted by lysis in a thermocycler at 95°C for 10 min and at 4°C for 15 min. The DNA obtained was used immediately for PCR amplification performed with the GeneAmp 9700 PCR system (Perkin-Elmer) by using the following cycling parameters: 5 min at 94°C, followed by 30 cycles of 30 s at 94°C, 30 s at 50°C, and 1 min at 72°C. Amplified products (459 to 527 bp of internal gene fragments) were examined on a 1.5% agarose gel. They were used for direct sequencing by a fluorescent sequencing procedure with the same PCR primers that were used for the initial PCR amplification. The sequence of each fragment on both strands was determined with an Applied Biosystems 370A DNA sequencer by using a Bigdye Terminator sequencing kit according the manufacturer's protocol (Perkin-Elmer). To ensure the accuracy of the sequence, amplicons obtained from at least two bacterial colonies were used for sequence determination.
|
View this table: [in a new window] |
TABLE 2. Oligonucleotide primers for S. salivarius and S. vestibularis
|
For pyrE sequence alignment, the nucleotide sequence of the mobile element found in allele 11 (S. salivarius) was not taken into account.
Data treatment and statistical analysis. For each locus, all the sequences were compared, and arbitrary allele numbers were assigned to the different sequences. The combination of alleles at each locus defined an allelic profile or sequence type (ST) for a strain. Strains with the same allelic profile were assigned to the same ST. The STs were identified by arbitrary numbers.
The number of polymorphic nucleotide sites and the maximal and average levels of nucleotide divergence of alleles (expressed as percentages) at a given locus were calculated using the MEGA software (version 3; http://www.megasoftware.net) (39). Phylogenetic analyses of the nucleotide sequences of each housekeeping gene separately and the concatenated sequence containing ddlA, thrS, pyrE, dnaE, and sodA were performed using the neighbor-joining (NJ) method and the same software. Because of their extensive nucleotide diversity, the glcK, ilvC, and pepO sequences could not be used in the MLST scheme, so we removed them from the concatenated sequence. A Kimura two-parameter distance model was used to estimate distances for nucleotide sequences. To determine the significance of the groups observed in trees constructed by the NJ method, a bootstrap analysis with 1,000 replicates was performed.
The split decomposition method was used to assess the degree of tree-like structure for alleles found for each locus (30). The sequence alignments were converted to NEXUS files, and the split decomposition analysis was performed with the SPLITSTREE 3.1 program (http://bibiserv.techfak.uni-bielefeld.de/splits/).
A statistical analysis based on the dN/dS ratio (41), which was the ratio of the number of substitutions that changed the amino acid sequence (dN is number of nonsynonymous substitutions per nonsynonymous site) to the number of silent substitutions (dS is the number of synonymous substitutions per synonymous site), was performed by using methods described with the START program (K. Jolley; http://mlst.zoo.ox.ac.uk/links/START). The standardized index of association (ISA) was calculated as described by Haubold and Hudson (24). In our study, the set of S. salivarius isolates analyzed for linkage equilibrium did not contain CIP55.126 and JIM8774, which are considered to be very closely related to CIP53.128 and JIM8776, respectively.
Nucleotide sequence accession numbers. The sequences of all alleles have been deposited in the GenBank database under accession numbers DQ460524 to DQ460540 (ilvC fragment), DQ460609 to DQ460638 (ddlA fragment), DQ460507 to DQ460523 (glcK fragment), DQ460541 to DQ460562 (pepO fragment), DQ460588 to DQ460608 (thrS fragment), DQ460478 to DQ460482 (tkt fragment), DQ460563 to DQ460587 (pyrE fragment), DQ460483 to DQ460506 (dnaE fragment), and EF054776 to EF054795 (sodA fragment). The sequences of large regions have been deposited in the GenBank database under accession numbers DQ46995 (dexS and pepO of S. salivarius LMG13109), DQ464996 (dexS and pepO of S. vestibularis LMG14645), DQ464998 (ilvB, ilvN, ilvC, and tyrS of S. salivarius LMG13109), DQ464994 (ilvB, ilvN, ilvC, and tyrS of S. vestibularis LMG14645), DQ464997 (tkt and tRNAThr of S. salivarius LMG13109), and DQ464993 (orf1, tkt, and trkA2 of S. vestibularis LMG14645).
|
|
|---|
![]() View larger version (47K): [in a new window] |
FIG. 1. Polymorphic nucleotide sites of ddlA alleles of S. salivarius and S. vestibularis: variable sites in each allele of the ddlA locus from the 27 S. salivarius strains and 9 S. vestibularis strains. The site numbers at the top are in vertical format, and S. vestibularis alleles are shaded.
|
|
View this table: [in a new window] |
TABLE 3. Genetic diversity at S. salivarius and S. vestibularis loci
|
![]() View larger version (15K): [in a new window] |
FIG. 2. Phylogenetic tree based on concatenated sequences of five housekeeping genes (ddlA, thrS, pyrE, dnaE, and sodA). The tree was constructed using the neighbor-joining method. Bootstrap values that are 80% are indicated at nodes. Scale bar = 0.005 nucleotide substitution per site. Disease-related isolates are indicated by an asterisk.
|
Evidence of intraspecific recombination in S. salivarius obtained by statistical analysis. Two types of statistical analysis were used to characterize intragenic recombination at S. salivarius loci analyzed by MLST.
(i) Standardized index of association. The index of association has been widely used to analyze the degree of linkage disequilibrium between alleles in MLST and multilocus enzyme electrophoresis data (49). Haubold and Hudson described the standardized index of association, which does not depend on the number of loci analyzed (24). An ISA value of zero is expected for alleles in linkage equilibrium, indicating that alleles are distributed independent of each other because of free recombination (24). The ISA value calculated for our S. salivarius data was close to zero (0.057), suggesting that there was free recombination and thus a high level of gene exchange.
(ii) Split decomposition analysis. Evidence of recombination was obtained by split decomposition analysis using the SPLITSTREE 3.1 program. The algorithm used in this software is able to display conflicting results in the phylogenetic descent of sequences. A tree-like structure is created when the descent is clonal, but an interconnecting network-like structure appears when recombination plays a role in the evolutionary history of the population analyzed (30, 52). The results obtained with unique S. salivarius alleles are shown in Fig. 3 and in Fig. S2 in the supplemental material. All but one split graph have network-like structures consistent with a recombinational population structure. The sole exception is the ddlA split graph, which is closer to a bush-like structure.
![]() View larger version (13K): [in a new window] |
FIG. 3. Split decomposition analysis of alleles present in 27 S. salivarius strains: split graphs for thrS, pyrE, and ddlA alleles. In some graphs several alleles are connected to each other by multiple pathways, forming an interconnected network and suggesting that there were recombination events. The numbers are allele numbers. Split graphs for ilvC, pepO, glcK, dnaE, and sodA alleles are shown in Fig. S2 in the supplemental material.
|
![]() View larger version (22K): [in a new window] |
FIG. 4. Phylogenetic relationships among 27 S. salivarius strains and 9 S. vestibularis strains. Trees are shown for glcK, ilvC, and pepO loci. Ssal, S. salivarius; Sves, S. vestibularis; p, pathogen; c, commensal; Spneumo, S. pneumoniae; Spsang, S. parasanguinis. The strain numbers correspond to ST numbers shown in Table 1. The trees were constructed by using the neighbor-joining method. Clusters and bootstrap values that are 80% are indicated. The scale bars indicate the number of nucleotide substitutions per site.
|
|
View this table: [in a new window] |
TABLE 4. Genetic diversity of clusters at the glcK, ilvC, and pepO loci
|
![]() View larger version (27K): [in a new window] |
FIG. 5. HGT events detected in S. salivarius pepO, ilvC, pyrE, and tkt genes. Regions where there are high and low levels of diversity are indicated by brackets, and the level of diversity is shown for each region. NS, no significant homology. (a) In the pepO region, 2,095-bp nucleotide sequences of LMG13109 cluster II and LMG14645 cluster I were compared. The regions were compared with the S. parasanguinis pepO region. (b) In the ilvC region, 2,765-bp nucleotide sequences of LMG13109 cluster II and LMG14645 cluster I were compared. The regions were compared with the S. pneumoniae ilvC region. (c) In allele 11 of the pyrE locus, a 234-bp DNA insertion containing 30 bp of S. salivarius IS1139 (gray box) and 204 bp of DNA whose function and origin are unknown is present and is flanked by a 157-bp duplication (arrows). (d) In the tkt region, 2,506-bp nucleotide sequences of S. salivarius LMG13109 and S. vestibularis LMG14645 were compared. orf1 exhibits the highest level of homology to str0310 from S. thermophilus.
|
Strikingly, the tkt loci were significantly different in S. salivarius and S. vestibularis as the S. salivarius gene could not be amplified with several oligonucleotide pairs. Successful amplification of an internal fragment was achieved only with oligonucleotides tkt-up6 and tkt-dn6, and the fragment was sequenced. The level of divergence of the S. salivarius and S. vestibularis tkt gene sequences was 35.2% (Fig. 5d). Phylogenetic analysis showed that two types of tkt genes are present in streptococci (Fig. 6). The tkt gene of S. salivarius belongs to the first group (highest level of identity with Streptococcus pyogenes and Streptococcus agalactiae genes, 78%), whereas the tkt gene of S. vestibularis belongs to the second group (highest level of identity with S. thermophilus gene, 98%). Further amplification by inverse PCR and sequencing of the flanking regions showed that the genetic organization in the tkt region was different in the two species (Fig. 5d). The S. vestibularis tkt gene is flanked by an ABC transporter gene (highest level of homology to str0310 from S. thermophilus) and trkA2 (encoding potassium uptake protein); a similar organization occurs in S. thermophilus. By contrast, the S. salivarius tkt gene is followed by a tRNAThr gene and is situated in a region flanked by tpi (encoding triose phosphate isomerase) and tmk (thymidylate kinase), which are adjacent in the S. thermophilus genome (data not shown). PCR experiments showed that the genetic organization of the tkt region is typical for each species (S. salivarius and S. vestibularis) (data not shown). The trkA2 and tpi loci are located 170 kb apart in the S. thermophilus genome, indicating that a major rearrangement occurred during differentiation of the S. vestibularis and S. salivarius loci.
![]() View larger version (24K): [in a new window] |
FIG. 6. Additive distance tree for tkt genes from streptococci and lactococci. Alignment was performed with ClustalW, and the tree was constructed by the flexible method. The levels of identity between the two clusters range from 54% to 58%, whereas the levels of identity are more than 73% for the genes within each cluster.
|
|
|
|---|
MLST of S. salivarius. As a first step in developing an MLST method, the internal fragments of eight loci (dnaE, thrS, pyrE, ilvC, pepO, ddlA, sodA, and glcK) were amplified from all strains tested. One locus, tkt, could not be amplified from S. salivarius strains. The eight loci were polymorphic, and the maximal levels of divergence of three genes, ilvC, pepO, and glcK, were more than 12%. These high levels of divergence, which could have resulted from interspecific HGT events (see below), prevented use of these three loci in an MLST scheme for S. salivarius.
An analysis of alleles and STs showed that the diversity in S. salivarius is very high. The variability is comparable to the maximal sequence divergence (5 to 32%) and the mean nucleotide diversity (16.6%) reported for group C and G streptococci and for S. oralis, respectively (33, 55). Like S. salivarius, these streptococci are considered to be commensals, even though they are sometimes associated with human disease. The level of S. salivarius diversity is significantly higher than the levels of diversity of group A streptococci (maximal divergence, 1.4 to 6.1% [17]) and group B streptococci (1.2 to 2.5% variable nucleotide sites [32]). We found that S. salivarius isolates from the same individual exhibit a high degree of genetic diversity, as observed previously for several other mucosal streptococci (S. mitis, S. mutans, and S. oralis) (19, 38, 57). The phylogenetic tree of concatenated sequences of S. salivarius did not allow us to cluster strains isolated from human blood specifically. We concluded that commensal and potentially pathogenic strains do not belong to distinct S. salivarius populations. The lack of clear separation of commensal and clinical S. salivarius isolates resembles the findings reported for several other commensal streptococci, such as group C and G streptococci (S. mitis and S. oralis), which also exhibit opportunistic pathogenicity (33, 55).
Genetic relationship of S. salivarius and S. vestibularis isolates. S. salivarius and S. vestibularis are included in the salivarius group of viridans streptococci (18). In 1988, S. vestibularis was described as a new oral species (56), while further analysis showed that it is very closely related to S. salivarius (34, 46). In this work, we found that the levels of relatedness between S. vestibularis and S. salivarius are very different at different genetic loci.
Two findings support the notion that S. salivarius and S. vestibularis should be considered separate species: (i) the strongly supported clusters of S. vestibularis alleles at the glcK, pyrE, dnaE, sodA, and ilvC loci and (ii) the extent of genomic divergence in particular loci, such as the divergence observed for tkt. Incidentally, the great differences in the sequence and genetic organization of the tkt locus could probably be used to develop a rapid test for discriminating these two bacteria.
By contrast, the lack of resolution of S. vestibularis and S. salivarius at three loci (ddlA, pepO, and thrS) suggests that there is frequent gene exchange between these two species. Considering the mosaic structure of these three S. vestibularis genes and the low levels of divergence found at five other loci (glcK, ilvC, pyrE, sodA, and tkt), we suggest that S. vestibularis may be a recently emerging population which evolves by interspecific gene exchange. However, analysis of a larger sample of S. vestibularis strains is necessary to test this hypothesis more stringently and to better define the evolutionary scheme of S. vestibularis.
Intraspecific recombination in S. salivarius and S. vestibularis populations. Analysis of different loci of S. salivarius and S. vestibularis indicated that the loci have different evolutionary histories. Extensive intraspecific recombinational exchanges should have occurred to explain the features observed.
The noncongruence between allele phylogenetic trees and the low index of association of MLST alleles can be explained by frequent recombination events. The extent of the conflicting phylogenetic results is illustrated by the fact that a strain which has a certain position in one allele tree can occupy a very different position in another allele tree, with both positions strongly supported by bootstrap analysis. The conflicting phylogenetic signals found for seven S. salivarius loci in split decomposition analysis, coupled with a low association index value (ISA, 0.057), indicate that intraspecific recombination occurs frequently in S. salivarius and plays a major role in generating sequence diversity between strains. A specific gene transfer event is revealed by the presence of pyrE allele 11 in two strains isolated from human blood 5 years apart (CIP104994 isolated in 1996 and CCHSS4 isolated in 2001). This allele contains a specific DNA insertion consisting of 30 bp of S. salivarius insertion sequence IS1139 (40), followed by 204 bp of DNA whose origin is unknown (Fig. 5c). The events required to generate this allele were unlikely to occur independently twice, suggesting that the allele was exchanged horizontally. In S. vestibularis, split decomposition analysis suggested that intragenic recombination occurred at four of the nine loci (not shown). Taken together, these results support the conclusion that intraspecific recombination occurs extensively in the S. salivarius population and probably in S. vestibularis.
HGT between oral streptococci. The presence of longer branches in ilvC, glcK, and pepO S. salivarius split graphs (see Fig. S2 in the supplemental material) are consistent with the importation of divergent genes from other species (27). This hypothesis is supported by the higher levels of homology of cluster II alleles of ilvC and pepO loci with S. pneumoniae and S. parasanguinis alleles, respectively, than with the other S. salivarius allele clusters. S. pneumoniae and S. parasanguinis are oral streptococci which might come in contact with S. salivarius and S. vestibularis in the buccal cavity. Evaluation of the extent of HGT in ilvC and pepO genes by sequence analysis indicated that exchange of 725- and 1,783-bp internal gene fragments, respectively, took place (Fig. 5a and b). Differences in chromosomal localization and high levels of nucleotide divergence between tkt genes from S. salivarius and S. vestibularis are also consistent with an HGT event. Detailed cluster analysis of these sequences and the presence of multiple variants in clusters suggest that the transfers are ancient. For example, the transfer of the ilvC divergent allele, possibly from the mitis group, likely preceded the emergence of the food species S. thermophilus in the salivarius group (4, 28). Notably, alleles of each cluster are still maintained in the population.
The extent of recombination and HGT in S. salivarius and S. vestibularis populations suggests that these bacteria have efficient mechanisms for gene acquisition and recombination, although they are not known to be naturally competent (6). Natural transformation has been described for S. pneumoniae and several viridans streptococci, such as S. mutans and Streptococcus sanguinis (1, 20, 44), and plays an important role in generating the high levels of genotypic diversity in these species. Recombinational events are responsible for the dissemination of genes encoding virulence factors, such as antibiotic resistance markers, quinolone resistance, immunoglobulin A1 protease, and competence-stimulating peptide (5, 21, 25, 45, 54). Altered penicillin binding protein genes are directly involved in penicillin resistance, and DNA sequences closely related to them appear to have been distributed horizontally between S. pneumoniae and viridans streptococci (12, 13, 22). Gene flow from human pathogens to commensal streptococci, described previously for groups A, C, and G (33, 50) and for the oral mitis group (55) and also observed here, may be involved in the evolutionary model of the commensal streptococci, which can be associated with bacterial infections. Recently, phylogenetic analyses of more than 20 streptococcal species revealed several examples of interspecific homologous recombination in housekeeping genes of viridans streptococci (29). Evidence for HGT and recombination events in this study associated S. salivarius and S. vestibularis with this group of oral streptococci, which have great potential to be receptors of foreign genes.
The absence of clustering of S. salivarius strains isolated from blood suggests that these strains are opportunistic pathogens. However, the high potential of S. salivarius and S. vestibularis for acquisition of new alleles at loci involved in general functioning of the cell and the possible acquisition of antibiotic resistance determinants (23, 51) raise the possibility that strains isolated from human infections acquired DNA that promoted their pathogenic behavior. Further studies are required to examine this possibility.
Supplemental material for this article may be found at http://jb.asm.org/. ![]()
Published ahead of print on 3 November 2006. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»