Previous Article | Next Article ![]()
Journal of Bacteriology, October 2008, p. 6302-6317, Vol. 190, No. 19
0021-9193/08/$08.00+0 doi:10.1128/JB.01984-07
Copyright © 2008, American Society for Microbiology. All Rights Reserved.
,
Priscila D. Alves,1,2
Marie-Françoise Cochet,1,2
Michel Gautier,1,2
Michael Otto,5
J. Ross Fitzgerald,3 and
Yves Le Loir1,2*
INRA, UMR1253 STLO, F-35000 Rennes, France,1 Agrocampus Rennes, UMR1253, F-35000 Rennes, France,2 Centre for Infectious Diseases, The Chancellor's Building, New Royal Infirmary, University of Edinburgh, Edinburgh EH16 4SB, Scotland, United Kingdom,3 Research Technologies Branch, Rocky Mountain Laboratories, National Institute of Allergy and Infectious Diseases, The National Institutes of Health, Hamilton, Montana 59840,4 Laboratory of Human Bacterial Pathogenesis, Rocky Mountain Laboratories, National Institute of Allergy and Infectious Diseases, The National Institute of Health, Hamilton, Montana 598405
Received 21 December 2007/ Accepted 9 June 2008
|
|
|---|
|
|
|---|
Population genetic analyses have revealed extensive genetic variation in natural populations of S. aureus (27, 28, 46). Importantly, several previous studies have found that S. aureus clones are largely host specific and are rarely associated with cross-species transfer (22, 39, 47). In particular, the presence of ovine- and bovine-specific lineages of S. aureus was reported (30), and these data suggest that the majority of cases of bovine and ovine mastitis are caused by a small number of host-specialized clones (11).
The genome sequencing of numerous S. aureus strains of human origin has revealed a great deal of variation in gene content, particularly in the complement of mobile genetic elements (MGEs), which could reflect differences in virulence or tissue or disease tropism (32, 33). Several studies have shown that certain virulence genes are overrepresented in some clonal lineages and that some combinations correlate with pathogenic potential (35, 46).
Very recently, Herron-Olson et al. (18) sequenced the genome of a bovine strain (RF122) isolated from mastitis and identified many unique genes that were not found among the human sequenced strains. It is possible that these novel genes could contribute to the host specificity observed among bovine clones. Previously, we constructed a whole-genome microarray specific for the human methicillin-resistant S. aureus (MRSA) strain COL and carried out comparative genomic hybridizations (CGH) to explore the gene content of a small number of bovine and ovine S. aureus strains (12). We found that some bovine strains were genetically allied with the common ovine S. aureus lineage, suggesting that a similar gene complement may be required for bovine and ovine mastitis. However, the CGH analysis was limited by the microarray, which represented the genome of a single human strain and did not allow the identification of genes specific for bovine or ovine strains. Further, CGH does not allow the identification of the genomic location of components of the accessory genome. In contrast, the previously described whole-genome PCR scanning (WGPS) has been successfully used to investigate genome structure diversity in closely related strains of enterohemorrhagic Escherichia coli (34). WGPS is based on the long-range PCR (LR-PCR) amplification of bacterial chromosomes by using a set of primer pairs designed on a reference genome and gives an overall view of genome structure. We developed bioinformatic tools dedicated to WGPS to extend this approach to gram-positive species such as S. aureus (4, 5).
In order to investigate the genetic basis of the host adaptation of ruminant strains of S. aureus, we analyzed their genome structure and content using WGPS and CGH. The combination of these two complementary techniques allowed the identification and genome localization of genetic loci that were divergent in the ruminant strains examined. Unexpectedly, the majority of the regions of difference (RDs) specific for ruminant strains was located in the core genome and was not associated with MGEs. A majority of these determinants encode proteins predicted to be exported, suggesting a role in S. aureus-ruminant host interactions.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Staphylococcus aureus strains used in this study
|
DNA-DNA microarray hybridization. DNA-DNA microarray hybridization was used to assess variation in gene content among the 12 S. aureus strains. Hybridizations were performed as described previously (12, 48). Briefly, genomic DNA extracted from strains 1178, 1183, RF122, 1166, 1167, Newbould N305, 1170, 1171, 1173, and 1174 (hereafter designated test strains) were hybridized on the microarray with strain N315 (hereafter designated the control strain) genomic DNA as the reference. Three independent hybridizations were performed for each test strain with the probe set (gene) detection, as determined by using the one-sided Wilcoxon's signed rank test and the one-step Tukey's biweight estimate for signal detection (Affymetrix, Inc., Santa Clara, CA). Coding sequences (CDSs) that gave an equivalent hybridization signal for both the control and test strains were designated common and considered present in the two genomes (test and control strains). Common CDSs are indicated in Fig. 2 and in Table S1 in the supplemental material. CDSs that gave a low signal compared to that of the control N315 were designated poorly detected (see Table 4). They likely had enough sequence divergence that strong hybridization did not occur in one or two of the three independent hybridizations (allelic variation was suspected). They are indicated in Fig. 2 and in Table S1 in the supplemental material. Genes that gave no signal at all in each of the hybridizations compared to that of the control N315 were designated not detected and either were absent from the genome (gene loss was suspected) or had enough sequence divergence that hybridization did not occur under the conditions of stringency used (allelic variation was suspected). They are indicated in Fig. 2 and in Table S1 in the supplemental material. Genes that gave a signal in the test strain but not in the control N315 were designated additional (gene gain was suspected).
![]() View larger version (67K): [in a new window] |
FIG. 2. Summary and data comparison of the WGPS and the microarray analyses of the set of 12 S. aureus strains. Results of the WGPS of the 12 strains are shown in the upper lines. WGPS scanning analysis is presented as follows. Segments that yielded PCR products of identical, smaller, and bigger sizes compared to those in N315 are depicted by blue, dark blue, and light blue rectangles, respectively. Fragments that were not amplified are depicted by white rectangles. Fragments that were demonstrated as being absent by framing PCR experiments are depicted in gray. The positions of the MGEs, as determined in the N315 annotation, are given in the row above WGPS results. Results from the gene content analysis using the whole-genome microarray are shown in the lower lines. CGH analyses were carried out using N315 as the reference (Mu50, whose gene content is known, was not analyzed here). Genes that gave a signal in both N315 and the test strain were considered present and are indicated in blue. Genes that gave a low signal or no signal at all compared to that of N315 were judged as variable and are indicated in orange and red, respectively. The lower part of the figure summarizes the success rates of the WGPS. For each segment, the number of strains (1 to 12) that yielded an LR-PCR product is given.
|
|
View this table: [in a new window] |
TABLE 4. Specific loci identified by both CGH and WGPS
|
Primer design for WGPS.
WGPS first was employed in a study of Escherichia coli O157 isolates (34). Here, we intended to use it to gather information on the chromosomal structural diversity of highly divergent S. aureus strains. Accordingly, the design of primers was critical. Using GenoFrag, a software program dedicated to the design of primers optimized for WGPS that we previously developed (5), a set of 295 pairs of 25-mer primers was designed based on the N315 sequence. Primer specificity then was assessed by comparing primers to each of six publicly available S. aureus sequences (Mu50, MW2, COL, NCTC8325, MSSA476, and MRSA252) (2, 19, 25) using local software based on BLASTn (1). Primers that presented absolutely no annealing with any of the six genome sequences tested were kept, since they may be included in N315-specific regions. Using this set of primer pairs, 295 segments of
10 kb covering the whole N315 chromosome, with overlaps of
1 kb at every segment end, were predicted. Primer sequences are available upon request from the authors.
PCR analysis for WGPS. PCRs were performed using genomic DNA as the template and LR-PCR kits (GeneAmpXL, Applera, France) using PCR conditions previously described (4, 5), and PCR products were routinely analyzed by 0.5% agarose gel electrophoresis. A fragment was considered significantly different when its size differed by more than 1-kb from the expected size in N315. This threshold was determined to take account of the resolution level of size estimation in routine agarose gel electrophoresis. When a fragment was not amplified after two attempts, additional PCR studies were performed using various combinations of primers, considering three types of genomic variation: (i) the lack of the target sequence due to deletion, (ii) nucleotide divergence at the hybridization site, and (iii) the expansion of the distance between the primers by a large insertion. By such additional experiments (named overlapping PCR and framing PCR), most variations of the first two types could be discriminated from the latter type. A total of 3,540 LR-PCRs were performed on the 12 tested strains. Results were graphically represented using GeneWiz software (21).
PCR of selected host-specific determinants. PCR tests were carried out on the 30 strains studied (Table 1), using genomic DNA as the template and primers specific for the RDs that were identified by WGPS and CGH analyses (Table 2). Amplification reactions were carried out in a Bio-Rad iCycler (Bio-Rad, Marnes la Coquette, France) using the following program: 4 min at 95°C, followed by 30 cycles of 30 s at 95°C, 30 s at 55°C, and 30 s at 72°C. The program finished with an additional 7-min extension step at 72°C. PCR products were analyzed by agarose gel electrophoresis.
|
View this table: [in a new window] |
TABLE 2. Primers used to detected ovine-caprine-specific regions of difference
|
|
|
|---|
![]() View larger version (36K): [in a new window] |
FIG. 1. MLST and PFGE genotyping of the 12 S. aureus strains. (A) Radial tree depicting the phylogenetic relationships of the STs of the 12 strains used in this study and all of the S. aureus STs available in the MLST database (1,095 as of March 2008). MEGA 4.0 software and the neighbor-joining method with a bootstrap value of 1,000 were used to build the tree. (B) PFGE macrorestriction analysis of the chromosome of the 12 strains (upper). Dendrogram presenting the percentage of genetic similarity between the 12 strains (lower). The unweighted-pair group method using average linkages and a Dice coefficient (with a tolerance limit of 1.1%) were used to build the dendrogram. M, molecular size marker.
|
Further, the additional isolates used to screen by PCR for the distribution of host-specific determinants were analyzed by PFGE and compared to each other (see the additional strains in Table 1, except 1465). All strains were genotypically distinct. Using a similarity cutoff of 50%, four groups were distinguished and correlated well with host origin. In the two ovine-caprine groups (two caprine, six ovine, one bovine, and three ovine, respectively), profile similarities ranged from 51 to 90%. In the bovine group (nine bovine and one caprine), profile similarities ranged from 54 to 88%. By PFGE analysis, a fourth group containing a bovine strain (LMA1259) appeared to be distantly related to the other strains (see Fig. S2 in the supplemental material).
Variation in genome content identified by CGH. CGH experiments were performed using an oligonucleotide microarray that was representative of the genome sequences of the human strains COL, N315, Mu50, EMRSA16, MSSA476, and 8325 and the bovine strain RF122. Strain N315 was used as a reference in all hybridization experiments. Three independent hybridizations were performed for each strain examined. In Fig. 2, each CDS was ordered along the chromosome according to the genome structure of the reference strain N315. The genome location of CDSs that are designated variable (i.e., low or no hybridization signal) or present by CGH was predicted by their location in the genome of the reference strain.
Overall, CGH revealed genome diversity within the S. aureus species that was greater than that previously reported. Among the 3,894 S. aureus CDSs spotted on the microarray, 1,875 CDSs were shared by the 12 strains studied, which is equivalent to 70.5% of the average number of CDSs found in the strains (n = 2,656). Among the 697 genes that were strain dependent or variable, 204 (29.3%) genes were predicted to be associated with MGEs or related islands (see Table S1 in the supplemental material).
The classification of the variable genes according to their putative or known function revealed that some categories contained more variable genes than others (Table 3). For example, in the functional categories defense mechanisms, replication, recombination and repair, and unclassified genes (i.e., not in clusters of orthologous groups [COGs]), 47.9, 37.0, and 62.8%, respectively, of genes were variable. This is partly explained by the fact that many of the variable genes are encoded on MGEs, such as S. aureus pathogenicity islands (SaPIs) or prophages, which contain genes for virulence, antibiotic resistance, replication, repair, and phage-specific functions in addition to numerous genes of unknown function. Several other categories contain a relatively high proportion (>20%) of variable and additional genes. This is the case for cell cycle control, cell division, chromosome partitioning (30.4%), inorganic ion transport and metabolism (23.2%), signal transduction mechanisms (22.9%), cell wall/membrane/envelope biogenesis (22%), amino acid transport and metabolism (21.8%), and unknown function (20.6%). In contrast, categories such as carbohydrate transport and metabolism (9.5%), energy production and conversion (6.8%), and nucleotide transport and metabolism (7.9%) contain a low level of variable genes. These data suggest that some categories contain a larger proportion of genes involved in niche adaptation or contingency function than other categories.
|
View this table: [in a new window] |
TABLE 3. Categorization and conservation of the genes in the S. aureus strains
|
Structural diversity of S. aureus chromosomes.
A limitation of CGH analysis is that it cannot identify genes present in the test strain but absent from the reference strains represented on the microarray. Also, genes that are present in the test strain but have undergone diversification such that hybridization with the array does not occur cannot be discriminated from those genes that are absent. The WGPS approach takes advantage of the sequence data available for one or more strains to localize and identify the regions of the genomic variation found in other strains (5, 34). A set of 295 primer pairs was designed from the N315 genome sequence as previously described (5) and used in the LR-PCR analysis of the 12 strains. As expected, the LR-PCR of strain N315 resulted in 100% amplification of 295 PCR products and a range of 80 to 99% amplification for the other strains examined (see Fig. S1 and Fig. S2 in the supplemental material). Interestingly, the hypermutator strain 1183 of human origin had an LR-PCR success rate of 95% in spite of the higher rate of nucleotide substitution expected for that strain, which was manifested in a very distinct PFGE type. Among bovine strains, RF122 had an amplification success rate of 86%, while that for the other bovine strains was 94%, consistently with the difference in gene content identified for strain RF122 in contrast to the other bovine strains. Among the successfully amplified WGPS fragments, a great majority (from
97% in Mu50 to
82% in ovine strains) were indistinguishable in length from those generated with the N315 genome.
Consistently with PFGE and MLST data, WGPS amplification profiles of the strains of human origin were very similar to but distinct from ovine and caprine strains that were closely related to each other (Fig. 2). The WGPS profiles of bovine strains 1166, 1167, and Newbould contained many genomic variations at the same sites that were largely distinct from those of bovine strain RF122.
Impact of MGEs on genome diversity.
The MGEs, such as staphylococcal cassette chromosome mec, SaPI, or prophages, found in N315 were considered present in a test strain when the corresponding PCR products were successfully amplified and were of a size identical to that of products generated with N315. Sometimes, WGPS fragments that included the left and right extremities of an MGE were obtained, but size variations were observed within the internal fragments, suggesting the presence of a variant MGE in the strain. In some cases, several internal fragments of a large MGE were amplified, but amplification across the predicted site of insertion failed, suggesting the possible existence of related MGEs localized elsewhere on the chromosome. The staphylococcal cassette chromosome mec was absent from all of the strains except Mu50. Considerable variation in the number and variety of MGEs was observed. Of note, CGH results showed that tst, the gene encoding TSST-1, was present in strains RF122, 1170, 1171, and 1174, whereas the corresponding WGPS fragment that contains the SaPI-encoding tst gene in N315 was not amplified. These data indicate a distinct location for the tst-encoded SaPIs in these strains, consistently with a previous study and recent genome sequence that defined the site of insertion of SaPIbov in RF122 (13, 18). Similarly, the presence of the 29.3-kb-long vSa
(previously annotated as SaPIn2), found in N315, which encodes the staphylococcal superantigen-like family of proteins (26), was predicted to be present by WGPS in the four human strains examined. This correlated well with the CGH results for human strain 1178, whereas for human strain 1183, 17 of the 31 CDSs that compose vSa
were variable (not detected or with a low signal), suggesting a high nucleotide divergence in this locus. In all of the animal strains, WGPS fragments were amplified at the vSa
locus but contained size variations that were reflected in CGH results: up to 15 out of the 31 CDSs of vSa
were not detected in the bovine strains. In ovine-caprine strains, CGH and WGPS data suggested the presence of variants of vSa
. The 26.2-kb vSaβ (previously annotated as SaPIn3), corresponding to the enterotoxin gene cluster (seg, sen, yent2, yent1, sei, sem, and seo) and lukDE, was not amplified in most animal strains. Only one LR-PCR product at the expected size was obtained in the bovine strains 1167 and Newbould, which correlated well with the CGH data. WGPS and CGH results suggest that the vSaβ locus is highly variable in animal strains. The lukDE genes are well conserved in the animal strains, but the enterotoxin gene cluster is absent from all bovine, ovine, and caprine strains except RF122. This observation was confirmed by RF122 sequence data (18). Taken together, these data demonstrate the considerable variety in gene content and chromosomal location that exists among SaPIs encoded by different strains. None of the SaPI variations observed were host specific. Prophage
N315 was absent in all other strains except Mu50. However, some internal fragments were amplified in strains 1178 (human), 1173 (ovine), and 1174 (ovine), indicating the presence of a related phage inserted elsewhere in the genome. Most of the insertion sequence (IS) elements found in N315 were absent in the corresponding fragments in the animal strains. Similarly, none of the four copies of Tn554 found in the genome of N315 were found among the other strains.
Combining WGPS and CGH results to identify host-specific chromosomal RDs. CGH analysis provides information on the gene composition but not the chromosomal locations. It also fails to identify strain-specific genes that are not represented on the microarray. Conversely, WGPS detects variable loci in which genetic events have taken place, while the identity of the gene content cannot be determined. In order to explore the genomic diversity of S. aureus with regard to host adaptation, we combined the two datasets (Fig. 2). Based on CGH results, we grouped the variable CDSs into four categories (Table 4): (i) bovine-specific RDs (five CDSs) found to be variable (i.e., either not detected, poorly detected, or additional compared to the CDSs of the N315 reference strain) in the four bovine strains tested, whereas they were not variable in either of the other (human or ovine-caprine) strains; (ii) ovine-caprine RDs (61 CDSs) were found to be variable in ovine-caprine strains and not in the other strains; (iii) ruminant-specific RDs (13 CDSs) were found to be variable in all of the ruminant strains and not in the human ones; (iv) a fourth category comprising variable CDSs of the four ovine-caprine strains plus RF122 (101 CDSs) was identified. Once those categories had been defined, WGPS data enabled discrimination between CDSs that were likely to be absent from the genome tested (the shorter WGPS fragment) and, thus, did not hybridize in CGH (gene loss was suspected) (Table 4) and CDSs that were present in the strains (no significant size variation in the corresponding WGPS fragment compared to that of the N315 reference) but likely had undergone sufficient nucleotide divergence such that they did not hybridize in CGH (allelic variation was suspected) (Table 4).
Two of five bovine-specific CDSs belong to genes classified as pathogenic factors, two other genes were unclassified, and one gene, fmhC, was classified in the cell wall category, and its product shares identity with FemA and FemB (45). Most of the ovine-caprine-specific CDSs (26 out of 61) are represented by unclassified genes. Interestingly, 35 CDSs were suspected to have undergone allelic variation (Table 4), among which are two fmt genes that reportedly are involved in autolysis and methicillin resistance (24) and two trp genes that are predicted to exist in an operon (49). Homologs of these genes made by Mycobacterium tuberculosis are involved in virulence and colonization (42). All 13 ruminant-specific CDSs were variable (i.e., low or no hybridization signal) and were located in loci harboring size variations according to WGPS results. Interestingly, genes involved in virulence (SSL10 and clfA) and autolysis (lytN) were found among those variable genes and, thus, may have undergone considerable divergence or may be absent from ruminant strains. In the fourth category, allelic variation was suspected for 59 CDSs. These 59 CDSs included genes involved in transport (e.g., fruA, encoding a fructose permease; and mtlF, encoding a component of a mannitol-specific phosphotransferase system), in membrane bioenergetics (mnhD, encoding an Na+/H+ antiporter subunit), in protein secretion (secY, encoding a preprotein translocase), in RNA synthesis (one lysR and two gntR transcriptional regulators), and in virulence (CDSs similar to those of streptococcal hemagglutinin and hemolysin III). Among the other CDSs of the fourth category, 12 CDSs corresponded to gene gain and could not be located in WGPS fragments (Table 4). Five of the latter genes are involved in epidermin (an S. epidermidis bacteriocin) biosynthesis, suggesting that horizontal gene transfer occurred in these strains.
Compared to N315 gene content, 151 host-specific CDSs were suspected to be absent or to have undergone allelic variation, and 29 other genes were identified as additional genes. Interestingly, the analysis of the known or putative location of these genes using PSORT (31) or SignalP (6) revealed that 50.5% were predicted to be exported or secreted proteins, suggesting a possible role in bacterium-host or tissue interactions.
The categorization of these 180 genes according to their functions revealed that 65 genes were of unknown function or of no similarity. Among the 36 genes involved in cell envelope and cellular processes, all of the genes were found in subcategories such as transport/binding proteins and lipoproteins (24 genes), cell wall (5 genes), sensors (2 genes), membrane bioenergetics (4 genes), and protein secretion (1 gene) (Table 4). These subcategories group together genes that putatively are involved in interactions with the host or the environment. No genes were found in other subcategories, such as cell division, sporulation, germination, and transformation/competence. The 25 genes found in the intermediary metabolism category were evenly distributed among the subcategories. Among the 16 genes involved in information pathways, the majority were found in subcategories such as RNA synthesis (9 genes) and RNA modification (3 genes). Most of the 19 genes found in the category of other functions were classified in subcategories like pathogenic factors (10 genes), adaptation to atypical conditions, and miscellaneous (3 genes). Only six genes were found to be associated with the subcategories phage-related functions and transposon and ISs, confirming the low incidence of MGE in RDs predicted to be host specific.
Selected host-specific determinants are widely distributed among a panel of animal strains. In order to test the hypothesis that the genomic variation identified by WGPS and CGH among a small number of isolates was more widely distributed among isolates of a specific host association, we screened an additional 28 strains of human (n = 4), bovine (n = 12), ovine (n = 9), and caprine hosts (n = 3) for the presence of five selected RDs by PCR (Table 1). RDs representing predicted allelic variation in the ovine and caprine strains were sequenced in the ovine strain 1174, and oligonucleotide primers specific for the RDs were designed. For regions of gene loss, the genome sequence of strain N315 was used to design oligonucleotides for PCR (Table 2).
PCR indicated the presence of each region of difference in a large proportion (62.5 to 100%) of the ovine-caprine strains examined and their absence (or weak occurrence) in strains of human and bovine origin, indicating ovine and caprine host specificity (Fig. 3, Table 5). Of note, strains 1524 (caprine origin) and 1231 (bovine origin) gave expected PCR results, consistently with that predicted by their host origin, even though they were more closely related to strains from other animal origins, as determined by PFGE analysis (see Fig. S2 in the supplemental material).
![]() View larger version (52K): [in a new window] |
FIG. 3. Demonstration of host specificity for six ovine-caprine traits. PCR tests were run on 40 different strains isolated from human (hum) (8), bovine (bov) (16), and ovine-caprine (ov-cap) (16) hosts. PCR products were analyzed by electrophoresis on agarose gels. Host origins are indicated at the top of each gel. In the gels on the right, M indicates the RaoulI linear DNA marker (Appligene; Qbiogene, Strasbourg, France). In the gels on the left, M indicates a 2-log ladder linear DNA marker (New England Biolabs, Ozyme, France).
|
|
View this table: [in a new window] |
TABLE 5. Description of six ovine-caprine-specific regions
|
|
|
|---|
The majority of RDs among isolates from host-specific lineages are located in the core genome. In the present study, we investigated the genetic basis of S. aureus host specificity by comparing the genome content of strains causing mastitis in ruminants to the genome content of representative human isolates. A total of 180 CDSs were specific for lineages associated with ruminant hosts, including 61 CDSs that were found to be specific for strains infecting small ruminants (goat and sheep). Unexpectedly, our analyses indicated that the majority of host-specific differences were located outside of predicted MGEs that together make up the accessory genome and instead belong to the core genome. Lindsay et al. recently reported the existence of such variable core genes in community-acquired invasive isolates and nasal carriage isolates of human origins. For each dominant lineage, they found several unique combinations of core variable genes, suggesting a common ancestor followed by evolutionary divergence (28). Only 23 of the ruminant lineage-specific CDSs identified in our study (e.g., lytN, oppF, fmhC, and mtlF) were found in the list of 728 core variable genes determined by Lindsay et al., suggesting that distinct sets of genes have undergone diversification in different host-specific lineages.
Combining WGPS and CGH results, we discovered that some genes are absent from ovine-caprine strains but present in isolates from other hosts. For example, we found that SA0170, encoding a hypothetical protein, and SA0171 (fdh, encoding an NAD-dependent formate dehydrogenase) are absent from ovine-caprine isolates but are encoded together in an operon in human and bovine isolates (49). SA0170 previously was shown to be up-regulated upon internalization in human epithelial cells (14), and both genes were more upregulated in biofilm than planktonic culture conditions (38). These results suggest that these genes are involved in interactions with and persistence in the human host. Further functional analyses are required to check whether the operon is dispensable for the infection of ovine or caprine hosts and whether its loss is a true host-specific adaptation. We also identified seven genes that were present in all S. aureus strains tested but contained allelic variation in ovine and caprine strains. Of these, six genes encoded proteins of unknown function, including five that are predicted to be membrane associated or secreted. Of note, SA0647 is found in several other staphylococcal species, including Staphylococcus haemolyticus, Staphylococcus saprophyticus, and Staphylococcus epidermidis, and is upregulated by mgr, a global regulator of gene expression in S. aureus (29). Also, gene SA2478, part of a four-gene operon, also is induced upon internalization in human epithelial cells (14). The RDs specific for ovine and caprine strains included lytN, a gene encoding a 372-amino-acid protein with a putative muramidase activity that may be involved in autolysis (37, 43) and that is downregulated by mgr (29) and upregulated by rot (40).
Importance of predicted secreted and exported proteins in host-specific determinants.
Remarkably, we discovered that 91 (50.5%) of the putative host-specific determinants were predicted to be extracytoplasmic, indicating a possible role in host-pathogen interactions. These proteins represent a very high proportion of the 130 to 145 proteins (i.e.,
5% of the entire predicted proteome of S. aureus) predicted to be exported or secreted by S. aureus in a secretomic study (41). Similarly, in human isolates, Lindsay et al. found that many core variable genes were known or predicted to be expressed on the bacterial surface and to interact with the host during nasal colonization and infection (28). Recently, Herron-Olson et al. reported that bovine strain RF122 genes encoding surface proteins predicted to be involved in host interactions show higher-than-average rates of nonsynonymous substitution and gene decay relative to those of their homologs in the sequenced human isolates (18). They also showed that some virulence-associated genes, like clfA, encoding clumping factor, exist as pseudo-genes in the RF122 genome (18). We found that clfA represented an allelic variant in RF122 and ovine-caprine strains. Taken together, these data are consistent with an important role for the conserved secretome in host adaptation.
Identification of a clonal lineage of S. aureus that is specific for ovine and caprine hosts. On the basis of PFGE and MLST analysis, strains isolated from sheep and goats were closely related, which is consistent with previous findings (47). CGH and WGPS also indicated the close relatedness of ovine-caprine strains, whereas bovine isolates contained much greater variation in gene content, consistently with previous studies of the virulence gene content of bovine mastitis-associated isolates (47, 50). In contrast to bovine mastitis, which usually is subclinical, S. aureus ovine mastitis typically is clinical in nature, whereas coagulase-negative staphylococci are associated with subclinical mastitis (8). It is possible that variation in S. aureus genome content specific for the ovine-caprine clone contributes to the increased severity of infection associated with sheep and goat mastitis.
Three of the four strains isolated from cows were closely related to each other, whereas strain RF122 (a strain that reproducibly induced clinical mastitis in experimental infections) was more closely related to the ovine-caprine strains. RF122 was found to share more variable CDSs with the ovine-caprine clone than with the other bovine strains, as indicated in a previous CGH study (14). In fact, we found that 117 genes varied similarly in RF122 and the ovine and caprine strains but not in all other strains. In agreement with MLST and PFGE results, these data suggest that some bovine strains are more closely related to the ovine-caprine-specific clone than to the other bovine strains. Therefore, it is possible that the determinants found in common for RF122 and ovine-caprine strains likely are due to their similar clonal origin. However, this does not rule out the possibility that some of these differences represent adaptations to the ruminant mammary gland, even though they are not shared with the other bovine strains of more distant relatedness.
All of the strains of goat and caprine origin were found to be closely related. It is apparent from our work and that of others that there is a single major lineage of S. aureus associated with mastitis infections of sheep and goat (30). At this point, it is not possible to identify which of the RDs represent true host-specific genetic adaptations and which are only lineage specific. However, host-specific determinants selected in this study gave a PCR profile consistent with their host origin in the vast majority of strains tested, notably for a caprine strain that grouped in a bovine-associated clonal lineage (as determined by PFGE analysis) and a bovine strain that grouped in an ovine-caprine cluster.
This study highlights the extensive variation that exists among human and animal isolates of S. aureus. We found that the core genome contains the majority of RDs that were specific for ruminant strains, including many that encoded surface-associated or extracellular proteins. These loci represent excellent candidates for studies of the molecular basis of S. aureus host specificity.
N.B.Z. was the recipient of a Ph.D. grant from the French Ministry of Research and Education. P.D.A. is the recipient of a CAPES grant from the Brazilian government (CAPES-COFECUB project 539/06). J.R.F. is supported by the Biotechnology and Biological Sciences Research Council, United Kingdom, research grant BB/D521222/1.
Published ahead of print on 20 June 2008. ![]()
Supplemental material for this article may be found at http://jb.asm.org/. ![]()
Present address: Laboratoire D'Études et de Recherche en Pathologie Équine, AFSSA, Site de Dozulé, Service Bactériologie-Immunologie, Route de Goustranville, 14430 Dozulé, France. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»