Previous Article | Next Article ![]()
Journal of Bacteriology, January 2005, p. 576-592, Vol. 187, No. 2
0021-9193/05/$08.00+0 doi:10.1128/JB.187.2.576-592.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Department of Microbiology and Immunology, University of Arkansas for Medical Sciences, Little Rock, Arkansas,1 Wyeth Research, Pearl River, New York,2 Wyeth Research, Cambridge, Massachusetts3
Received 19 April 2004/ Accepted 5 October 2004
|
|
|---|
|
|
|---|
Much of the research aimed at identifying novel therapeutic targets for the treatment of S. aureus infections has been directed at identification of specific virulence factors and regulatory circuits that are relevant to the disease process. Almost invariably, these studies used the prototypic lab strain NCTC 8325 or its derivatives. The single most widely studied strain is RN6390, which is an 8325-4 strain generated by curing three prophage from NCTC 8325 (1, 12, 45). However, RN6390 has an 11-bp deletion in rsbU, which encodes a positive regulator essential for the activity of the stress response sigma factor SigB (33). This raises the possibility that regulatory networks defined in studies focusing on RN6390 are not representative of the situation observed with clinical isolates, and we recently confirmed that this is, in fact, the case. For instance, we demonstrated that mutation of the staphylococcal accessory regulator (sarA) has a disparate phenotypic effect in RN6390 by comparison to clinical isolates (6). Perhaps more importantly, direct comparisons in our murine septic arthritis model indicate that RN6390 is less virulent than our prototype clinical isolate (7), and preliminary experiments suggest that this is also true with our rabbit model of postsurgical osteomyelitis (data not shown).
Even among clinical isolates, there is both genotypic and phenotypic evidence to suggest that a limited number of clonal variants are of particular significance. For instance, Booth et al. (8) used pulsed-field gel electrophoresis to examine 405 clinical isolates from various types of infection and diverse geographic locations and identified 90 different lineages. Two of these accounted for more than half of all infections. One of the genetic characteristics that best defined the two most prevalent lineages was the presence of cna, which encodes the primary, if not the only, collagen binding adhesin produced by S. aureus (23). Specifically, cna was present in more than 93% of the strains in the two most prevalent lineages. This is significant in that cna is present in less than half of all S. aureus strains (24). In another study, Peacock et al. (49) examined the presence of genes encoding 33 putative virulence factors in 334 S. aureus isolates and found that seven genes (fnbA, ica, sdrE, sej, eta, hlg, and cna) were significantly more common in invasive isolates than in carriage isolates. With respect to phenotypic characteristics, Papakyriacou et al. (48) found that the most prevalent clinical isolates of S. aureus have a phenotype characterized by a high binding capacity for host proteins and limited production of extracellular virulence factors (e.g., toxins and degradative enzymes). It was suggested that such strains may predominate because they have a phenotype that favors the colonization phase of infection (48).
We have focused our efforts on clinical isolates of S. aureus, most notably UAMS-1 and UAMS-601. UAMS-1 is an oxacillin-susceptible strain (OSSA) isolated from the bone of a patient suffering from osteomyelitis (22). UAMS-601 is an ORSA isolate responsible for an outbreak in the neonatal intensive care unit of our hospital (64). Importantly, both of these strains share phenotypic and genotypic properties associated with the predominant clonal lineages discussed above. Specifically, both are cna-positive strains that also contain other genes that Peacock et al. (49) found were associated with invasive isolates, and both have a high binding capacity for host proteins and produce limited amounts of most exoproteins (6, 23). Both of these strains have an intact rsbU locus, and both are highly virulent in our animal models of musculoskeletal infection (6).
The characteristics that define the UAMS isolates and contribute to their virulence in our animal models may be due to unique genetic content and/or differential expression of the regulatory circuits that control the coordinated production of conserved virulence factors. In this study, we address the first of these possibilities. Although survey studies like those reported by Peacock et al. (49) have provided important information toward this goal, such studies are by necessity limited to studying particular loci of interest. In fact, few studies have addressed the overall genomic diversity among clinical isolates of S. aureus. An exception is the work of Fitzgerald et al. (20), who used microarray-based comparative genomic hybridizations (CGH) to define 18 "regions of difference" among several S. aureus strains. The microarray used in these experiments represented >90% of the COL genome. They found that individual strains lacked between 22 and 332 of the COL genes and that 78% of the genome was invariant. However, the converse experiments could not be performed, and the authors noted that "identification of genes present in ... test strains but absent in COL is likely to provide new insight into the molecular mechanisms of pathogenic clone adaptation to specialized niches." To date, little has been done in this regard, but the completed sequencing of additional S. aureus strains, including several clinical isolates, has allowed the development of more-comprehensive microarrays that can potentially be used to address this issue.
To this end, we carried out CGH experiments using two sets of microarrays that collectively represent the genomes of seven sequenced strains of S. aureus (COL, NCTC 8325, Mu50, N315, MW2, EMRSA-16 [SANGER-252], and SANGER-476). Hybridizations were done with genomic DNA from 10 strains, including the two virulent UAMS isolates (UAMS-1 and UAMS-601), the prototypical lab strain RN6390, and all seven sequenced strains. Inclusion of the sequenced strains assisted the development of cutoffs to determine the presence, absence, or divergence of open reading frames (ORFs) in the nonsequenced strains. We also included epidemiological typing methods to place our clinical isolates into well-defined epidemiological paradigms. Collectively, this allowed us to define the unique genetic content among clinically important strains of S. aureus, to cluster strains according to genetic similarity, and to extrapolate our findings to epidemiologically related strains worldwide.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Strains
|
DNA isolation, labeling and hybridization. For the PFGRC arrays, genomic DNA was isolated as described previously (40). The integrity of all DNA preparations was checked by PCR, using strain-specific primers (cna, sarA, or mecA) (Table 2). Genomic DNA was digested to completion with EcoRI (New England Biolabs, Beverly, Mass.) and repurified by using the QIAGEN QIAquick PCR purification kit (QIAGEN, Valencia, Calif.). Digested DNA (1 to 2 µg) was labeled with either Cy3 or Cy5, using the Genisphere Array 350RP kit (Genisphere, Hatfield, Pa.) according to the manufacturer's instructions. Hybridization was carried out for 12 h at 55°C with constant mixing, using the Ventana Discovery System hybridization station (Ventana Medical Systems, Tucson, Ariz.). This was followed by an additional 2-h hybridization at 55°C with Cy3- and Cy5-labeled Genisphere dendrimers as per the manufacturer's protocol and a final wash with 2x SSC (1x SSC is 0.15 M NaCl plus 0.015 M sodium citrate) at 37°C. Each experiment was run as a competitive hybridization by using Cy3-labeled DNA from one of the 10 strains of interest and Cy-5-labeled DNA from COL. Additionally, experiments focusing on UAMS-1, UAMS-601, RN6390, and N315 were repeated with the opposite fluorophore to rule out any dye-specific differences in hybridization pattern. Finally, we repeated the hybridization of UAMS-1 versus COL a total of four times to assess the reproducibility of our results.
|
View this table: [in a new window] |
TABLE 2. Primers
|
Array scanning and analysis. Hybridized PFGRC arrays were scanned using the Perkin-Elmer ScanArray 5000 and read using ScanArray Express software (Perkin-Elmer, Boston, Mass.). Raw fluorescence intensity values were corrected for background by using ScanArray Express software and subsequently normalized by using the Lowess algorithm. Normalized values were expressed as a ratio of test strain intensity to COL strain intensity. These ratios were imported into Spotfire DecisionSite software (Spotfire, Somerville, Mass.) for graphical visualization and further analysis. The data were filtered for low spot signal intensity by using a threshold of the mean plus one standard deviation of the intensity observed from all empty and control spots on the array. Spots having COL channel normalized signal intensity less than this value were excluded from further analysis. However, the 111 unique spots from strains MW2, N315, and Mu50 were not subjected to this threshold, since these would be expected to have low signal intensity in the COL channel.
To develop cutoffs to determine "present" versus "absent" or "divergent" ORFs in our experiments, we included comparisons of each of the sequenced S. aureus strains versus COL. This allowed us to compare our microarray data with the results of BLAST comparisons. Specifically, BLASTN was used to compare the sequence of each ORF on the PFGRC array to the genomes of Mu50, N315, and MW2. This allowed for categorization of each ORF according to percent identity to COL, Mu50, N315, and MW2 sequences. Each ORF on the array was then categorized as present (
90% nucleotide sequence identity), divergent (70 to 90% nucleotide sequence identity), highly divergent (
70% nucleotide sequence identity), or absent (no hit based on a minimum alignment of 40 bp). For our analysis, the "absent" and "highly divergent" categories were subsequently combined under the heading of "absent." Using this information, the following cutoffs were determined based on the ratio of normalized test strain intensity to normalized COL strain intensity: "absent," ratio of <0.3; "divergent," ratio of
0.3 but <0.5; "present," ratio of
0.5. However, because the signal in the COL channel was low for those ORFs from strains other than COL, the criteria above were modified for these ORFs as follows: "present," ratio of
2.0; "absent," ratio of <2.0. Error rates were calculated based on the agreement of results from the BLAST-based versus the array-based categorization (see results).
Hybridized Affymetrix GeneChips were scanned as previously described (15). Signal intensities for elements tiled onto each chip were normalized to account for loading errors and differences in labeling efficiencies by dividing the log-transformed signal intensity of each qualifier (putative ORF or intergenic region) by the mean signal intensity for each individual chip. Results were analyzed with GeneSpring version 6.0 (Silicon Genetics, Redwood, Calif.) and Spotfire Decision Site. Based on the normalized signal intensities of each element, adjusted-call predictions were made to determine if an element was present, absent, or indeterminate (marginal) for UAMS-1, UAMS-601, and each of the sequenced S. aureus strains as previously described (16). Using this methodology,
97% of ORFs were correctly determined to be absent or present for each of the sequenced strains.
Hierarchical clustering. Clustering was performed with PFGRC microarray data, using Spotfire Decision Site software. Consensus "present," "absent," and "divergent" calls for all CGH experiments were converted to integers and subjected to the complete linkage clustering method with the correlation similarity measure and average value ordering function. Calls for COL were based on the updated annotation of the microarrays as discussed above. Hierarchical clustering with the Affymetrix GeneChip data was performed by using GeneSpring version 6.0's Pearson correlation for all samples to develop a dendrogram that simultaneously compared the normalized signal intensity of each qualifier for a given strain to the signal intensity of the same qualifier across all strains analyzed.
Southern blotting for verification of microarray data. DNA probes for Southern blotting were generated by PCR, using the primers shown in Table 2. Amplified fragments were labeled by using the Roche DIG-High Prime kit (Roche, Basel, Switzerland). Genomic DNA preparations were digested to completion with either EcoRI or EcoRI and AluI (sarT and sarU blots). Blots were then transferred, hybridized, and developed as previously described (64).
Epidemiological typing. Multilocus sequence typing (MLST) was performed for strains UAMS-1, UAMS-601, and RN6390 according to the method of Maiden et al. (38). Primer sequences (Table 2) and specific protocols can be viewed at www.mlst.net. Amplification products were sequenced in triplicate to ensure their identity. The MLST sequence types for all of the sequenced strains were kindly provided by B.N. Kreiswirth (Public Health Research Institute, Newark, N.J.). SCCmec typing (46) was performed via multiplex PCR to identify polymorphisms in the staphylococcal chromosomal cassette region containing the mec element. Typing based on polymorphisms within the staphylococcal protein A gene (spa) was performed as previously described (62).
|
|
|---|
![]() View larger version (12K): [in a new window] |
FIG. 1. Dendrogram illustrating genetic relationships between strains. The dendrogram was derived from PFGRC array data based on Spotfire hierarchical clustering. MLST for UAMS-1, UAMS-601, and RN6390 was performed according to the method of Maiden et al. (38). The spa genes for UAMS-1, UAMS-601, and RN6390 were amplified, sequenced, and typed according to the method of Shopsin et al. (62). Agr subgroups were determined by direct sequence analysis of the agr locus. The percentage of genes present or absent in each strain by comparison to UAMS-1 was determined from final PFGRC microarray results.
|
Adhesins. The PFGRC array used in these experiments included spots corresponding to 15 recognized adhesin genes. Unfortunately, the collagen adhesin gene cna was not included. Since the presence of cna correlates well with virulence (8, 49), we assessed its presence in our strains by Southern blotting. The cna gene was present only in UAMS-1, UAMS-601, EMRSA-16, MW2, and SANGER-476 (data not shown). These results were confirmed by Affymetrix data and correspond to our hierarchical clustering (Fig. 1), suggesting that cna may, in fact, represent a marker of overall genetic relatedness. Other adhesin genes that were not represented on the PFGRC array included those encoding the plasminogen, vitronectin, and thrombospondin binding proteins. Of the adhesin genes that were included on the PFGRC array, the gene encoding clumping factor B (clfB) was unambiguously present in 8 of 10 strains by microarray analysis. However, Southern blotting confirmed the presence of clfB in all 10 strains (Fig. 2). Similarly, the gene encoding clumping factor A (clfA) was classified by microarray as absent in N315 and Mu50, but direct sequence analysis confirmed that clfA is present in both of these strains. These results are consistent with experiments done with the Affymetrix GeneChips, which contains three clfA alleles and four clfB alleles. More directly, hybridizations done with the Affymetrix arrays confirmed that clfA and clfB were present in all 10 strains. Furthermore, RN6390, 8325, COL, MW2, and SANGER-476 were found to contain the same clfA allele (BAB94629, as did the Mu50/N315 cluster (BAB569731) and the UAMS-1/UAMS-601/EMRSA-16 cluster (86.7% amino acid identity to BAB41975. All 10 strains also encoded the fibrinogen binding protein COL-SA1168; however, the fibrinogen-binding precursor-related protein (COL-SA1169) was absent or divergent in UAMS-1, UAMS-601, and EMRSA-16.
![]() View larger version (63K): [in a new window] |
FIG. 2. Southern hybridization for validation of array data. Genomic DNA isolated from the strain indicated above each lane was hybridized with probes corresponding to the genes shown to the right of each panel. The sarU probe used in these experiments also included a region corresponding to sarT, while the sarT probe was limited to sarT itself.
|
Peacock et al. (49) also found that fnbA, which encodes a fibronectin-binding protein, was significantly more common in invasive isolates. This is one of two very similar genes (the other is designated fnbB) that, in those strains in which both are present, are contiguous within the S. aureus chromosome. The fact that fnbA, but not fnbB, was more common among invasive isolates therefore suggests that such strains do not encode fnbB. Although two strains (UAMS-601 and SANGER-476) initially gave a "divergent" result for fnbA, it was subsequently confirmed by PCR that fnbA was present in all 10 strains (Fig. 3). In contrast, fnbB was classified as absent by microarray in UAMS-1, UAMS-601, EMRSA-16, SANGER-476, and Mu50. Analysis of the TIGR Comprehensive Microbial Resource indicated that Mu50 does indeed contain fnbB (NT02SA2496); however, the specific allele present in Mu50 was not represented on the array. Direct sequence analysis also indicated that MW2 and N315 have genes that share 80 and 84% nucleic acid identity with fnbB, respectively. We therefore undertook a PCR-based approach to confirm the presence of fnbB in each strain, using primers designed to amplify regions specific to the fnbB allele (Table 2). By PCR, fnbB was present in 8325, RN6390, COL, MW2, and SANGER-476 (Fig. 3). Mu50 and N315 also had a positive PCR, although the region amplified was larger. However, fnbB was absent in UAMS-1, UAMS-601, and EMRSA-16 based on PCR results (Fig. 3), as well as sequence analysis in the case of EMRSA-16. Affymetrix data confirmed these results, since all strains except UAMS-1, UAMS-601, and EMRSA-16 contained one of the three fnbB alleles present on the Affymetrix GeneChip.
![]() View larger version (26K): [in a new window] |
FIG. 3. PCR verification of fnbA and fnbB. Genomic DNA isolated from the indicated strains was used as a template in PCRs with primers that amplify unique regions of fnbA or fnbB. The results observed with UAMS-1, COL, and N315 are representative of the results observed with the other seven strains as described in the text.
|
|
View this table: [in a new window] |
TABLE 3. Adhesins
|
![]() View larger version (14K): [in a new window] |
FIG. 4. Microtiter plate assay of biofilm formation. Biofilm formation was assessed as described by Beenken et al. (5). Results are shown as absorbance at 560 nm (A560) and represent the average and standard errors of the means from nine independent experiments. Strain U929 (UAMS-1 sarA mutant) was included as a negative control. Distribution of cna and sasG among each of the 10 strains was confirmed by microarray results, Southern hybridization, and BLAST searches. Capsular serotype was determined from microarray results for all strains, and confirmed in UAMS-1 by serotyping analysis (Chia Y. Lee, personal communication). The designation "d" indicates that MW2 and SANGER-476 contain a divergent form of sasG (53).
|
|
View this table: [in a new window] |
TABLE 4. Exoenzymesa
|
Toxins. The genetic analysis of toxin genes is complicated by the occurrence of a large number of variants that share considerable homology with each other (42). Four enterotoxin ORFs (seb, sei, ent, and an ORF for an enterotoxin family protein) were represented on the version 1 PFGRC arrays used in these experiments. The Affymetrix GeneChip included probe sets representing a total of 19 enterotoxin genes, many of which were represented by multiple alleles. We found that only COL encoded seb, as represented on both the PFGRC array and Affymetrix GeneChip (Table 5). Both sei (COL-SA0887) and ent (COL-SA0886) were present only in MW2, SANGER-476, and COL, according to the PFGRC arrays. However, the Affymetrix arrays also included a second allele for sei (N315-SA1646) that was present in UAMS-1, UAMS-601, EMRSA-16, Mu50, and N315. A search of the TIGR Comprehensive Microbial Resource database shows that the protein with highest homology to SEI in MW2 is designated SEG2. Similarly, the protein with highest homology to ENT in MW2 is designated SEK. It is therefore unclear whether our microarray analysis has failed to discern the genes encoding two antigenic types of enterotoxin or if the annotation is simply inconsistent.
|
View this table: [in a new window] |
TABLE 5. Toxins and superantigensa
|
Eleven exotoxin-like ORFs were represented on the PFGRC arrays. Similar to the enterotoxin genes, the exotoxin-like (set) genes of S. aureus also share considerable homology and a confusing nomenclature. The annotation for these genes on the PFGRC microarray is confusing as well. For example, three ORFs on the version 1.0 PFGRC arrays were described as encoding exotoxin 3 (COL-SA0468, COL-SA0478, and COL-SA1180). However, products of SA0468 and SA0478 were most closely related to exotoxin 6 and exotoxin 15 of N315, respectively. Similarly, two ORFs (COL-SA0469 and COL-SA1178) were described as encoding exotoxin 2, and the product of one of these (SA0469) was homologous to exotoxin 7 of N315. Exotoxin-like protein genes set13 and set14 were homologous to set5 and set4 of COL, respectively. Microarray data from such homologous genes can complicate interpretation; nevertheless, the exotoxin-like genes 1, 2, 4, 6, 8, 9, 14, and 15 were all classified by the PFGRC microarray as absent or divergent in UAMS-1, UAMS-601, and EMRSA-16. However, the Affymetrix GeneChips were more comprehensive than the PFGRC arrays and included a total of 25 exotoxin-like genes. Multiple alleles were included for some of these, and alleles of exotoxin-like genes 1, 2, and 4 were present in UAMS-1, UAMS-601, and EMRSA-16. Given the functional similarity among the products of these genes and the presence of multiple exotoxin-like genes in every strain, it is certainly possible that these genotypic differences are phenotypically transparent and merely represent allelic variations. More directly, it seems likely based on our data for both the enterotoxin and exotoxin-like genes that all of the strains examined in these experiments have the capacity to produce at least one superantigen. As with the enterotoxin genes, a PCR-based approach may be more efficient for accurately determining the presence or absence of specific exotoxin-like alleles.
The genes encoding exfoliative toxins A (eta) and B (etb) are not present in COL and were not present on the version 1.0 PFGRC array. Both were present on the Affymetrix GeneChip, and our hybridization experiments indicated that neither eta or etb was present in any of the 10 strains studied. However, a gene (COL-SA1184) that encodes a protein with 80% amino acid identity to the product of an S. hyicus gene that is described as an exfoliative toxin was present in all strains. This gene was previously incorrectly designated an exfoliative toxin A gene in the COL genome (and PFGRC arrays), but this designation has since been corrected.
S. aureus may also encode the genes for several bicomponent leukotoxins, including the genes encoding Panton-Valentine leukocidin (PVL), gamma hemolysin, and lukD/E. This is important because leukotoxin production has been associated with some specific disease manifestations including life-threatening pneumonia (25, 36). The PFGRC arrays contained ORFs corresponding to lukDE and gamma hemolysin but not the PVL genes. The genes encoding PVL (lukF-PV and lukS-PV) were included on the Affymetrix arrays and were present only in MW2. This is consistent with previous reports linking the PVL genes with community-acquired isolates (14, 25). The genes encoding gamma hemolysin, component A, and several other putative hemolysins contained on the array were present in all 10 strains. However, lukDE was classified as absent in UAMS-1, UAMS-601, and EMRSA-16, and this was subsequently confirmed by Southern blotting (Fig. 2). This is not surprising considering that lukD and lukE are contained on RD1 (20), which contains other genes that are also absent in the UAMS strains and EMRSA-16 (see above). However, as noted above, the redundant nature of S. aureus virulence factors makes it difficult to predict whether the absence of either or both of these genes would have a significant impact on the phenotype of these strains.
Notably, the gene encoding toxic shock syndrome toxin 1 (tst) was not included on the PFGRC microarray. However, Affymetrix data reveal that this gene is present in UAMS-1, UAMS-601, N315, and Mu50 but not EMRSA-16. This is one of the few differences observed between the UAMS isolates and EMRSA-16.
Regulatory elements. Regulatory circuits in S. aureus are complicated and involve a number of interactive elements. Most notable among these is the staphylococcal accessory regulator (sarA) and the accessory gene regulator (agr), both of which have been correlated with virulence (7, 10). However, recent reports have described a number of other loci (Table 6), most of which appear to function, at least in part, by modulating the activity of sarA and/or agr (9, 45). The agr locus is known to have strain-dependent differences in nucleotide sequence, particularly in agrD and specific regions of agrB and agrC, and these differences define agr subtypes (29). Importantly, these subtypes have functional significance as well, since components of the agr locus from one group may inhibit the agr response in another group (29). Consistent with these observations, we found that the agrB and agrC genes were classified as divergent or absent in several strains by PFGRC microarray analysis. All of the agr genes were classified as present in UAMS-1, RN6390, and 8325, which could indicate that these strains share the same agr subtype as COL whereas the other strains represent different groups. However, for UAMS-1 two of four replicate experiments gave ambiguous results for the agrC gene (SA2025), and results from the Affymetrix GeneChip experiments indicated that SA2025 was not present in UAMS-1 (data not shown). Since COL is agrC subtype I, this is consistent with our previous agr subgroup typing data indicating that both UAMS-1 and UAMS-601 are agrC subtype III (6). The Affymetrix GeneChips were more comprehensive than the PFGRC arrays with respect to the agr locus, since they included representative alleles from each of the four major agr subgroups. We found the Affymetrix data to be correct in predicting agr subtype, since it was identical to subgroup delineation by direct sequence comparison (Fig. 1).
|
View this table: [in a new window] |
TABLE 6. Regulatory elementsa,b
|
Most of the other recognized regulatory components were conserved among all strains. This includes rot, yyc, arl, sae, lrg, vra, svr, and srr. Several families of putative transcriptional regulators were also represented on the array, and the majority of these were also conserved among all strains (Table 6). However, some strain-specific differences were seen. For example, COL-SA0072 and COL-SA0074 are members of the LysR family of transcriptional regulators, and both of these ORFs were absent in UAMS-1, UAMS-601, EMRSA-16, Mu50, and N315. A similar pattern was seen with the MarR and GntR families of transcriptional regulators. For example, COL-SA2524 encodes a transcriptional regulator of the MarR family, and was present only in COL, 8325, and RN6390. Similarly, SA0091 encodes a transcriptional regulator of the GntR family and was present only in COL, 8325, RN6390, Mu50, and N315 (Fig. 2). Other members of the GntR and MarR families of transcriptional regulators were represented on the array but were conserved among all strains.
Other virulence factors. Fitzgerald et al. (20) found eight genes that are required for bacteriocin (epidermin) synthesis in the same region of difference (RD1) as the spl genes and lukD/E. Consistent with this observation, the array data from both PFGRC microarrays and Affymetrix GeneChips confirms that UAMS-1, UAMS-601, and EMRSA-16 lack the epidermin biosynthesis genes (Table 7). Epidermin is a type A lantibiotic (58), which is an antimicrobial peptide produced by gram-positive organisms and active against other gram-positive organisms. The epidermin ORFs were also absent in Mu50 and N315, indicating a possible trend among hospital-acquired isolates. The major histocompatibility complex class II-like molecules represented by SA0089 (antigen, 67 kDa), SA0985, and SA2197 (putative map protein) were conserved among all strains. Another map protein ORF, SA2002, was classified as absent among SANGER-476, Mu50, MW2, and N315. Strain-specific differences were also seen with the gene encoding immunodominant antigen B (isaB), which was absent in UAMS-1, UAMS-601, and EMRSA-16 (FIG. 2). In contrast, the gene encoding immunodominant antigen A (isaA) was conserved among all strains. These proteins were identified as immunodominant based on Western blots done with sera from humans with methicillin-resistant S. aureus sepsis (37). The absence of isaB in the UAMS strains and EMRSA-16 is therefore intriguing in that it could be correlated with immune evasion.
|
View this table: [in a new window] |
TABLE 7. Other virulence factorsa
|
|
|
|---|
Both to investigate the genetic basis for these differences and to facilitate our ability to genetically manipulate UAMS-1, we used CGH with a genome-scale, amplicon-based microarray to assess the genetic relationships between two clinical isolates (UAMS-1 and UAMS-601), the laboratory strain RN6390, and seven sequenced strains of S. aureus. A central concern in studies of this nature is the assignment of cutoffs to define "present" versus "absent" ORFs based on the hybridization ratio of a test strain to the reference strain used to generate the array. An emerging theme in CGH experiments that include sequenced strains is the use of BLAST comparisons to define empirical cutoffs for microarray data (51, 67). We utilized this approach to develop empirical cutoffs and found a high level of agreement (93.6 to 97.6% depending on the strain under study) between our empirical cutoffs and the BLAST-based comparison data.
It is not possible to eliminate all ambiguity in microarray-based CGH experiments. Some of this ambiguity arises from sequence divergence between the strains used to generate the array and the strains under study. Other limitations include errors generated from poor microarray printing, inefficient labeling of genomic DNA, improper data normalization, or poor hybridization of labeled probes. Indeed, we found 104 spots (4.04%) in which the majority of experiments failed the low signal intensity threshold in the COL channel; that is, the presence or absence of the ORF could not be unambiguously determined in multiple strains because the COL channel intensity failed to meet the mean-plus-one-standard-deviation requirement. An additional 125 (4.64%) spots on the array failed the low signal threshold in at least one experiment. For those strains with available genome sequence, these spots could be examined by direct sequence comparison. However, for strains RN6390, UAMS-1, and UAMS-601, it was necessary to use alternative means to address these discrepancies. We first attempted to account for microarray failures through the analysis of comparative data from Affymetrix GeneChip experiments. Although the Affymetrix GeneChip generally included a greater number of allelic variants, allowing for enhanced specificity relative to the PFGRC arrays, the results from corresponding ORFs on each set of arrays compared very favorably (93.3 to 98.5%). For strain UAMS-1, 2,444 spots from the PFGRC arrays were also represented on the Affymetrix GeneChip, and 2,281 (93.3%) of these had identical calls between PFGRC and Affymetrix. For strain UAMS-601, 2,473 PFGRC spots were represented on the Affymetrix arrays, and of these, 2,321 (93.85%) had identical calls. For strain RN6390, 2,422 of 2,460 detection calls (98.5%) were in agreement between the PFGRC and Affymetrix arrays. The increased accuracy observed with RN6390 by comparison to the UAMS strains may reflect the fact that both the PFGRC and Affymetrix arrays are based primarily on sequence data from the COL genome and the close relationship between COL and RN6390 (52). Indeed, COL and other isolates with the MLST sequence type ST250 are thought to be descendant from isolates with an MLST sequence type of ST8, which includes RN6390 and NCTC8325 (18).
Importantly, the experiments utilizing the PFGRC microarrays were performed independently of those utilizing the Affymetrix GeneChips, with unique methods for data analysis and cutoff determination. This approach allowed for not only a more comprehensive analysis but also a greater level of confidence in the appropriateness of our empirical cutoffs. We also tried to resolve any remaining ambiguities by using Southern hybridization and/or PCR. Indeed, we believe the results of our experiments confirm the need to combine the high-throughput advantage of CGH with confirmation using more gene-specific approaches, particularly in those cases involving divergent genes thought to be of particular clinical significance.
We found that 2,029 (78.4%) of the 2,589 spots on the PFGRC array that yielded results above the low signal threshold were present in all strains and thus represent the core genetic content of S. aureus. This number is essentially identical to the results from the work of Fitzgerald et al. (20), who found that 78% of the S. aureus genome was invariant. Given our focus on clinical isolates in general and musculoskeletal isolates in particular, we chose to analyze our results in the specific context of the genomic content of UAMS-1. Our data clearly demonstrate that UAMS-1 is most similar to UAMS-601 and EMRSA-16. Importantly, preliminary testing has confirmed that both UAMS-601 and EMRSA-16 are also hypervirulent in our models (data not shown), and the enhanced virulence of UAMS-601 has also been independently confirmed with a rabbit endocarditis model (Arnold Bayer, personal communication). It is therefore noteworthy that clustering based solely on genotypic information could be correlated to relative virulence, at least with respect to the two clusters that defined the extremes in our analysis (e.g., those that included UAMS-1 and RN6390, respectively). Indeed, hierarchical clustering based on microarray data fit very well with the clusters defined by other epidemiological typing methods and with studies suggesting the existence of specific clonal variants of particular clinical significance. For example, EMRSA-16 and the closely related strain EMRSA-15 are the dominant ORSA clones in the United Kingdom (43). These clones account for more than 95% of all ORSA bacteremias in the United Kingdom (30). EMRSA-16 has a core gene profile virtually identical to previously characterized OSSA clones, leading to the hypothesis that EMRSA-16 emerged from an OSSA strain after acquiring an SCCmecA element (43). Interestingly, UAMS-1 and UAMS-601 appear to share a similar relationship. Specifically, UAMS-601 is an ORSA isolate with the same MLST sequence type (ST36) as EMRSA-16, while UAMS-1 is an oxacillin-susceptible strain with an MLST sequence type of 30 (ST30). It has been postulated that ST30 is the ancestral sequence type of ST36 (17). Enright et al. also analyzed 155 isolates in the United Kingdom and found that most ORSA isolates were ST36 and that 87% of the OSSA isolates were closely related to EMRSA-16 and had an MLST genotype of ST30 (17). It is therefore important to focus not only on the dominant ORSA clones but also on the most prevalent OSSA isolates that give rise to these ORSA strains. We believe the data we report clearly indicate that UAMS-1 is a prototype example of such an OSSA isolate. It is also of interest to note that RN6390, COL, and 8325 all clustered closely together and were the most distantly related stains with respect to the UAMS strains and EMRSA-16. This is an important consideration given the experimental emphasis placed on RN6390. Collectively, we believe the results reported here clearly demonstrate that this emphasis is misplaced and that it should be refocused on more relevant strains like the UAMS isolates and EMRSA-16.
Our experiments were strictly genotypic and do not imply phenotypic differences, and they cannot account for point mutations or other small changes that may produce large differences in the expression level and/or function of individual loci. Nevertheless, we believe that our data have important implications with respect to the phenotype of the predominant clinical isolates of S. aureus. We base this belief on the presumed impact such genotypic differences would have on phenotypes previously reported from studies done with other strains of S. aureus. For instance, both sarT and sarU are absent in UAMS-1, UAMS-601, and EMRSA-16. SarT and SarU are reportedly members of a secondary amplification loop that modulates agr expression. Specifically, Manna and Cheung (39) reported that production of RNAIII is elevated in an RN6390 sarT mutant. This observation alone would suggest that the UAMS strains would be expected to produce elevated levels of RNAIII based on the absence of sarT. However, previous results from our laboratory have confirmed that UAMS-1 produces reduced amounts of RNAIII by comparison to RN6390 production (6). At the same time, Schmidt et al. (61) proposed a model in which sarT represses expression of sarU. Since sarU induces production of RNAIII, the absence of sarU in the UAMS strains would be consistent with their reduced expression of agr. Moreover, in this model, mutation of sarT would result in increased expression of agr, but only in the presence of sarU (i.e., in the absence of SarT-mediated repression, SarU would be free to up-regulate expression of RNAIII). In the absence of both sarT and sarU, the result would presumably be defined by the absence of sarU and the inability to induce RNAIII production, at least through this alternative pathway.
SarT is also reported to repress production of alpha-toxin and to induce sarS transcription (9, 45, 61). SarS, in turn, is an activator of protein A production (12, 58). This is not consistent with the observation that the UAMS isolates lack sarT but transcribe sarS at high levels (data not shown) and make an abundance of protein A (6). However, to the extent that the UAMS isolates produce reduced amounts of RNAIII, the increased transcription of sarS in these strains is consistent with the fact that sarS transcription is repressed by agr (11). It is also well established that induction of agr expression results in reduced expression of spa (1, 9, 45). Once again, this would suggest that the phenotype of UAMS-1 and other clinical isolates is defined by the absence of sarT and sarU rather than the impact of sarT alone. At the same time, the absence of sarT alone could explain why mutation of sarA results in decreased hemolytic activity in RN6390 and increased hemolytic activity in the UAMS isolates. Specifically, in the model of Schmidt et al. (61), mutation of sarA represses hla transcription indirectly by virtue of increased expression of sarT. Perhaps in the absence of SarT, SarA is able to bind the hla promoter and repress transcription. This is consistent with reports indicating that purified SarA binds the hla promoter at least under in vitro conditions (13, 65). Assuming that the binding of SarA would result in repression of transcription as suggested by Arvidson and Tegmark (1), mutation of sarA in strains that lack sarT would result in increased transcription of hla and increased hemolytic activity as was observed in both UAMS-1 and UAMS-601.
While our studies were limited to 10 strains, other investigators have found that sarT is absent in many if not most clinical isolates (P. M. Dunman, personal communication). Indeed, Fitzgerald et al. (20) found that a significant number of clinical isolates of S. aureus lack a 5-kb chromosomal region that they referred to as RD5. This region was originally reported to encode an FnbA homologue, two SarA homologues, and a protein with homology to a multidrug resistance protein made by Pyrococcus abyssi. Based on BLAST analysis, the two sarA homologs correspond to sarT and sarU. This emphasizes not only that a significant number of clinical isolates lack sarT but also that these same isolates almost certainly lack the sarU gene as well. This would suggest that such isolates would exhibit phenotypic traits like those observed with the UAMS isolates. This would include both quantitative (e.g., high expression of sarS and spa and low expression of agr and hla) and qualitative (e.g., the impact of mutating sarA on transcription of hla) differences as defined by comparison to RN6390. This is consistent with the results reported by Papakyriacou et al. (48), who found that the most prevalent clinical isolates of S. aureus have a phenotype characterized by a high binding capacity for host proteins and limited production of extracellular virulence factors (e.g., toxins and degradative enzymes). While this could be interpreted to suggest that the role of agr in S. aureus pathogenesis has been overinterpreted based on studies done in RN6390, it must be emphasized that mutation of agr in UAMS-1 has been shown to result in a reduced capacity to cause both osteomyelitis and septic arthritis (6, 22). Nevertheless, it is possible that RN6390 expresses agr at levels that have a negative impact on certain attributes that ultimately contribute to the ability of S. aureus to cause at least some forms of infection. Indeed, Vuong et al. (68) found that strains that produce high levels of RNAIII have a reduced capacity to form a biofilm, and recent studies from our laboratory confirmed that mutation of agr in RN6390 results in an enhanced capacity to form a biofilm (5). Importantly, this was not true for UAMS-1 or UAMS-601. To the extent that biofilm formation is an important attribute in the pathogenesis of S. aureus infection, this is consistent with the hypothesis that agr is functionally overexpressed in RN6390.
Our results suggest that the absence of RD5 may also have a more direct impact on biofilm formation. Specifically, the 10 strains we examined could be divided into two distinct groups, one of which (UAMS-1, UAMS-601, EMRSA-16, MW2, and SANGER-476) formed a more robust biofilm than the other (RN6390, COL, 8325, Mu50, and N315). All of the biofilm-forming strains were serotype 8 strains that encode cna, and three of five (UAMS-1, UAMS-601, and EMRSA-16) also lacked sasG (SA2505) while the other two (MW2 and SANGER-476) encode a divergent form of sasG. The fact that the three strains that lacked any form of sasG also lacked sarT and sarU is consistent with the results reported by Roche et al. (53), who demonstrated that sasG is also included within RD5. With respect to the association between cna and serotype, our results are consistent with the results of Ryding et al. (56), who found that cna was present almost exclusively in serotype 8 strains. However, it is unlikely that either of these is directly associated with biofilm formation based on previous studies from our laboratory demonstrating that at least some cna-negative, serotype 5 strains form a biofilm comparable to that of the UAMS isolates (5). In contrast, sasG encodes an LPXTG-containing protein that is a homolog of Pls, and Savolainen et al. (60) demonstrated that expression of pls is associated with a reduced capacity to bind fibrinogen and fibronectin. Because biofilm formation by UAMS-1 and UAMS-601 in vitro is dependent on coating the substrate with plasma proteins (5), this suggests that the absence of sasG may be directly responsible for the increased capacity of these strains to form a biofilm. If so, then the divergent form of sasG present in MW2 and SANGER-476 would presumably have reduced function by comparison to strains that encode other forms of sasG. SasG has also been associated with an enhanced capacity to colonize nasal epithelial cells (54). This suggests that strains that either lack sasG or encode a functionally divergent form may have a reduced capacity to colonize nasal mucosal surfaces. Alternatively, these strains may utilize an alternative means of colonizing the nasal mucosa. For instance, Weidenmaier et al. (69) has suggested that cell wall teichoic acids are both necessary and sufficient for nasal colonization.
Overall, our results support the hypothesis that UAMS-1, UAMS-601, and EMRSA-16 are closely related strains that are genetically distinct from the more commonly studied laboratory strains, including RN6390. Indeed, Holden et al. (28) noted that EMRSA-16 is the most genetically diverse sequenced strain of S. aureus to date, since approximately 6% of the genome is novel compared with other published genomes. These results, together with the observation that RN6390 carries at least one relevant mutation (e.g., rsbU), clearly emphasize the need to reassess the relevance of studies focusing on 8325-4 strains like RN6390 with use of clinical isolates. Moreover, mounting evidence from other laboratories indicates that certain of the genotypic properties shared by EMRSA-16 and the UAMS strains are characteristic of the most predominant clinical isolates worldwide. These results strongly suggest that these strains are appropriate choices for these experiments. At present, it is difficult to determine which of the specific characteristics shared by these strains make a direct contribution to virulence. This will require mutagenesis and gain-of-function studies, both of which are somewhat complicated by the recalcitrance of clinical isolates to genetic manipulation. Nevertheless, the work presented here establishes a background for future experiments aimed at identifying the relevant virulence factors and the regulatory networks that control them, and having access to a complete genome sequence for a representative strain from this group will greatly facilitate these experiments.
We thank Robin Cline, Chris Mader, Joseph Gilbert, and Scott Peterson for technical support regarding the PFGRC arrays. The availability of strains from the NIAID-sponsored Network on Antimicrobial Resistance in Staphylococcus aureus is also gratefully acknowledged. We also thank the UAMS Microarray Core Facility for technical support, Alan Gies of the UAMS Core Sequencing Facility for DNA sequencing, Mark Enright for providing MLST profiles of all sequenced strains, and Barry Kreiswirth for providing the spa types of all sequenced strains and for generating the spa profile from sequence data for UAMS-1, UAMS-601, and RN6390.
Supplemental material for this article may be found at http://jb.asm.org/. ![]()
|
|
|---|
B in Staphylococcus aureus reveals its function as a global regulator of virulence genes. J. Bacteriol. 180:4814-4820.
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»