Previous Article | Next Article ![]()
Journal of Bacteriology, November 2004, p. 7601-7609, Vol. 186, No. 22
0021-9193/04/$08.00+0 DOI: 10.1128/JB.186.22.7601-7609.2004
Copyright © 2004, American Society for Microbiology. All Rights Reserved.
School of Biological Sciences, University of Wollongong, Wollongong,1 Queensland Institute of Medical Research, Brisbane,2 Menzies School of Health Research,3 and Institute of Advanced Studies, Charles Darwin University, Darwin, Australia4
Received 27 May 2004/ Accepted 11 August 2004
|
|
|---|
|
|
|---|
GAS is a human-specific pathogen and can cause a wide range of diseases, from benign mucosal and skin infections to life-threatening diseases and sequelae, such as acute poststreptococcal glomerulonephritis and rheumatic heart disease (11). Diversity in the repertoire of the genes encoding FBPs may have implications for GAS tissue tropism, persistence within the human host, and the spectrum of diseases that the strains can cause. For instance, Neeman et al. (29) have shown that there is an association between sfbI-positive GAS strains and persistence after antibiotic treatment. Likewise, an association between prtF2 and GAS invasive diseases has been observed (12, 37). SfbI, Sof, and PrtF2 are distinct proteins, and while sfbI and prtF2 are located in the same chromosomal location called the fibronectin-collagen-T antigen (FCT) locus (5), the sof gene is situated outside this locus.
PrtF2 was originally described by Jaffe et al. (18). Subsequently, Rocha and Fischetti (32) described another FBP designated PFBP. PrtF2 and PFBP have very high sequence identity and possess similar domains. More recently, Terao et al. (37) identified FbaB, an FBP from the M3 and M18 GAS serotypes. This protein and PFBP also have the same leader sequence and exhibit high sequence similarity in the C-proximal region of the protein, which contains the fibronectin binding domains. These observations raise the important question of the evolutionary relationship between the FBP genes.
In order to address this question and increase our understanding of the evolution of PrtF2, we selected 51 prtF2-positive and 11 prtF2-negative genotypically distinct GAS strains. Here we report characterization of the two distinct genotypes of PrtF2 by PCR and DNA sequence analysis and examination of the strains by pulsed-field gel electrophoresis (PFGE) to determine the evolutionary relationship of the prtf2 genotypes. The epidemiological and evolutionary implications of the data are discussed below.
|
|
|---|
Screening for genes encoding fibronectin binding proteins. All GAS strains were screened for genes encoding FBPs, including prtF2, pfbp, fbaB, sfbI, sof, sfbX, fbp54, and fbaA. The prtF2, sfbI, sof, and fbp54 status of these strains has been described previously (12). However, prtF2 PCR performed with primers situated within the fibronectin binding repeat domains described by Delvecchio et al. (12) does not differentiate between the two genotypes of prtF2 (pfbp and fbaB). Therefore, two sets of PCR primers were designed and utilized in this study. The first amplification with primers VPrtf2-F and VPrtF2-R designed for the signal sequence and cell wall anchor region, respectively, distinguished between the two genotypes of prtF2 and confirmed the mutual exclusiveness of the two prtF2 genotypes, and the second PCR amplification with primers PFBP-F and ManR4 designed for the flanking region of the prtF2 open reading frame also distinguished between the two genotypes and confirmed the location of the genotypes in the chromosome. Primers SfbXF1 and SfbXR1 were used to screen for sfbX in all strains (19), and primers FbaA-F and FbaA-R were used to screen for fbaA in all strains (Table 1). PCRs were carried out in 50-µl (total volume) mixtures containing 2 µl of DNA template extracted with a QIAGEN DNeasy tissue kit, 10 mM Tris-HCl (pH 8.3), 10 mM KCl, 2 mM MgCl2, 50 pmol of each primer, each deoxynucleoside triphosphate at a concentration of 200 µM, and 1 U of Taq DNA polymerase.
|
View this table: [in a new window] |
TABLE 1. Primers used for gene detection
|
PFGE and phylogenetic analysis. PFGE was carried out by using the following modification of the method described by Chatellier et al. (7). Briefly, single colonies of each GAS strain were used to inoculate 2 ml of Todd-Hewitt broth (Difco) supplemented with 1% yeast extract and grown overnight at 37°C. Cells were harvested by centrifugation, washed twice with TSE buffer (10 mM Tris-Cl, 1.0 M NaCl, 50 mM EDTA [pH 8.0]), and resuspended in 200 µl of Tris-EDTA buffer (pH 7.5). An equal volume of prewarmed 1.5% low-melting-point preparative-grade agarose (Bio-Rad, Richmond, Calif.) was mixed with each cell suspension, transferred into gel block molds, and allowed to solidify. The blocks were treated for 4 h at 37°C with 400 µl of lysis buffer (6 mM Tris-Cl [pH 7.6], 100 mM EDTA [pH 7.5], 1 M NaCl, 0.5% Brij 58, 0.2% deoxycholate, 0.5% sodium lauroyl sarcosine) containing freshly added lysozyme (1 mg/ml), mutanolysin (100 U/ml), and RNase (20 µg/ml). The lysis buffer was replaced with 300 µl of deproteination solution (1 µg of proteinase K per ml, 1% sodium lauroyl sarcosine, 500 mM EDTA [pH 8.5]), and the blocks were incubated overnight at 50°C. The blocks were prepared for restriction enzyme digestion by washing them three times in Tris-EDTA buffer (pH 7.5) for 30 min each time; the first wash solution contained 0.5 mM phenylmethylsulfonyl fluoride. The blocks were stored in 1 M EDTA (pH 8.5) at 4°C until they were used.
Prior to digestion with restriction enzyme, 2- to 3-mm slices were aseptically cut from the blocks, rinsed for 10 min with sterile distilled water, and equilibrated for 30 min in SmaI restriction buffer. The slices were then incubated overnight at 25°C in fresh restriction buffer containing 20 U of SmaI restriction enzyme (Roche). The digested DNA was resolved by PFGE by using a CHEF-DRTM electrophoresis cell (Bio-Rad, Sydney, Australia) and the following parameters: 0.5x Tris-borate-EDTA running buffer, 6 V/cm, and linearly ramped switch times of 2 to 40 s at 10°C for 23 h.
DNA concatemers (New England Biolabs, Beverly, Mass.) were included as molecular size standards. The gel was then stained with ethidium bromide (1 µg/ml) for 30 min and visualized with UV illumination by using a GelDoc 1000 image analysis station (Bio-Rad, Sydney, Australia).
PFGE restriction fragment patterns were visually assessed by using the criteria of Tenover et al. (35) and were analyzed by using the GelCompar software (version 4.2; Applied Maths, Kortrijk, Belgium). Genetic similarities were compared by clustering methods (unweighted pair group method with arithmetic means) by using the Dice coefficient. A tolerance in the band positions of 0.2% was used for comparisons of fingerprint profiles. MacClade v. 3.08 (27) was used to assign genotypes to ancestral nodes of the PFGE dendrogram.
Nucleotide sequence accession numbers. The nucleotide sequences of the 18 genes sequenced in this study have been deposited in the GenBank database under the following accession numbers: NS101, AY612216; NS1140, AY612217; NS125, AY612218; NS178, AY612219; NS179, AY612220; NS192, AY612221; NS195, AY612222; NS210, AY612223; NS235, AY612224; NS240, AY612225; NS265, AY612226; NS436, AY612227; NS506, AY612228; NS53, AY612229; NS564, AY612230; NS581, AY612231; NS730, AY612232; NS803, AY612233.
|
|
|---|
|
View this table: [in a new window] |
TABLE 2. Characteristics of GAS strains used in this study
|
![]() View larger version (41K): [in a new window] |
FIG. 1. Arrangement of the FCT region. The following strains were examined (accession numbers are indicated in parentheses): M1, strain SF370 (AE006482/3); M6, strain D471 (U01312, L10919, and AY049087); M12, strains A735 and A374 (AF447492, AY049088, and AF071083); M49, strains CS101/B737 (U49397 and AY049089) and 100076 (U31980); M3, strain MGAS315 (AE014138); M5, strain Manfredo (http://www.sanger.ac.uk/Projects/S_pyogenes); and M18, strain MGAS8382 (AE009963/4). The open reading frame designations are indicated above the M1 genome. The region is demarcated by two highly conserved open reading frames, Spy0123 and Spy0136 (striped arrows). With the exception of sfbI, which is used instead of prtF1, all other designations are those reported by Podbielski et al. (30) and Bessen and Kalia (5). The dotted lines represent gaps introduced to aid alignment. Open reading frames with no significant homology are not shaded. The chevron within nra of M18 indicates a stop codon caused by a single point mutation. Similarly, the single open reading frame in M49 spanning eftLSLA and eftLSLB is caused by a single point mutation that results in replacement of a stop codon (TAA) with a Glu (Q) codon (CAA).
|
![]() View larger version (14K): [in a new window] |
FIG. 2. Comparison of pfbp-type and fbaB-type variants. The percentages indicate the levels of identity observed in pairwise comparisons of DNA sequences both within and between variants. The two mature proteins differ significantly in the central domain but are highly similar in all other domains. The central domain is highly conserved in the fbaB type, but there is considerable variation in the pfbp-type central domain. The pfbp-type variants always possess three nonidentical fibronectin binding repeats in the FBRD, while the fbaB-type variants have variable numbers of repeats due to duplication or deletion of repeat 2 (R2). The overall homology of the FBRD of the fbaB type could not be determined due to the resultant gaps in the alignment; however, the individual repeats are highly homologous both within and between variants.
|
![]() View larger version (24K): [in a new window] |
FIG. 3. Parts of the pfbp-type coding regions of the amino terminus have been horizontally transferred between different pfbp-type genes, producing mosaic alleles. Throughout the sequence highly conserved junction regions were identified (shaded) (>93.1% identity in pairwise comparisons) flanking distinct sections or cassettes, which appear to have been exchanged between alleles, resulting in a mosaic structure. Regions with identity to the pfbp-type gene of M12 strain A735 (prtF2.12) are underlined to illustrate the mobility of individual DNA cassettes.
|
The pfbp lineage is more recent than the fbaB lineage. PFGE analysis of 32 strains, including 7 pfbp-positive, 14 fbaB-positive, and 11 prtF2-negative GAS strains, revealed that they produced 32 PFGE patterns, which is consistent with the conclusion that the strains are genetically distinct. Genetic similarity was determined by band-based clustering in the PFGE profiles by using 0.2% tolerance (Fig. 4). The strains were very diverse (<45% similarity) when diversity was measured by using the criteria of Tenover et al. (35). The isolates subjected to PFGE belonged to diverse emm sequence types, with the exception of strains NS1122 and NS931, which were emm sequence type 65. However, despite having the same emm sequence type and a prtF2-negative genotype, NS1122 and NS931 were found to contain distinct Vir types and to produce genetically unrelated PFGE fingerprint profiles, indicating that these strains are genetically distinct.
![]() View larger version (102K): [in a new window] |
FIG. 4. Dendrogram generated by the GelCompar software, showing the genetic relationships among 21 S. pyogenes isolates possessing prtF2 genotypes and 11 isolates not possessing the gene. The dendrogram was constructed by cluster analysis (unweighted pair group method with arithmetic means) of the PFGE patterns obtained after macrorestriction with the SmaI enzyme. A tolerance in the band positions of 0.2% was used for comparison of fingerprint profiles. PFGE fingerprint patterns are shown next to the corresponding branches of the dendrogram. The prtF2 genotypes and emm sequence types are indicated after the strain number. P, pfbp type; F, fbaB type; N, prtF2 not present. The scale indicates percent similarity.
|
![]() View larger version (38K): [in a new window] |
FIG. 5. Evolution of prtF2 genotypes, inferred by parsimony by using MacClade v.3.08 (27), based on the PFGE dendrogram (Fig. 4). The trees indicate the evolution patterns obtained when the gains and losses of a genotype were equally weighted (A) and when gains were weighted higher than losses of a genotype (B).
|
We believe that the analysis shown in Fig. 5A (which is most commonly used) (27) represents the most reasonable working hypothesis for the evolution of different prtF2 genotypes. Nevertheless, as Fig. 5B reveals, a small change in the weight given to gene gains and losses has an impact on the analysis, so that Fig. 5A should be considered only a starting point. Furthermore, these analyses assume that the topology shown in the 0.2% tolerance PFGE dendrogram (Fig. 4) is correct. Although some branches may be misplaced by this analysis, the evolutionary scenario described above represents a reasonable working hypothesis to direct subsequent investigations. Overall, the analysis above suggests that the pfbp type had a more recent origin and arose only in an fbaB-type genetic background.
Distribution of fbaB-type, pfbp-type, and other GAS genes encoding fibronectin binding proteins. GAS may possess genes encoding several FBPs, including fbaB, pfbp, sfbI, sof, sfbX, fbaA, and fbp54. We investigated the distribution of sfbI, sof, sfbX, fbaA, and fbp54 in the 32 fbaB-type and 19 pfbp-type GAS strains examined in this study. The fbaA and fbp54 genes were found in the majority of GAS strains possessing both the pfbp and fbaB genotypes examined. Thirteen of 19 (68.4%) pfbp-type GAS strains possessed sfbI, sof, and sfbX. In contrast, only 2 of 32 (6.3%) the fbaB-type GAS strains possessed sfbI, sof, and sfbX, while in 21 of 32 (65.6%) fbaB-type GAS strains these genes were not detected (Table 2). GAS strains have been defined as belonging to either class I or class II M types based on epitopes present within the conserved C repeat region of M proteins (2, 3). Class II GAS strains are usually sof positive, while class I GAS strains are generally sof negative (2, 4, 15). Examination of the sof status of the 51 prtF2-positive GAS strains examined in this study revealed that 16 of 19 (84.2%) of the pfbp-positive strains possessed sof. By contrast, only 4 of 32 (13.3%) the fbaB-positive strains contained sof, suggesting that the pfbp type is linked to class II GAS and the fbaB type is linked to class I GAS.
|
|
|---|
The gene encoding the FBP PrtF2 is situated in the highly recombinatorial FCT region in GAS (5). PrtF2 facilitates the binding of host fibronectin, enabling GAS to adhere to and be internalized by host epithelial cells (18, 25). Epidemiological evidence suggests that the presence of this gene may confer upon the pathogen a greater propensity to cause invasive diseases (12). In the present study we examined 51 prtF2-positive GAS strains isolated from patients in northern Australia, and here we describe two genotypes of prtF2, which are mutually exclusive. Both genotypes map to the same chromosomal location within the FCT locus, between genes encoding the potential gene regulator (msmR) and a hypothetical protein (Spy0136), and the two types are designated the pfbp type and fbaB type.
Other researchers have shown that expression of PrtF2 is influenced by the global negative regulator nra, which is located within the FCT region (30). Recent evidence obtained by Kreikemeyer et al. (25) and the fact nra is not always found in prtF2-positive strains (5) suggest that prtF2 expression is controlled by additional regulatory elements. Immediately upstream of prtF2 is a putative transcriptional regulator gene that has been termed msmR. MsmR belongs to the AraC family of regulators (5). Furthermore, the GAS genes controlled by this putative regulator have not been identified yet. In all strains studied to date, msmR is present upstream of prtF2. This may suggest that msmR affects prtF2 expression directly or regulates other genes that are required for PrtF2 function.
The major difference between the pfbp gene sequence (32) and the original prtF2 gene sequence (18) is the presence of two single-base-pair sequence changes in prtF2. These changes result in a frameshift mutation that results in the loss of the additional 105 amino acids at the N terminus of PFBP, which was not identified in PrtF2. However, the upstream prtF2 gene sequence (GenBank accession number U31980) contains genetic information coding for part of the 105-amino-acid sequence. None of the 18 prtF2 genes sequenced in this study contains a frameshift mutation similar to that reported by Jaffe et al. (18), suggesting that this mutation is infrequent. Clearly, however, the reported sequences of prtF2 (18), pfbp (32), and fbaB (37) are closely related. We therefore suggest that pfbp and fbaB represent two distinct genotypes of prtF2.
Multiple-alignment analysis of the pfbp-type DNA sequences revealed the presence of a mosaic structure within the pfbp-type central domain. This mosaic structure is not present in fbaB-type sequences, suggesting that only the pfbp-type gene sequences are capable of the intergenomic exchange required to produce such arrangements. Several surface-exposed GAS proteins have been shown to display a mosaic gene structure. These include the proteins encoded by emm (41), ska (22), and sfbI (39). It is believed that this type of genetic recombination plays a fundamental role in providing a mechanism for evading host immune responses. This is highlighted by the serotype-specific protection displayed by anti-M-protein antibodies (26). Such horizontal gene transfer may also produce mosaic gene structures that generate functional diversity of surface-exposed proteins. For instance, various M or M-like proteins have been found to bind fibrinogen, Fc domains of various human immunoglobulins, plasminogen, and/or proteins involved in the complement cascade (23).
In this study, we used software that utilized both the PFGE dendrogram and the prtF2 genotype status to infer the evolution of the different genotypes by minimizing the total number of changes over the tree. The results obtained with this methodology suggested that the smaller, more conserved fbaB genotype had a more ancient origin than the pfbp type. This pattern of inheritance was similar when gains or losses were equally weighted or when gains were weighted higher than losses. As the pfbp type appears only in an fbaB-type background, we hypothesize that the pfbp type arose from an insertional event within the fbaB type and that the genetic rearrangement seen in the central domain of this gene is the result of a subsequent process. This also suggests that acquisition of the pfbp-type gene may have some selectable advantage over acquisition of the fbaB-type gene. One possible advantage may be the result of extra functions gained by the protein through the additional pfbp-type central domain sequence. Additionally or alternatively, the mosaic structure of the pfbp-type central domain may allow immune evasion to occur.
Interestingly all pfbp-type genes sequenced contained three repeat regions in the FBRD, while fbaB-type gene sequences had two to four repeats. Jaffe et al. (18) localized the upstream fibronectin binding domain (UFBD) for PrtF2 between amino acids 679 and 783. Residues in the PrtF2 UFBD critical for fibronectin binding were mapped to positions 679 to 717. Homology of the FbaB UFBD region only begins at amino acid 740 of PrtF2. It is not known if the truncated UFBD of FbaB alters fibronectin binding. Perhaps three fibronectin binding repeats are a functional constraint for an intact UFBD in PFBP. Variations in the number of repeats in the FBRD of FbaB may be due loss of this constraint when the UFBD is truncated.
Thirteen of 19 (68.4%) pfbp-type GAS strains investigated in this study possessed sfbI, sof, and sfbX. In contrast, only 2 of 32 (6.3%) fbaB-type GAS strains possessed sfbI, sof, and sfbX. Interestingly, 21 of 32 (65.6%) fbaB-type GAS strains lacked these three genes. The biological implications of the finding that the pfbp type is generally associated with other FBP genes (sfbI, sof, and sfbX) and the biological implications of the finding that the fbaB type is generally associated with fewer FBP genes are not known. A previous study showed that there is linkage association between the sfbI and sof genes (14). In this study we found that there is a linkage association between pfbp and sof. Sixteen of 19 (84.2%) pfbp-type strains possessed sof. By contrast, only 4 of 32 (13.3%) fbaB-type strains contained sof. Class I GAS strains are generally sof negative and have been associated with acute rheumatic fever, unlike class II GAS strains, which are generally sof positive and are usually associated with skin-tropic infections (2, 4, 15). Therefore, our data suggest that the pfbp type may be linked to class II GAS strains and the fbaB type may be linked to class I strains.
GAS is a highly specific yet extremely versatile pathogen of humans. An increasing number of studies are revealing the extent of the genetic diversity displayed by this species and are implicating horizontal gene transfer and recombination as major mechanisms influencing the generation of this diversity. Additional work that leads to a better understanding of the evolution of GAS and, in particular, the emergence of highly virulent strains is warranted and may provide new strategies for predicting GAS disease trends and pathogenic mechanisms.
We thank Kent Wu for his assistance with the analysis of the pulsed-field gel electrophoresis data with the GelCompar software.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»