Previous Article | Next Article ![]()
Journal of Bacteriology, February 2007, p. 1473-1477, Vol. 189, No. 4
0021-9193/07/$08.00+0 doi:10.1128/JB.01227-06
Copyright © 2007, American Society for Microbiology. All Rights Reserved.
The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom,1 Institute for Cell and Molecular Biosciences, The Medical School, University of Newcastle upon Tyne, Framlington Place, Newcastle upon Tyne, NE2 4HH, United Kingdom2
Received 4 August 2006/ Accepted 19 September 2006
|
|
|---|
|
|
|---|
![]() View larger version (21K): [in a new window] |
FIG. 1. Phylogenetic diversity of the sequenced S. pyogenes strains: unrooted neighbor-joining tree constructed using concatenated sequences of the seven loci used in MLST for a representative selection of STs from the S. pyogenes MLST database (www.mlst.net). The tree was constructed using ClustalX (23) and Phylip (9) with the Kimura two-parameter method, and the plot was generated using NJplot (17). The positions of all 12 strains for which complete genome sequences are available are shown.
|
1.3 Mb) (Fig. 2), which probably resulted from reciprocal recombination between rrn-comX regions that are similar distances from the terminus of replication (14) (Fig. 2). Notably, this large inversion is visible in the comparison of Manfredo with MGAS10394 and MGAS8232 (Fig. 2), which are more closely related to Manfredo than SSI-1 is (Fig. 1). This suggests that the inversion occurred independently in Manfredo and SSI-1. There is an additional smaller (
200-kb) rearrangement near the terminus of strain SSI-1 (compared to MGAS315) (Fig. 2), due to reciprocal recombination between prophages across the replication axis (14). This smaller inversion was not found in the Manfredo chromosome, which lacks prophages inserted at equivalent sites (Fig. 2). It seems clear that intrachromosomal recombination is an important mechanism contributing to the evolution of both S. pyogenes genomes and prophages and has the potential to generate novel recombinant prophages with alternative cargos (14).
![]() View larger version (74K): [in a new window] |
FIG. 2. Comparison of the genome structures of S. pyogenes: pairwise comparisons of the S. pyogenes MGAS315, SSI-1, Manfredo, MGAS10394, and MGAS8232 chromosomes displayed using the Artemis Comparison Tool (ACT) (6). The sequences were aligned using the predicted replication origins (oriC) (left), with the terminus of replication in the center. The colored bars separating the genomes (red and blue) represent matches identified by BLASTN (1). Red lines link matches in the same orientation, and blue lines link matches in the reverse orientation. Regions of the chromosomes containing prophages are indicated by pink boxes.
|
Man.1,
Man.2,
Man.3,
Man.4, and
Man.5) (Fig. 2) that exhibit mosaic relationships with other S. pyogenes prophages. To illustrate the relationships of the prophages, for each of the sequenced S. pyogenes strains all of the resident prophages were concatenated (joined end to end) and compared using DOTTER (21), which displays nucleotide similarity as a dot matrix plot (Fig. 3). In Fig. 3, individual colored bars on the x and y axes represent the concatenated prophage sequences for each of the strains indicated, and vertical and horizontal lines indicate the junctions between individual prophages on the x and y axes, respectively. Diagonal lines indicate sequence similarity; self-matching of the concatenated sequences generated the continuous central diagonal line. Lines on either side of the continuous central diagonal line indicate regions where there is extended sequence identity in the forward (parallel) and reverse (perpendicular) orientations for intersecting sequences. Of particular note is the fact that the distribution of prophages and the overall genetic relatedness between the strains are clearly not congruent. Divergent prophages are inserted at exactly the same sites in closely related strains, while identical sites in divergent strains can be occupied by highly conserved prophages. For example,
Man.5 and
10394.8 are clearly distinct prophages (Fig. 3) occupying the same sites in the closely related serotype M5 strain Manfredo and serotype M6 strain MGAS10394 (Fig. 1), while
Man.4 and
10750.1 are similar prophages (Fig. 3) at the same chromosomal location in distantly related serotype M5 strain Manfredo and serotype M4 strain MGAS1075 (Fig. 3).
![]() View larger version (79K): [in a new window] |
FIG. 3. Comparative nucleotide sequence analysis of S. pyogenes prophages: dot matrix showing the relatedness of the nucleotide sequences of prophages generated with DOTTER (21). The prophages used in the comparison (in order) were joined end to end and were obtained from Manfredo ( Man.1, Man.2, Man.3, Man.4, and Man.5), SSI-1 (SPsP1, SPsP2, SPsP3, SPsP4, and SPsP5) (14), SF370 (370.1, 370.2, 370.3, and 370.4) (10), MGAS315 ( 315.1, 315.2, 315.3, 315.4, 315.5, and 315.6) (5), MGAS8232 ( speA, speC, speL/M, 370.3-like, and sda) (20), MGAS10394 ( 10394.1, 10394.2, 10394.3, 10394.4, 10394.5, 10394.6, 10394.7, and 10394.8) (3), MGAS6180 ( 6180.1, 6180.2, 6180.3, and 6180.4) (11), MGAS5005 ( 5005.1, 5005.2, and 5005.3) (22), MGAS2096 ( 2096.1 and 2096.2) (4), MGAS9429 ( 9429.1, 9429.2, and 9429.3) (4), MGAS10270 ( 10270.1, 10270.2, 10270.3, 10270.4, and 10270.5) (4), and MGAS10750 ( 10270.1, 10270.2, 10270.3, and 10270.4) (4). The colored bars indicate the extents of the concatenated prophages for the strains, and the vertical and horizontal lines indicate the extents of the individual prophages. The green arrows indicate prophages referred to in the text.
|
Man.1, streptococcal phage DNase SpyM50534; for
Man.2, streptococcal phage DNase SpyM50691; for
Man.3, exotoxin H SpyM51021 and exotoxin I (pseudogene) SpyM501024; and for
Man.4, streptococcal phage DNase SpyM501263 and exotoxin C SpyM501264. The fifth prophage,
Man.5, does not contain any CDSs with similarity to CDSs encoding known virulence factors and appears to be a satellite phage. The potential for prophage recombination events to generate novel combinations of virulence genes is highlighted by the fact that mosaic structures are evident in the regions carrying the virulence determinants. For example, prophages
Man.2 and
8232.4, which are inserted at the same attachment site in their respective genomes, exhibit extended similarity to each other but also include clearly divergent sequences (Fig. 3). The sequences at left ends of these prophages are divergent and encode streptococcal phage DNases with less than 30% amino acid identity. Notably, the streptococcal phage DNase of
8232.4 is almost identical (99.624% amino acid identity) to the streptococcal phage DNase encoded by
Man.1, a prophage that displays far less sequence conservation with
8232.4 than
Man.2 displays (Fig. 3).
Excluding genes associated with prophages,
68% of the CDSs in Manfredo have orthologs in all of the other sequenced strains. Therefore, taking into account the contribution that prophages make, between 14 and 18% of the genome is composed of CDSs that are not conserved in one or more of the sequenced strains. Some of the CDSs in this variable component may contribute to the clinical differences between the strains. For the 12 sequenced strains, there is very strong evidence linking both serotype M5 strain Manfredo and serotype M18 strain MGAS8232 with ARF (7, 20), and the genetic relationship between these two strains is interesting (Fig. 1). However, reciprocal FASTA (16) analysis identified only a single CDS that was conserved in Manfredo and MGAS8232 but absent in the other S. pyogenes strains. This CDS encodes a surface-anchored protein (SpyM50104) that is a shortened allelic variant of a fibronectin-binding protein commonly found in the variable FCT (9) region of S. pyogenes genomes. The fibronectin-binding protein variants of Manfredo and MGAS8232 appear to be truncated compared to the other variants, but they retain functionally important N-terminal signal and C-terminal sortase processing motifs (VPXTG) that predict that they are likely to be expressed on the cell surface. However, while the conservation of this locus in strains that have been very clearly associated with ARF may be worthy of further investigation, it must be emphasized that there is no evidence at this stage that the locus influences the pathology of these strains. It must also be emphasized that it is by no means clear that these are the only two representatives of the 12 sequenced strains that have the capacity to cause ARF, since establishing epidemiological relationships between ARF and individual S. pyogenes strains is notoriously difficult (13). Alternatively, it is possible that subtle differences in the conserved components of the genome may be more important in distinguishing ARF isolates. Investigating the distribution and functional effects of single-nucleotide polymorphisms (SNPs) in ARF strains and related non-ARF strains may hold the key to unraveling the causes of this complex disease. To this end, 14,962 SNPs were identified in the comparison of ARF strains (Manfredo and MGAS8232), in contrast to the 15,460 SNPs identified in a comparison of similarly related strains (Manfredo and MGAS10394). To pinpoint the functionally significant SNPs among the genetic noise, wider population studies are required.
Nucleotide sequence accession number. The sequence and annotation of the Manfredo genome has been deposited in the EMBL database under accession number AM295007.
This work was supported by the Wellcome Trust through its Beowulf Genomics initiative.
Published ahead of print on 29 September 2006. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»