Previous Article | Next Article ![]()
Journal of Bacteriology, August 2009, p. 4854-4862, Vol. 191, No. 15
0021-9193/09/$08.00+0 doi:10.1128/JB.01272-08
Copyright © 2009, American Society for Microbiology. All Rights Reserved.
,
Division of Infection and Immunity, Glasgow Biomedical Research Centre, University of Glasgow, 120 University Place, Glasgow G12 8TA, United Kingdom,1 the Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom,2 Allegheny General Hospital, Allegheny-Singer Research Institute, Center for Genomic Sciences, Pittsburgh, Pennsylvania 15212,3 Department of Microbiology and Immunology, Drexel University College of Medicine, Allegheny Campus, Pittsburgh, Pennsylvania 15212,4 Centro de Investigaciones Biológicas and Ciber de Enfermedades Respiratorias, Ramiro de Maeztu 9, 28040 Madrid, Spain5
Received 10 September 2008/ Accepted 27 May 2009
|
|
|---|
Spn_6, and this represents the first time that an addition system in a pneumophage has been identified. Collectively, the temperate pneumophages contain a diverse set of genes with various levels of similarity among them. |
|
|---|
Bacteriophages of S. pneumoniae (pneumophages) were first identified in 1975 from samples isolated from throat swabs of healthy children by two independent groups (46, 65). Since then, pneumophages have been identified from different sources and a variety of locations (44). The abundance of temperate bacteriophages in S. pneumoniae has been reported in different studies in the past (6, 54). Up to 76% of clinical isolates have been showed to contain prophages (or prophage remnants) when studied with a DNA probe specific for the major autolysin gene, lytA, which hybridizes with many of the endolysin genes of temperate pneumococcal phages (54). Hybridization analyses have identified highly similar prophages among pneumococcal clinical isolates even of different capsular serotypes, a result which indicates the widespread distribution of these mobile genetic elements among virulent strains (26).
Only three S. pneumoniae bacteriophage genomes have been characterized in detail, and their sequences have been determined. Dp-1 and Cp-1 are lytic bacteriophages, whereas MM1 is a temperate pneumophage (45, 50, 52). Genes coding for virulence factors such as toxins or secreted enzymes have been associated with the presence of prophages in both gram-negative (67) and gram-positive bacteria, such as Streptococcus pyogenes (7) and Staphylococcus aureus (23). Because a considerable number of toxin genes are located in prophages, phage dynamics are of apparent importance for bacterial pathogenesis. Unfortunately, the role of temperate bacteriophages in the virulence of S. pneumoniae remains mostly unknown.
Recently, the availability of relatively inexpensive next-generation sequencing technologies has permitted the complete genomic analysis of dozens of genomes of pneumococcal clinical isolates. In this report, we present a comparative genomic analysis of 10 pneumophages identified in the genomes of newly sequenced S. pneumoniae strains. The proteome of these phages has been predicted and annotated by comparative sequence analyses by using the available databases at the National Center for Biotechnological Information website (http://www.ncbi.nlm.nih.gov/). This systematic characterization of pneumophage genomes provides for a substantial increase in our knowledge of the global proteome and the overall genetic diversity of this important human pathogen. The comparative analysis of multiple temperate bacteriophages from a single species offers a unique opportunity to study one of the mechanisms of lateral gene transfer that drive prokaryotic genetic diversity.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Summary of genome sizes, gene contents, gene diversities, and proteome contents for S. pneumoniae prophages
|
Phage preparations were negatively stained with the NanoVan (methylamine vanadate; Nanoprobes) staining solution on carbon-reinforced, Formvar-coated copper grids (300 mesh) as described previously (37) with minor modifications. Minor modifications consisted of impregnating the grid with 5 µl of the phage sample and leaving it for 1 min. Afterwards, 5 µl of the stain was placed on the grid for 1 min and washed with the same volume of water for 1 min. The grids were dried at room temperature for 1 h. Samples were observed using a Zeiss LEO 902 electron microscope working at 80 kV. Phage DNA was purified from crude extracts of strain CGSSp6BS73. Briefly, the pellet obtained from the lysate after mitomycin C induction was resuspended in 10 mM Tris buffer (pH 8.0) and treated with DNase I (1 mg/ml). Then, it was treated with 50 mM EDTA, 0.5% sodium dodecyl sulfate, and 100 µg/ml proteinase K for 2 h at 37°C. Finally, DNA was isolated following phenol-chloroform steps and resuspended in 100 µl of Tris-EDTA buffer (10 mM Tris, 1 mM EDTA [pH 8.0]).
Bioinformatic methods and cluster analysis.
Prophage genomes were obtained from the sequence of their host (Table 1) followed by confirmation by restriction digest (see Fig. S1 in the supplemental material). Gene prediction, annotation, and sequence clustering were performed using Glimmer 3.0 (14), Artemis (57), and TribeMCL (22), respectively. Genome comparisons were generated using BLAST algorithms and analyzed using the Artemis Comparison Tool (13). The criteria used to identify putative open reading frames (ORFs) were the presence of (i) the potential to code for a polypeptide of more than 33 amino acid (aa) residues and (ii) a putative ribosome binding site (2) and, at 3 to 9 nucleotides (nt) downstream of the central G of the ribosome binding site (31), an ATG, GTG, or TTG codon that could serve as a start codon. Phylogenetic trees were constructed with Phylip Neighbor (http://evolution.genetics.washington.edu/phylip.html) and visualized with TreeView X (http://darwin.zoology.gla.ac.uk/
rpage/treeviewx/). The dot matrix was calculated using Dotter with a sliding window of 25 bp (62). Identity values for nucleotide comparison were obtained using the Artemis Comparison Tool (http://www.sanger.ac.uk/Software/ACT/). Prediction of transmembrane helices in proteins and of signal peptides were carried out using the TMHMM (http://www.cbs.dtu.dk/services/TMHMM/) and SignalP (http://www.cbs.dtu.dk/services/SignalP/) servers, respectively.
In certain cases for the phage genomes, the sequences were incomplete due to gaps in the assembly. PCR gap closure was performed as described previously (29), by sequencing PCR amplicons targeted to fill gaps between neighboring contigs. Gaps were inferred by scaffolding to all sequenced S. pneumoniae genomes using Nucmer, and primers were designed for the ends of the contigs. A PCR was run with a 7-min extension period, and PCR products were sequenced using Sanger sequencing. The primers designed for gap amplification, as well as additional primers designed within the gap based on scaffolding information, were also used as sequencing primers. If primers designed within the inferred gaps were not successful, primer walk sequencing was employed.
|
|
|---|
The complete genomes of 10 double-stranded-DNA S. pneumoniae temperate bacteriophages (
Spn_OXC,
Spn_3,
Spn_6,
Spn_9,
Spn_11,
Spn_14,
Spn_18,
Spn_19,
Spn_23, and MM1-2008) were sequenced and/or analyzed using bioinformatic tools (see above) and compared to those of MM1 and MM1-1998, two closely related pneumophages that have been previously characterized (50). Combined sequence analysis of the phage genomes showed that they range from 31 to 42 kb in size and have an average GC content of 39.5% (Table 1), which is similar to the 39.7% GC content reported for the S. pneumoniae genome (64). Comparative analyses have revealed that the genomes of
Spn_OXC and
Spn_3 showed 98.6% identity (hereafter, they are referred to as
Spn_OXC/
Spn_3 bacteriophage). Similarly, since more than 99% identity has been found among the prophage from the 23F strain (MM1-2008) and the genomes of the temperate bacteriophages MM1 (50) and MM1-1998 (43), unless stated otherwise, these three phages are referred to collectively as MM1-like.
The majority of the ORFs carried by temperate pneumophages are transcribed from one strand, although the lysogeny module is usually transcribed from the opposite strand (see Table S1 to S9 in the supplemental material).
The 10 pneumophages studied here were grouped into three classes based on comparisons of the complete prophages (Fig. 1A) as well as their predicted encoded proteins (Fig. 1B): group 1, containing
Spn_OXC/
Spn_3,
Spn_11, and
Spn_14; group 2, including
Spn_6,
Spn_9,
Spn_19, and
Spn_23; and group 3, with MM1-2008 and
Spn_18. The sizes of the genomes vary slightly, with group 1 phages having genomes between 31 to 33 kb, group 2 phages having genomes from 39 to 42 kb, and group 3 phages having genomes from 37 to 40 kb (Table 1).
![]() View larger version (42K): [in a new window] |
FIG. 1. Groups of pneumococcal prophages identified by genome comparisons. For brevity, the prefix " Spn_" of phage names has been omitted. (A) Dot slot matrix calculated for the genome sequences of the S. pneumoniae prophages. The dot matrix was calculated using Dotter (62). (B) Unrooted tree based on the phage's predicted protein coding sequences. Sequences were clustered with TribeMCL using an E value cutoff of 10–30, and the tree was generated using similarity coefficients calculated between each phage pair on the basis of shared clusters.
|
|
View this table: [in a new window] |
TABLE 2. Summary of the main characteristics identified in the genetic modules in the three different groups of temperate bacteriophages
|
Phage integrases are responsible for the integration of the phage genome into the bacterial chromosome. The putative integrases that are encoded by the temperate bacteriophages analyzed in this study belong to the integrase family of tyrosine recombinases (Int family). The Int family of integrases, such as the
integrase, utilize a catalytic tyrosine to mediate strand cleavage and require other proteins encoded by the phage or the host bacteria (27). In addition to the catalytic tyrosine, five other residues are highly conserved in the tyrosine recombinase family, i.e., the RKHRH pentad (27). With the notable exception of the integrases encoded by MM1-like prophages, the putative integrases identified in the temperate pneumococcal bacteriophages clustered in the same three phage groups described above. Interestingly, the integrases encoded by MM1-like prophages were more similar to those of group 2 phages (Fig. 1A) than to that of
Spn_18 (see below). Predicted integrases encoded by
Spn_OXC/
Spn_3,
Spn_11, and
Spn_14 are highly related (group 1). They all have 382 aa, with the only exception being that encoded in the
Spn_14 genome, which has a size of 360 aa due to a frameshift. All alleged integrases of group 2 are identical in length, 375 aa, with alignment showing only five mismatches over the entire sequence. However, similarities between the group 2 and group 3 integrases have been identified only in the N-terminal part of the amino acid sequence. The first 51 aa residues are almost identical to those in this group of integrases, but the rest of the sequence is more divergent. In agreement with the characterization of integrases from the Int family (27), an amino acid comparison of group 2 integrases showed that the identity of the first 51 aa residues could be related to the capacity of binding to the DNA, whereas the recognition of the core region and the catalytic activity would be in the C-terminal part of the integrase protein.
Phages
Spn_OXC/
Spn_3,
Spn_11, and
Spn_14 are found integrated into their host strains between genes SP_0019 and SP_0020 in the TIGR4 genome (64). The att core sequence was identified by the alignment of the attL, attR of
Spn_OXC/
Spn_3, and the regions downstream and upstream of the cited genes in TIGR4. An overlapping region of 21 nt was identified as the att core region (attOXC) (5'-CTTTTTCATAATAATCTCCCT-3'). Two additional attOXC sequences are present in the TIGR4 genomes, i.e., between genes SP_0257 and SP_0258 and between genes SP_0260 and SP_0261. Phages
Spn_6,
Spn_9,
Spn_19,
Spn_23, and MM1/MM1-2008 are integrated at the same host genomic site as phage MM1 is inserted into strain Spain-23F1, that is, between genes SP_1563 and SP_1564 of the TIGR4 genome (26). The att core region in this case, or attMM1, is a 15-nt sequence (5'-TTATAATTCATCCGC-3'), and it is present only once in the TIGR4 genome (26). Phage
Spn_18 is inserted between genes SP_0020 and SP_0021. The att core region has not been identified, but genome comparisons (not shown) indicated that a remnant phage is located at the same position in a serotype 23F strain sequenced at the Sanger Institute.
The sequence analysis of the lysogeny modules of the pneumophages has revealed that the theory of modular evolution of bacteriophages cannot explain the diversity in gene content observed in this region (9). This theory suggests that the product of evolution is not a given virus but a family of interchangeable genetic elements (modules), each of which is multigenic and can be considered as a functional unit; exchange of a given module for another functionally equivalent module occurs by recombination among viruses belonging to the same interbreeding population. In contrast, as was previously suggested for phages from S. thermophilus (49) and other dairy bacteria (16), the unit for evolutionary exchange in S. pneumoniae bacteriophages is not a group of functional genes but could be as small as a single gene. A clear example is found in the lysogeny modules of S. pneumoniae group 1 phages, which are highly related but among which the putative exonuclease encoded by orf2 in
Spn_OXC/
Spn_3 and
Spn_14 has been replaced in the genome of
Spn_11 by a gene with a different origin and that encodes a hypothetical protein. Similarly, sequence analysis of lysogeny modules of group 2 and 3 pneumophages have revealed variation in gene content in the ORF immediately downstream of the integrase gene and genes located between the transcriptional regulator and the antirepressor (see Table S1 to S9 in the supplemental material). The sequence comparison of lysogenic modules showed that horizontal genetic exchange, besides point mutations, small deletions, and insertions that have also been identified, plays an important role in gene variation in S. pneumoniae bacteriophages.
Replication module.
The replication module is located adjacent to the lysogeny module in the genomes of all temperate bacteriophages studied and is highly conserved within each of the phage groups identified in this study; however, differences in the 3'-terminal flanking region were observed. Although replication modules between groups do not share remarkable similarities, the dot plot matrix (Fig. 1A) showed a conserved region in this module for all genomes studied. The region contains one conserved gene, cg1 or cg2 (conserved gene 1 or 2) (in
Spn_OXC/
Spn_3 and
Spn_23), or the two conserved genes (cg1 and cg2) (in all the rest of the phages), which encoded highly related gene products with unknown functions. These pairs of genes are the orf24 genes in
Spn_OXC/
Spn_3, orf20-orf21 in
Spn_11, orf25-orf26 in
Spn_14, orf29-orf30 in
Spn_6, orf30-orf31 in
Spn_9, orf28-orf29 in
Spn_19, orf31 (cg1) in
Spn_23, orf22-orf23 in MM1 and MM1-2008, and orf25-orf26 in
Spn_18. The product of the first of these two genes, absent only in
Spn_23, represents a protein with a size that ranged from 104 to 109 aa. These proteins showed 75.2% identity to each other. In contrast, the gene downstream is less conserved and absent only in
Spn_OXC/
Spn_3. The corresponding gene products are predicted to vary from 141 to 231 aa, with only 14.5% identity, mainly located at the C terminus. In any case, the location of these genes at the 3'-terminal flanking region of the replication module, and the high similarity shared between them make these sequences good candidates for locations where modular recombination can take place.
In the replication module of group 2 phages (
Spn_6,
Spn_9,
Spn_19, and
Spn_23) located immediately downstream of the gene encoding the replication protein, there is a gene encoding a protein showing a conserved virulence-associated domain (VirE or VapE) (see Tables S1 to S9 in the supplemental material). The proteins encoded in this position showed 96.2% amino acid identity among them, and their consensus sequence showed a 29% amino acid identity (E value, 10–34; identity, 104/353 bases) with the virulence-associated protein E (VapE) of Dichelobacter nodosus, in which this domain was originally identified (36). vapE is part of the vap regions of D. nodosus that have been associated with virulence (8). The mechanism of VapE in the virulence of D. nodosus has not been determined yet, but the presence of an integrase gene, showing similarities to integrase genes of Shigella flexneri phage Sf6 and coliphages P4 and
R7 located immediately upstream of vapE, suggested a role for bacteriophages in the evolution and transfer of these bacterial virulence determinants (13). Moreover, a vapE-like gene has also been identified in a pathogenicity island of S. aureus that also contains the toxic shock syndrome toxin-1 (tst) gene (41) and in phages of Vibrio parahaemolyticus, although no role was determined for their presence (58). The role of vapE-like genes in the virulence of S. pneumoniae remains to be clarified. However, the proximity of vapE-like genes to cg1 and cg2 in the genome of group 2 bacteriophages suggests the possibility of exchange of this putative virulence factor with other bacteriophages.
The module of replication in the genome of
Spn_6 contains two genes located downstream of cg1 and cg2 which are on the complementary strand (orf32 and orf33). Sequence comparisons have revealed that this pair of genes may encode a toxin-antitoxin system (TA) similar to the MazEF system identified in Escherichia coli (1). Bacterial TA systems generally consist of a toxic protein and the cognate antidote (or antitoxin) that is proteolytically unstable (25, 69). The TA cassettes have a characteristic organization in which the gene for the antitoxin component precedes the toxin gene, which usually overlaps the last nucleotides of the antitoxin gene. TA gene pairs function to ensure plasmid maintenance after cell replication by eliminating plasmid-free cells that emerge as a result of segregation or replication defects. Chromosomal homologues of TA genes are widely distributed in bacteria and induce reversible cell cycle arrest or programmed cell death in response to starvation or other adverse conditions (28). TA systems have also been identified in E. coli temperate bacteriophages P1 and N15. P1 and N15 are temperate bacteriophages that are stably maintained as a circular plasmid and as a linear plasmid, respectively. The genes phd (standing for "prevent host death") and doc ("death on cure") are those encoding, respectively, the antitoxin and the toxin in bacteriophage P1 (38). The bacteriophage N15 encodes a TA system homologous to the tad-ata module of the Paracoccus aminophilus plasmid pAMI2 (18). Conserved TA domains homologous to the MazEF system of E. coli (1) in the protein products of genes orf32 and orf33 located in the replication module of phage
Spn_6 were predicted using RASTA-Bacteria, an online tool available for the identification of TA loci in prokaryotes (59). Homologous genes have been identified in a defective prophage present in strain CGSSp14BS69,
Spn_14.2, and in EJ-1, a mosaic bacteriophage of Streptococcus mitis (56). The putative antitoxin is encoded by the gene located upstream of the putative toxin gene. The mazE-like gene products identified in
Spn_6,
Spn_14.2, and EJ-1 are 95 aa in length and show 73.7% identity. The RASTA-Bacteria tool showed that the products of the mazE-like genes identified in the prophage genomes showed 34 to 36% identity (E value, 10–5) with the MazE conserved domain of E. coli (COG2336). The mazE-like gene in
Spn_6,
Spn_14.2, and EJ-1, as previously demonstrated in the MazEF system in E. coli (1, 20), overlaps the mazF-like gene. The mazF-like products are 117 aa long in
Spn_14.2 and
Spn_6 but are 73 aa long in EJ-1. The pneumophage mazF-like genes showed 38.5% identity among the group, and the RASTA-Bacteria tool showed 31% identity between the products of mazF-like genes of
Spn_6 and
Spn_14.2 and the conserved domain of MazF of E. coli (COG 2337) (E value, 10–15) and 40% identity in the case of the putative toxin MazF-like of phage EJ-1 (E value, 10–15). mazEF is a stress-induced "suicide module" that triggers cell death in E. coli when a stress condition interrupts the expression of mazE. This leaves MazF unimpeded to exert its toxic effect and causes cell death (19, 21). Recently, a mazE and pemK TA system has also been identified in the defective prophage LJ771 of Lactobacillus johnsonii (15). This TA cassette is located between the phage endolysin gene and the attR of LJ771, in which phage-encoded virulence factors in other streptococcal species have been identified (32, 33). mazE and pemK were cloned singly or together in E. coli, and their expression was studied during in vitro growth of the L. johnsonii lysogenic strain. This study concludes that the TA cassette identified in L. johnsonii behaves as a typical "addiction system" (15). The role of the MazEF system in S. pneumoniae bacteriophages has not been ascertained. It may function as a mechanism for the maintenance of the temperate bacteriophage in the chromosome of the host as previously observed in the TA systems in E. coli bacteriophages (38). Besides, it may have a putative physiological function, as shown by a recent study that describes a homologous MazEF system in the chromosome of Streptococcus mutans which showed similarities with the TA system identified in the pneumophages (39). The genes located upstream and downstream of the TA genes in
Spn_6 show high similarities to those from other pneumophages, so they may allow recombination at this locus.
Overall, group 3 bacteriophages (MM1, MM1-2008, and
Spn_18) are highly related, but in their replication modules, differences in gene content were identified. orf13 and orf14, identified in phage MM1, correspond to overlapping genes that are predicted to encode two components of a five-cytosine-specific DNA methyltransferase (C5-MTase) (50) that have been replaced in
Spn_18 by a gene identical to that encoding a C5-MTase in the temperate pneumophage VO1 (51). DNA methyltransferases appear to provide functions that are also beneficial to the host cell and usually are found as part of restriction-modification (R-M) systems (30). Recent studies on R-M systems have shown that these genes are among the most rapidly evolving and that their variation may have an effect on the fitness of the host (3). Variability affecting the presence or absence of C5-MTase in MM1-like bacteriophages confirms previous information suggesting that variation in R-M systems may also occur through selection acting on laterally transferred genes (35, 51).
Packaging module.
The majority of the packaging modules in the phage genomes studied here are essentially composed of three genes encoding the small and large subunits of the terminase and the portal protein, although in a few cases, the terminase is encoded by a single gene (see Table S1 to S9 in the supplemental material). Terminases are responsible for the recognition of their phage DNAs, ATP-dependent cleavage of the DNA concatemer, and packaging of the DNA molecules into the empty capsid shells through the portal protein (34). Genes encoding the putative terminase components were observed for all pneumophage genomes in this study, although the individual genes all predicted to fulfill the same functions appeared to have different origins. The packaging module of
Spn_11 represents the only exception, as a putative structural protein encoded by a gene located upstream of the terminase gene has been identified. This gene organization coincides with the ones previously described for the S. mitis prophage SM1 and the lactococcal r1t bacteriophage (61). Sequence comparison of packaging modules in group 2 phages showed that horizontal gene exchange can play a role in module variation. This is represented by the gene encoding the terminase small subunit of
Spn_23, which showed an origin different from that of the ones in the phages of the same group (Fig. 1A; see the supplemental material).
Morphology module.
Most of the differences found in the genomes of the group 1 phages are located within the morphology cluster; however, with the exception of
Spn_11, they appear to be similar to the Sfi21-like Siphoviridae phages (10). In contrast, the genome organizations of the morphology module of phage
Spn_11 and phages in group 2 and group 3 resemble that of the Sfi11-like phage group. The gene organization observed in the morphology module of
Spn_11 resembles homologous genes in the SM1 bacteriophage and in the lactococcal phage r1t, while others in this family maintain their capsid genes as a cluster located downstream of the terminase gene (17, 61, 66).
SM1 is a bacteriophage of S. mitis that was indirectly identified in the search for genetic loci mediating binding of the bacterium to human platelets (4, 5, 61). PblA and PblB are two surface-expressed proteins that are involved in the platelet binding activity of S. mitis but are, in fact, the tape measure protein and a tail fiber of the SM1 bacteriophage. pblA and pblB form part of the same operon, and mutations made in each of the genes have shown that the expression of both genes is required for the adhesion to human platelets of S. mitis SF100 in vitro (4) and for its virulence in an animal model of infective endocarditis (47). PblA and PblB function as adhesins and are expressed and liberated at the cell surface due to the activity of the holin and endolysin of SM1 in S. mitis SF100 (47). Similarities throughout the genomes of
Spn_11 and SM1 have been identified. Interestingly, an operon including pblA- and pblB-like genes has been identified in the morphology module of
Spn_11 (not shown). The amino acid sequences of PblA from SM1 and
Spn_11 showed 58.7% identity. There is an additional gene located between pblA and pblB, with an unknown function, in the genome of SM1. At the same position in the genome of phage
Spn_11, a homologous gene (with 63.4% identity to the gene in SM1) was identified. A PblB-like protein is also encoded by the
Spn_11 genome and, interestingly, by the genomes of all the group 1 and group 2 phages but not by MM1-like phages. The predicted products of pblB-like genes are greater than 1,000 aa in length, contain long repeats, and possess C termini that are rich in aromatic amino acids. The tryptophan-rich repeats observed at the C-terminal ends of PblA in SM1 and PblA-like protein in
Spn_11 appear to be responsible for binding to the cell wall (47) in a similar way to the choline-binding repeats characteristic of the S. pneumoniae major autolytic enzyme, LytA (24), and other choline-binding proteins (44). The function of pblA and pblB-like genes encoded by S. pneumoniae bacteriophages is unknown, but it is conceivable that they may play a role in adhesion. Nevertheless, this assumption needs to receive experimental support.
Lysis module.
The essential functions of the lysis module of temperate bacteriophages are usually performed by the products of the holin and endolysin genes. The holins are small molecules that accumulate in the membrane and at a specific time form holes that permeabilize it, whereas the endolysin molecules accumulate at the cytosol until the holes are formed so they can reach the cell wall (68). In many streptococcal phages, an unusual holin/endolysin arrangement that is characterized by two holin-like coding sequences that are located immediately upstream of the endolysin gene has been observed (11, 50, 60). This arrangement was also observed in the majority of the lysis modules studied here, as two holin-like sequences (hol1 and hol2) have been identified upstream of the endolysin gene. The exceptions are represented by phages
Spn_9 and
Spn_19, which have only one holin gene (hol2). hol1 genes encode highly related proteins (138 aa) showing 92% identity, whereas hol2 genes encode proteins of 110 or 111 aa, with 72% identity. Holins have been grouped into three classes according to the number of potential transmembrane domains. Class I members have the potential to form three transmembrane domains, and class II members can form only two transmembrane domains. The third class of holins comprises atypical or unclassified holins (68). Hol1 proteins belong to class I holins, as they have the potential to form three transmembrane domains, while hol2 gene products showed only one transmembrane domain (from Ile7 to Val24). This region in Hol2 proteins might also correspond to a cleavage signal sequence with a potential signal processing site located between Ala26 and Val27, as predicted by bioinformatic analysis (not shown). Similar features have been described for the RI protein, the bacteriophage T4 antiholin (53). Consequently, the hol2 gene products are most probably antiholins.
All the temperate bacteriophages, except
Spn_18, harbor typical lytA-like alleles, as they all encode products the same size as the major pneumococcal autolysin, LytA (318 aa) (42). It is noteworthy that the N-acetylmuramoyl-L-alanine amidases encoded by the temperate pneumophages studied here share a high degree of similarity among one another (91.5%) and with the host LytA enzyme (68.3% identity). The presence of lytA-like alleles in all the genome studies here suggests that LytA-like amidases are the most abundant endolysins from temperate pneumophages and strongly suggests that recombination exchange between the endolysin-encoding gene of the prophage and the host gene lytA participates in the evolution of these genes.
New bacteriophages identified in additional S. pneumoniae strains sequenced by the J. Craig Venter Institute.
While this work was in preparation, the genomes of seven additional S. pneumoniae strains were sequenced at the J. Craig Venter Institute, and the data became available online. Temperate bacteriophages have been found in four of those genomes: those of SP195, CDC3059-06, Hungary19A-6, and CDC1873-00. The newly identified prophages presented the same modular organization as already described for pneumophages and showed different degrees of similarity between them. Interestingly, the new phage genomes grouped perfectly well with the pneumophage genomes studied in this work and share the same intrinsic peculiarities described in this report for each of the groups (Fig. 1). Notably, the genomes of two bacteriophages have been identified in the sequence of strains SP195 (
Spn_195_1 and
Spn_195_2) and Hungary19A-6 (
Spn_H_1 and
Spn_H_2). Phage
Spn_195_1 is inserted into attMM1 and was shown to be highly related to
Spn_9 and other group 2 phages, whereas
Spn_195_2, which integrates into attOXC, was related to group 1 phages (Fig. 1). The genome of the strain Hungary19A-6 also contained two bacteriophages:
Spn_H_1, inserted into attOXC, and
Spn_H_2, inserted into att
Spn_18. It is noteworthy that the gene encoding the
Spn_H_2 endolysin is truncated, indicating that this phage is defective and depends on the endolysin encoded by
Spn_H_1 and/or on the host LytA N-acetylmuramoyl-L-alanine amidase for the liberation of its progeny. The phages identified in the genomes of strains CDC3059-06 and CDC1873-00 are both inserted in attOXC and belong to group 1 phages (Fig. 1B). Interestingly, conserved genes cg1 and cg2 as well as putative virulence factors like vapE-like and pblB-like genes are also present in the prophages identified in this group of strains.
Conclusions. The main objective of this study was to compare and analyze the sequences of 10 temperate bacteriophages of S. pneumoniae. The presence of temperate bacteriophages in pneumococcal isolates is quite high, but their genetic contents have not yet been well characterized. Our analysis suggested that the genomes of temperate pneumophages can be placed into groups but that there is intergroup recombination that takes place as well as horizontal gene exchanges between phage populations within a group. The remarkable number of genes encoding proteins with no similarities to annotated proteins showed the potential for identifying novel products of biological importance. Moreover, the genome analysis performed in this study has provided us with the knowledge to design a novel method for the detection and identification of temperate bacteriophages in clinical isolates of S. pneumoniae as recently described (55). In addition and most importantly, the sequences of new bacteriophage genomes will help to ascertain the implication of temperate bacteriophages in the virulence of S. pneumoniae.
P. Romero is the recipient of a postdoctoral fellowship from the Spanish Ministerio de Educacion y Ciencia (EX-2006-0759). This work was supported by a grant from the Dirección General de Investigación Científica y Técnica (SAF2006-00390) and by grants DC05659, DC04173, and DC02148 from the U.S. NIH NIDCD. CIBER de Enfermedades Respiratorias (CibeRes) is an initiative of ISCIII.
Published ahead of print on 5 June 2009. ![]()
Supplemental material for this article may be found at http://jb.asm.org/. ![]()
|
|
|---|
Sfi21. Virology 233:136-148.[CrossRef][Medline]
owski. 2001. Mechanisms of plasmid stable maintenance with special focus on plasmid addiction system. Acta Biochim. Pol. 48:1003-1023.[Medline]
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2010 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»