Previous Article | Next Article ![]()
Journal of Bacteriology, December 2003, p. 6968-6975, Vol. 185, No. 23
0021-9193/03/$08.00+0 DOI: 10.1128/JB.185.23.6968-6975.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
Veterans Affairs Medical Center and University of California, San Francisco, California 94121
Received 9 June 2003/ Accepted 7 September 2003
| ABSTRACT |
|---|
|
|
|---|
| TEXT |
|---|
|
|
|---|
Streptococcus mitis is a leading cause of infective endocarditis (16). Despite the prominent role of S. mitis in endocarditis, few virulence determinants of this organism have been identified. Our laboratory's previous studies have examined the direct binding of human platelets by S. mitis (4, 28, 29). This interaction is thought to be important both for the initiation of endocardial infection and for the subsequent development of endocardial vegetations. Our investigators recently identified two cell wall-associated proteins (PblA and PblB) of S. mitis that contribute to platelet binding by this organism (4). Of note, these two proteins are encoded by an inducible prophage (SM1) that is present in the genome of S. mitis strain SF100 (5). Although the precise mechanism by which PblA and PblB enhance binding has not been defined, the localization of these proteins suggests that they may mediate the direct attachment of S. mitis to the platelet surface. Thus, PblA and PblB appear to be highly unusual examples of phage-encoded virulence factors that serve as adhesins for human tissue.
In our previous work, we showed that the mature SM1 phage particle was composed of an isometric, icosahedral head and a long noncontractile tail (5). The phage genome was estimated to be approximately 35 kb in length. Sequence analysis of a portion of the genome revealed that the putative virulence genes pblA and pblB reside in a region that encodes morphogenetic proteins. Moreover, Western blotting studies demonstrated that, in addition to being localized to the bacterial cell wall, alternate forms of PblA and PblB were components of the phage particle.
Since no other phages of S. mitis have been reported to date and because PblA and PblB are rare examples of phage structural proteins that are associated with bacterial adherence, we sought to examine SM1 in greater detail. Given the association between the Pbl proteins and platelet binding, we also sought to identify any other potential virulence factors encoded by the phage. For these reasons, we have completed the sequencing of the SM1 genome and have begun to assign functions to the various open reading frames (ORFs) encoded therein.
Phage DNA was extracted from purified phage particles and sequenced to sixfold coverage from shotgun subclone libraries of the SM1 genome (Genome Therapeutics, Waltham, Mass.). Closure of the gaps between contigs was accomplished by sequencing gap-specific PCR products generated using the SM1 genome as a template. The full phage genome was found to consist of 34,692 bp and has a G+C content of 39 mol%. The genomic sequence was analyzed using version 2.1 of GeneMarkHMM for prokaryotes (http://opal.biology.gatech.edu/GeneMark/gmhmm2_prok.cgi) (21) to define potential coding regions. The algorithm identified 56 ORFs that encode proteins of 40 or more amino acids (Fig. 1). These ORFs are grouped into three discrete modular regions that reflect the phage life cycle: (i) integration and immunity, (ii) replicative and early lytic functions, and (iii) morphogenesis and lysis. The integration/immunity module, comprised of eight genes, is located immediately downstream of the left-end att site (Fig. 1).
|
The immunity regions of many phages of low-GC-content gram-positive bacteria contain a Cro-like protein encoded divergently, but immediately upstream, from the repressor (19). However, SM1 appears to have a Cro-like protein encoded by gene 8, in a position that separates it from the putative promoter-operator region by three genes (genes 5 to 7). Although it is currently not known whether this separation affects the functional regulation of lysis and lysogeny, it does represent an unusual organizational feature of the SM1 genome.
The region of the phage genome extending from the end of the integration/immunity module toward the terminase gene defines the module related to replication and early lytic functions. Proteins encoded within this region show varying degrees of identity to proteins from numerous other phages, including Streptococcus thermophilus, S. pneumoniae, Streptococcus pyogenes, Streptococcus agalactiae, and Lactococcus lactis. Although it was not possible to assign probable functions to many of the proteins in this region, due to a lack of similarity to proteins of known function, motif and PSI-BLAST searches did reveal possible functions for a number of gene products. gp15, encoded by one of the most 5'-proximal genes in the region, has a high percentage of identity to antirepressors, including that of phage LambdaSa2 of S. agalactiae 2603V/R (77%) and a phage of S. pyogenes MGAS8232 (72%). These antirepressors function in other phages to relieve the inhibition mediated by the cI-like repressor proteins by binding directly to these repressors subsequent to initiation of the SOS response upon DNA damage (10, 26, 27). Therefore, gp15 may facilitate the entry of SM1 into a lytic phase of development upon induction by DNA-damaging agents.
The products of genes 19 and 21 are homologues of DNA replication proteins from a wide range of species. Analyses using the ProDom (8) and Pfam (3) algorithms indicated that gp19 and gp21 contain DnaD and DnaC motifs, respectively. This suggests that gp19 and gp21 function in genome replication. DnaD has been shown to mediate interaction with the replication initiation protein DnaA, with DnaC, and also with DnaD (7, 17). Therefore, gp19 may mediate primosome assembly and replication initiation in a manner similar to DnaD. The ATP-binding DnaC protein has been demonstrated to be an energy-dependent helicase (9). Thus, the gp21 orthologue may serve to unwind the phage genome during replication.
Gene 34 appears to encode a phage structural component having homology to a protein from at least two low-GC-content phages. Of interest, both a lactococcal phage (r1t) and a related, uninducible phage of S. pyogenes (SF370.3) contain proteins homologous to gp34 that are in similar positions within their respective genomes (11, 30) (Fig. 2). Specifically, the SM1, r1t, and SF370.3 appear to have a potential portal protein encoded by a gene located upstream of the gene encoding the terminase, while the other phages in this family maintain their head genes as a cluster located downstream of the terminase. Therefore, although the location of gene 35 and its homologues is unusual relative to most low-GC-content phages, it may reflect an organization that is characteristic of a subfamily of related phages.
|
The third modular region of the phage includes genes encoding the morphogenetic and lytic proteins. gp38 shows greatest identity (79%) to the terminase of phage r1t. A number of putative structural proteins are encoded downstream of gene 38. PSI-BLAST searches indicated that genes 40 and 46 encode the major head and tail proteins of the phage, respectively. Other components of the phage include PblA and PblB (gp49 and gp51), which may function in the phage as putative tail tape measure and tail fiber/host recognition proteins, respectively. The protein encoded by gene 55 shows high identity to the holin of the S. pneumoniae phage VO1 (84%). The putative phage lysin, a homologue of the N-acetylmuramoyl-L-alanine amidase of the S. pneumoniae phage Dp-1 (72%), is encoded by gene 56. Most of the remaining genes (gp41 to -45, -47, -48, -50, and -52 to -54) show similarity to phage proteins that have no known function.
Interestingly, phages SM1, r1t, and SF370.3 appear to have specific genes in distinct positions relative to many other low-GC-content phages. The products of genes 38 through 49 of SM1 are homologous with, and are arranged similarly to, that of gp29 through -40 of phage r1t and gp26 through -37 of SF370.3. In addition, the position of the putative structural protein encoded by gene 35 outside of the morphogenetic module is conserved among SM1, r1t, and SF370.3 (Fig. 2). Thus, the arrangement of these genes, as well as the gp34 homologues discussed above, may be indicative of a discrete subfamily.
The phage att and cos sites are two distinct DNA domains that define the ends of the integrated and mature phage genome, respectively. To identify the att sites (attL and attR) of the integrated phage, inverse PCR (4) was utilized, using primers designed to read out from the phage sequence and into the genome of the lysogen. The junction regions between the integrated phage genome and host chromosomal sequences contained a repeated, 16-bp sequence (5'-GTAGATGGATTCTAAT-3' [attL] and 5'-GTAGATGGTTTCTAAT-3' [attR]) representing the sequences duplicated as a result of the recombination between attP and attB. Examination of the sequence of the phage genome revealed that the phage attachment sequence (attP) was identical to that of attL. Furthermore, sequencing of the region of SF100 chromosomal DNA from which the phage had excised revealed that attB was identical to attR.
We had reported previously that an 18-kb EcoRI restriction fragment of the SM1 genome could be resolved into 15-kb and 3-kb fragments by heating the products of the restriction digest prior to separation by agarose electrophoresis. This indicated that the cos region is present within the 18-kb EcoRI restriction fragment. When we compared this and various other restriction patterns predicted from the phage sequence data, we were able to narrow the position of the cos site to an intergenic region between genes 33 and 34. Primers that read outward from the genes and toward the cos site were used for sequencing from the SM1 genome. Comparison of the sequence obtained with the whole genome sequence revealed that SM1 has a cos site with a 13-nucleotide 3' overhang (Fig. 3).
|
AB) were isolated as described previously (5). The protein components were separated by SDS-PAGE on 4-to-15% gradient Ready Gels (Bio-Rad) and either stained with silver to visualize all proteins within the purified preparations or used to isolate proteins for N-terminal sequencing. Three major and several minor phage structural proteins were apparent on the stained gels (Fig. 4). N-terminal sequence analysis of the two largest major proteins (apparent masses of approximately 190 and 160 kDa [NH2-XEAQLSKGNL-COOH]) indicated that both are encoded by gene 40. This gene is predicted to encode a 32-kDa protein (gp40). Thus, these large structural proteins may represent stable hexameric and pentameric forms, respectively, of gp40. Furthermore, these sequences revealed that the major head protein had not undergone proteolytic processing. Sequence analysis of the smallest of the three major proteins (NH2-ATEANVTTAK-COOH) indicated that it is encoded by gene 46. The apparent molecular mass of this protein (28 kDa) is somewhat higher than predicted (19 kDa) for this gene product.
|
AB (Fig. 4). The 190- and 160-kDa proteins described above were present in both SM1 and SM1
AB. However, the protein identified as gp46, in addition to four other proteins that have yet to be characterized (Fig. 4), were absent from SM1
AB. These findings corroborate the bioinformatic data indicating that gp46 is the major tail protein and that gp40 is the SM1 major capsid protein.
Complete sequencing and extensive bioinformatic analyses have revealed several interesting characteristics of the SM1 genome. As has been described for other members of the Siphoviridae, the ORFs of SM1 appear to be arranged in discrete modules that reflect the phage life cycle. However, we did find some notable differences in gene arrangements between SM1 and typical members of the phage family. In
, the integrase is separated from the repressor by 14 genes. However, like many phages of low-GC-content gram-positive bacteria, the integrase and repressor genes of SM1 were clustered within just a few genes of each other, in a manner distinct from that of the prototypic Siphoviridae (19). Therefore, this conserved genetic organization, which is also present in SM1, may be a hallmark that distinguishes these phages from the larger superfamily. Additional unusual organizational features present in the genomes of SM1, r1t, and SF370.3 include the distinctive position of a putative structural gene (gp34 in SM1), as well as the one-to-one alignment of 12 morphogenetic genes with those of phages. These features further suggest that these phages are members of a discrete subdivision of Siphoviridae.
Classically, morphological features have been used for grouping or subdividing phages (1). However, as phages are characterized at the molecular level, differences in assembly mechanisms are being recognized which may not result in obvious morphological differences. In prototypic phage such as
, the heads are assembled from the major capsid protein, along with a few accessory proteins, into an icosahedral shell. In most cases, these structures can be disassembled to release monomers of the major capsid protein. However, it was recently shown that the lambdoid coliphage HK97 may assemble its head by a second mechanism in which it is able to form a completely covalently linked shell consisting of a catenated latticework of hexamers and pentamers (32). This form of highly cross-linked capsid is also the mature form of the head for a number of other phages, including the coliphage HK022 (18), L5 of Mycobacterium tuberculosis (13), and D3 of Pseudomonas aeruginosa (12). Thus, morphologically similar phages can be subdivided, based on differences in the capsid assembly mechanism.
Although SM1 and r1t resemble other members of the family Siphoviridae morphologically, our results with SM1 along with studies of r1t by others (30) indicate that these phages use a third, novel mechanism for head morphogenesis. That is, the major capsid proteins form discrete stable hexameric and pentameric oligomers. In HK97, stable hexamers and pentamers are catenated to form a covalently linked superstructure, as is evident from the relatively small amount of these oligomers present within phage preparations separated by SDS-PAGE (25). However, preparations of SM1 separated by SDS-PAGE (Fig. 4) revealed abundant amounts of pentamers and hexamers, indicating that they are indeed free to dissociate from the entire head structure. This distinct mechanism for capsid assembly may represent another feature that distinguishes SM1 and r1t from other double-stranded DNA icosahedral phages.
We showed previously that SM1 encodes two putative virulence factors, PblA and PblB, that mediate the binding of S. mitis to human platelets. Bioinformatic analysis of the proteins encoded in the SM1 genome did not reveal the presence of any other traditional virulence factors, such as toxins. However, the carboxyl terminus of one gene product showed 43% identity to MafB, a putative neisserial adhesin. gp35 of SM1 appears to have arisen through the acquisition of a small portion of a neisserial chromosomal locus that contains mafB followed by a short neisserial ORF of unknown function. Despite the homology to a putative adhesin, further genetic and biochemical analysis will be required to determine whether gp35 is localized to the SF100 surface and whether gp35 contributes to the virulence of S. mitis.
Phage evolution appears to involve a great deal of genetic exchange that leads to a mosaic genome (14, 15). Such mosaicism is thought to occur through illegitimate recombination across the entire chromosome (14). Close inspection of the SM1 genome revealed that it contains homologs of over 20 different individual phages or bacterial species. The presence of regions of homology to neisserial sequences within genes 15 and 35 of the functional SM1 genome (Table 1) is of particular interest, since these neisserial coding sequences are likely to represent an abrupt shift for SM1 away from its ancestral subfamily. This high level of genetic plasticity may enhance its ability to disseminate its virulence factors.
|
| FOOTNOTES |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Appl. Environ. Microbiol. | Infect. Immun. | Eukaryot. Cell |
|---|---|---|
| Mol. Cell. Biol. | J. Virol. | Microbiol. Mol. Biol. Rev. |
| ALL ASM JOURNALS |