Previous Article | Next Article ![]()
Journal of Bacteriology, September 2006, p. 6101-6114, Vol. 188, No. 17
0021-9193/06/$08.00+0 doi:10.1128/JB.00581-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Département de biochimie et de microbiologie, Faculté des sciences et de génie, Groupe de recherche en écologie buccale, Faculté de médecine dentaire, Felix d'Hérelle Reference Centre for Bacterial Viruses, Université Laval, Québec, Québec, Canada G1K 7P4
Received 24 April 2006/ Accepted 19 June 2006
|
|
|---|
|
|
|---|
Classification of the phages is still controversial, and different propositions have been made in recent years (10, 65, 74). Previous classification studies relied on the comparison of virus morphology and DNA-DNA hybridizations using whole genomes. According to these criteria, 12 lactococcal phage species representing distinct phage groups were proposed (37, 38). This classification of lactococcal phages was recently revisited, and 10 genetically diverse groups of phages, sharing very limited nucleotide similarities, were proposed (26). Among these, two new species were identified, one of which was the type phage Q54.
Phage genomics has considerably expanded in the last 5 years, and as of June 2006, 362 complete genomes are available at GenBank. Of those, 14 are from lactococcal phages. Among the available genomes, only members of the three main groups of L. lactis phages are represented, namely, the 936, c2, and P335 species. Ten of the genomes are from phages of the P335 species (4, 9, 15, 40, 50, 68, 74, 75), whereas only four sequences are from 936- and c2-like phages, namely, bIL170 and sk1 (936 species) as well as c2 and bIL67 (c2 species) (12, 18, 48, 67). The lack of genome sequences from the less frequently isolated phage species is probably explained by the higher industrial incidences of failed fermentations due to the members of the three above predominant species (37, 39).
According to the proposed classification scheme, lactococcal phages are highly diverse between species but rather homogeneous within the same species. For example, virulent phages within the 936 or c2 species are highly similar at the DNA level and up to 90% of their genome length can be aligned with over 70% nucleotide identity (15, 18, 48). As a result, most of the open reading frames (ORFs) share more than 80% amino acid identity, some being almost identical. Point mutations and small insertion/deletions (indels) were shown to account for most of the differences in the genome sequences of lytic phages (18, 48). Due to the lack of temperate members, homologous recombination between members of the 936 and c2 species is not expected to occur frequently, although it may occur during coinfection (15). On the other hand, the P335 group of phages, comprising both temperate and lytic members, is less conserved and has a polythetic nature. Indeed, several P335-like phages share conserved modules, but there is no single ORF shared by all members of this group (4, 9, 15, 23, 24, 40). Comparative genomics of these P335-like phages thus revealed a highly mosaic structure, defining functional modules that are exchangeable through homologous recombination, in addition to point mutations and indels (15, 23, 57).
Structural proteins that make up the viral particle have been studied for only a few lactococcal phages including c2 (48) and p2/936 species (69) as well as the P335-like phages ul36 (40), r1t (75), BK5-T (50), and TP901-1 (76). Posttranslational modifications of structural proteins, especially the major capsid protein (MCP), have been well documented for a number of model phages such as the coliphage HK97 (29). The MCP from the lactococcal phage c2 was also shown to undergo proteolytic cleavage and cross-linking during capsid assembly (48). Although proteolysis of the capsid has been reported for a number of other members, cross-linking among lactococcal phages has been so far reported only for phages c2 and r1t (48, 75). In other phages, cross-linking of viral subunits does not appear to be frequent, except perhaps for mycobacteriophages, where this mechanism seems to be more common (28, 34).
Programmed translational frameshifting is well documented in double-stranded DNA (dsDNA) bacteriophages, including Escherichia coli phage
(44). Generally, it involves tail proteins, more specifically the major tail protein (MTP) and an overlapping ORF preceding the tape measure protein (79). Most of the frameshift events recognized so far are of the 1 type, where a slippery sequence composed of repeated nucleotides enables the ribosome to slip one nucleotide backward before continuing translation on the corresponding overlapping frame. An unusual 2 frameshift was recently demonstrated in tail proteins from phage Mu (79). Only a few examples of +1 frameshifting have been documented in E. coli, with the well-described examples of release factor 2 (RF2) (17) and an expression system involving human transferrin (Tf) (25). In the case of Tf, a "shifty stop" with the sequence CCC.UGA was found to be responsible for ribosome frameshifting (25). Recently, the first case of programmed +1 translational frameshifting in dsDNA phages was reported for the Listeria monocytogenes temperate phage PSA, where elongated versions of the capsid protein and MTP were identified (80). In the MTP frameshift, a CCC.UGA "shifty stop" at the end of the tail gene was also found to be responsible for the event. Notably, a pseudoknot 3' to the slippery signal was probably also involved in the mechanism (80). Secondary structures downstream from slippery signals have been shown to promote frameshifting (43).
Here, we report the microbiological and molecular characterization of a novel lactococcal phage species, for which the lytic phage Q54 is currently the only member. Complete genomic and transcriptional analyses are provided which reveal an unusual module configuration and shed light on its peculiar origin. A detailed analysis of the structural proteins uncovered interesting features, including cross-linking of the MCP and a programmed +1 translational frameshift between the MTP and the receptor-binding protein (RBP). To our knowledge, this is only the second case of programmed +1 translational frameshifting in dsDNA phages and the first case of a translational coupling between a major tail protein and an RBP, the latter being involved in host recognition.
|
|
|---|
Microbiological assays. One-step growth curve assays were performed as reported elsewhere (55). The burst size was determined by dividing the average phage titer after the exponential phase by the average titer before infected cells began to release virions (55). The efficiency of plaque formation (EOP) was calculated by dividing the phage titer on the tested L. lactis strain by the titer on the sensitive control L. lactis strain.
Phage DNA analysis. Phage Q54 genomic DNA was isolated using the Maxi Lambda DNA purification kit (QIAGEN). Genome sequencing was first performed using a shotgun cloning strategy (Integrated Genomics Inc., Chicago, Ill.). Completion of the genome was done by direct sequencing of the phage DNA with primers and using an ABI Prism 3700 apparatus from the genomic platform at the research center of the Centre Hospitalier de l'Université Laval. The cos site was determined by direct sequencing of the phage DNA and by sequencing of a PCR product encompassing the T4-ligated cohesive termini. Standard PCR and ligation procedures were used (66).
RNA isolation and Northern blot analysis. Two hundred milliliters of GM17 was inoculated (1%) with an overnight culture of L. lactis SMQ-562, and cells were grown at 30°C until the optical density at 600 nm reached 0.1. CaCl2 was added to a final concentration of 5 mM, and incubation was continued for 5 min. Cells were then centrifuged at room temperature for 2 min at 10,000 rpm in a Beckman GSA rotor. The pellet was suspended in 20 ml of GM17 containing 5 mM CaCl2 to obtain a final optical density at 600 nm of 1.0. At time zero, a sample of 1.5 ml was removed and quickly centrifuged for 15 s, the supernatant was removed, and the cell pellet was snap frozen in liquid nitrogen. Cells were stored at 80°C until RNA extraction. Phage Q54 was added to the remaining culture at a multiplicity of infection of 10. Aliquots of 1.5 ml were withdrawn at various time intervals and processed as described above. Lysis of the remaining phage-infected culture after 45 min was indicative of the completion of the phage lytic cycle.
The next day, cells were thawed on ice and resuspended in a total volume of 100 µl of a sucrose solution (0.5 M) containing 120 mg/ml lysozyme. The mixture was incubated 10 min at 37°C prior to the addition of 1 ml of Trizol (Invitrogen), as recommended by the supplier. This step was necessary to achieve rapid and complete cell lysis. Following its isolation, the RNA was treated with 30 units RNase-free DNase I (Roche) for 30 min at 37°C in the presence of Protector RNase inhibitor (Roche) and quantified by UV spectrometry at 260 nm. Northern blotting was performed after separation of 5 µg of RNA through a 1% agarose-formaldehyde denaturing gel (66). Probes used to detect mRNAs consisted of 32P-labeled oligonucleotides (25- to 35-mer; sequences available upon request) complementary to the coding strand of the gene of interest. Labeling reaction mixtures consisted of 2 pmol of oligonucleotide, 1x reaction buffer, 30 µCi of [
-32P]ATP (GE Healthcare), and 20 units of T4 polynucleotide kinase (Roche) in a final volume of 20 µl. Incubation was performed at 37°C for 30 min, after which 50 µl of distilled water was added. Probes were quickly purified using Micro Bio-Spin 30 columns (Bio-Rad Laboratories). The efficiency of labeling of the purified probes was determined by liquid scintillation counting. A minimum specific activity of 2 x 106 cpm/pmol oligonucleotide was our cutoff in order to use the labeled probe in the hybridization experiments. Membranes were prehybridized at 42°C for at least 1 h in 5 ml of UltraHyb-Oligo hybridization buffer (Ambion) in a 50-ml disposable screw-cap tube. Hybridization was performed overnight at 42°C in the presence of 5 x 106 cpm of the freshly labeled probe. Membranes were washed twice at 42°C for 30 min in 2x SSC (1x SSC is 0.15 M NaCl plus 0.015 M sodium citrate) plus 0.5% sodium dodecyl sulfate (SDS) and then exposed to Kodak XOMAT-AR films at 80°C with intensifying screens.
SDS-PAGE analysis of structural proteins.
Purified phages (
1 x 1011 PFU/ml) were analyzed for structural proteins by standard Tris-glycine 12% SDS-polyacrylamide gel electrophoresis (PAGE) procedures (42). Samples were mixed with 2x sample loading buffer and boiled for 10 min before loading. Proteins were detected after Coomassie blue staining. For phage Q54, further concentration of the phage suspension was required before electrophoresis. This was achieved by centrifuging 1.5 ml of CsCl-purified phage suspension through Nanosep 3k columns (Pall), which led to a further 1-log concentration factor. Protein bands were cut out of the gel, digested with trypsin, and identified by liquid chromatography-tandem mass spectrometry (LC-MS/MS) (Génome Québec Innovation Centre, McGill University). For comparison purposes, the structural proteins of the reference phage c2 (48) were also analyzed by SDS-PAGE.
Bioinformatics analysis. DNA sequence analysis, contig assembly, and editing were done using the Staden package 1.5.3 (available at http://staden.sourceforge.net/). Some editing was also done using BioEdit 7.0.5.2 (33). Protein sequence comparisons were performed using BLASTP 2.2.13 (2), and conserved domains were identified using RPS-BLAST 2.2.11 and the Conserved Domains Database (CDD) at the National Center for Biotechnology Information (NCBI) (51, 52). PSI-BLAST was also used to search for more distant homologous proteins when standard BLAST hits gave only proteins with putative or unknown functions. InterProScan at EMBL-EBI (http://www.ebi.ac.uk/InterProScan/) was used to identify conserved domains in some proteins that did not show any significant similarity by BLAST searches. Putative RNA secondary structures were determined using Vienna RNA secondary structure prediction v1.5 at http://rna.tbi.univie.ac.at/cgi-bin/RNAfold.cgi and MFOLD at http://bioweb.pasteur.fr/seqanal/interfaces/mfold-simple.html (81).
Nucleotide sequence accession number. The sequence data reported in this paper have been deposited in EMBL/GenBank/DDBJ databases under accession number DQ490056.
|
|
|---|
![]() View larger version (189K): [in a new window] |
FIG. 1. Electron micrograph of phage Q54. Bar, 50 nm.
|
The sensitivity of phage Q54 towards three phage resistance mechanisms, namely, AbiK (32), AbiQ (31), and AbiT (6), was determined. Numerous phage resistance mechanisms are currently used by the dairy industry to control the three main phage species (936, c2, and P335). Thus, we were interested to know if the new phage species Q54 could be controlled by these abortive infection mechanisms (Abis). High-copy-number plasmids expressing the three Abi systems were transferred into L. lactis strain SMQ-562, and the resulting transformants were challenged with Q54. AbiQ and AbiT were very effective against Q54, with mean EOP values of
108 and 3.3 x 107 ± 2.2 x 107, respectively. On the other hand, AbiK was ineffective against this phage (EOP of 0.81 ± 0.54). AbiK is generally very efficient against the three main phage species (32), which confirms the distinct nature of Q54.
Complete nucleotide sequence of Q54 genome.
A shotgun cloning strategy was used to sequence
90% of the Q54 genome. Primer walking and direct sequencing of the phage DNA were performed to complete the remaining 10% covering both genome extremities. Runoff sequencing reactions enabled us to locate the exact ends of the linear phage genome. No terminal redundancy was observed, suggesting a cos-type phage. Confirmation was made by ligating the cohesive termini with T4 DNA ligase followed by sequencing of a PCR fragment spanning the resulting junction (Fig. 2). The cohesive termini are composed of single-stranded 10-bp 3' overhangs (Fig. 3A). The complete genome sequence of 26,537 bp has a molar G+C content of 37.1 mol%, which is similar to those of L. lactis and other lactococcal phages (5, 12, 40, 48).
![]() View larger version (18K): [in a new window] |
FIG. 2. Q54 complete genome (26,537 bp). (A) Genomic organization. A scale (in kilobases) is shown above the genetic map. Each arrow represents a putative ORF, and the numbering refers to Table 1. Gray arrows represent ORFs for which a putative function could be inferred from bioinformatics analyses (indicated above the ORFs). White arrows represent ORFs for which no putative function could be attributed. Hairpins represent putative Rho-independent transcriptional terminators. Arrows with heavy outlines represent gene products detected by LC-MS/MS analysis (Fig. 4). (B) Phage Q54 mRNA transcripts detected in the course of an infection of L. lactis SMQ-562. Short thick lines above the transcripts indicate probes. The size (in kilobases) is indicated on the right or left of each transcript (arrows), and the line thickness is representative of their relative abundance, with the thicker line corresponding to the highest concentration. Arrowheads show the direction of transcription, and the temporal expression (early or late) is indicated below the genome scale.
|
![]() View larger version (19K): [in a new window] |
FIG. 3. Analysis of Q54 cos region. (A) Analysis of the cos site and flanking regions. Typical features found in the cos vicinity of other phages, such as direct repeats (DR1 and DR2), inverted repeats (IR), A/T-rich region, G- and C-rich segments (G and C box, respectively), and -R-like sequences ( -like), are indicated. Details of the cohesive termini are also shown. The sites of cleavage resulting in 3'-overhang termini are indicated by vertical arrows. Italic characters with reverse highlighting represent nucleotides that are identical to a consensus sequence found in all lactococcal phages for which the cos region has been sequenced (see text for details). (B) Multiple alignment of -R-like sequences from Q54, c2, and sk1 with those of -R sequences from phage . Conserved nucleotides are shaded.
|
-like R consensus sequences (19). In
, the R sites are 16 nucleotides long whereas in Q54 the consensus was extended to 19 nucleotides from comparison with those previously reported for phages c2 and sk1 (Fig. 3B) (62). The
-R sites are involved in terminase recognition and binding as well as termination of packaging. However, in contrast to phages
and c2, the Q54
-R sequences are unevenly distributed across a 900-bp region. In the former, the
-R sites are regularly spaced (
250 bp between R sites). Alignment of the R sites from
with the
-like R sites from phages Q54, c2, and sk1 (Fig. 3B) highlighted common bases likely involved in the packaging process. Comparison of Q54 ORFs and functional assignment. Further bioinformatics analyses enabled the identification of 47 putative ORFs of 35 amino acids or more (Fig. 2A; Table 1) in the genome of phage Q54. The standard ATG start codon and also alternative start codons were used to locate ORFs. These were determined on the basis of the presence of a suitable ribosome binding site complementary to the 3' end of the 16S rRNA of L. lactis located upstream of the selected start codon as well as the absence of extensive overlapping between ORFs. The longest gap observed was 286 bp between ORFs 9 and 10, and the longest overlap tolerated was 13 bp between ORFs 10 and 11 (Table 1). There are two main groups of ORFs in Q54: one group spanning almost the entire genome and which is on the upper strand and a second group of 10 ORFs (orf35 to orf47) encoded by the lower strand (approximately 3.5 kb) at the right end of the genome (Fig. 2A).
|
View this table: [in a new window] |
TABLE 1. Putative ORFs deduced from Q54 genome sequence and their predicted functions
|
(i) ORF11.
The best BLAST result for ORF11 is a protein with no predicted function from L. lactis SK11 (39% identity; 77/197) (accession no. ZP_00382242). Similarities were also found with proteins of the essential recombination factor (ERF) family, including Sak2 from phage P335 (44% identity; 67/152) (8). A conserved ERF domain (pfam04404) was also identified in ORF11. This domain is present in DNA single-stranded annealing proteins of the ERF superfamily of proteins, which function in RecA-dependent and RecA-independent DNA recombination pathways, such as RecT, Red-ß, and Rad52. In phages with cohesive ends (such as Q54), the role of such a recombination protein was proposed to be similar to that of Red from phage
, which increases the amount of packageable DNA (67).
(ii) ORF12. ORF12 shows similarity with several bacterial single-stranded binding (SSB) proteins including those of L. lactis strains IL1403 (5) and SK11 (Table 1). Similarity was also observed with phage-encoded SSB proteins such as ORF9 from Streptococcus thermophilus phage 7201 (59% identity; 97/153) (70) and ORF1 from Streptococcus pneumoniae phage VO1 (58) (55% identity; 98/180). SSB proteins are involved in protection of single-stranded DNA segments generated during DNA replication (60, 78).
(iii) ORF16. CDD searches revealed the presence of conserved domains within ORF16, namely, S1 (ribosomal protein S1-like RNA-binding domain: smart00316 and pfam00575), RNB (RNB-like protein: pfam00773), and VacB (exoribonuclease R: COG0557). These domains are found in RNA-binding proteins involved in RNA metabolism (e.g., transcription). Comparison with protein databases showed similarity with several chromosome-encoded RNases and exoribonucleases from diverse bacterial origins (Table 1).
(iv) ORF17. The deduced protein ORF17 is similar to gp80 from Listeria phage P100 (Table 1) (11) and ORF98 from Staphylococcus aureus phage K (29% identity; 75/251) (59), which have unknown functions. After PSI-BLAST analysis (four iterations), we also found homologues for which a DNA polymerase function was predicted. The most significant hits included a DNA-directed DNA polymerase from the uncultured archaeon GZfos19C8 (15% identity; 49/315, E = 1E-49) and a DNA polymerase delta small subunit from Methanothermobacter thermautotrophicus strain Delta H (14% identity; 47/314, E = 3E-47) (accession no. AAU82757 and AAB85882, respectively). An InterProScan analysis also revealed the presence of a metallophosphoesterase domain (IPR004843) found in a large number of proteins involved in phosphorylation, including DNA polymerases, exonucleases, and other phosphatases.
(v) ORF18. ORF18 is similar to a putative protein from Streptococcus pyogenes prophage 315.3 (29% identity; 47/158) (3) and also to a conserved phage-associated protein from Streptococcus suis 89/1591 (24% identity; 39/157) (accession no. ZP_00875013). There is also similarity with a crossover junction endodeoxyribonuclease, RuvC from Chlorobium limicola DSM 245 (25% identity; 34/131, E = 0.016) (accession no. ZP_00511492). After two iterations of PSI-BLAST analysis, further hits were obtained with RuvC-like Holliday junction resolvases, including ORFm3 from lactococcal phages bIL170 (936 species, 25% identity; 40/157) (18) and ORF53 from phage sk1 (936 species, 25% identity; 40/158) (12). No hits were found with gene products from phages of the c2 species. However, BLAST analyses using ORF55 from phage sk1 revealed significant similarity with ORFl2 from phages c2 (44% identity; 71/161) and ORF23 from phage bIL67 (42% identity; 69/161) (67), which was recently demonstrated to be a RuvC-like resolvase (21).
Structural genes and morphogenesis. Phage Q54 morphology is similar to that of the prolate-headed phages of the c2 species, and compared using BLAST analyses, most of the structural proteins were related to phage c2 or bIL67, or other members of the c2 species for which partial sequences are available.
(i) ORF20. The only significant hits are ORFl4 from phage c2 (21% identity; 46/210), ORF25 from phage bIL67 (20% identity; 37/177), and a putative portal protein from a Streptococcus agalactiae prophage (17% identity; 28/159, E = 0.006) (accession no. AAN00717). The similarity with the latter protein and its location upstream from the capsid protein suggest that ORF20 from Q54 might be the putative portal protein. Also, a protein corresponding to this ORF was identified in the virion structure by SDS-PAGE mass spectrometry analysis (Fig. 4). It was previously suggested that ORFl4 of phage c2 might be a portal protein (22).
![]() View larger version (26K): [in a new window] |
FIG. 4. LC-MS/MS analysis of structural proteins of phage Q54. (A) Coomassie blue staining of a 10% SDS-polyacrylamide gel showing structural proteins from phages Q54 and c2. Arrows and letters on the right indicate bands cut out from the gel and identified by LC-MS/MS. The sizes (in kilodaltons) of the different proteins from the broad-range molecular mass standard (M) are indicated on the left. (B) Identification of Q54 viral proteins from corresponding bands shown in panel A.
|
(iii) ORF23. ORF23 is most likely the MTP, because it is similar to homologues in phages c2 (ORFl7) and bIL67 (ORF28) (29% identity; 61/205 and 62/211, respectively) (48, 67). Again, a very low percentage of amino acid identity was observed, contrasting with the 88% identity (182/205) found between ORFl7 and ORF28 from phages c2 and bIL67, respectively. We also detected the corresponding protein in the structure of the virion (Fig. 4).
(iv) ORF24. This protein is the third most conserved in Q54, showing similarity with several tail host specificity proteins and accessory tail fibers from diverse phages infecting both L. lactis and S. thermophilus. We noted 59% identity (120/201) with ORF45 from the lytic lactococcal phage 4268 of the P335 species (74). Similarity was also observed, although with slightly lower E values, with ORFl15 from L. lactis phage c2 (29% identity; 62/211) (48). InterProScan analyses also revealed the presence of a galactose-binding-like domain (IPR008979) within ORF24 of phage Q54. This domain is involved in binding to cell surface-attached galactose-containing carbohydrates of prokaryotes and eukaryotes (36). Moreover, ORF24 was found to be similar to the variable region 1 (VR1) of S. thermophilus phages Sfi11, Sfi19, Sfi21, MD2, and DT2, which were previously shown to be involved in host recognition (30). These findings further support the predicted function of ORF24 as the receptor-binding protein (73).
(v) ORF26. ORF26 shares similarity with several c2-like phage tail proteins, and the best match was with the tape measure protein (ORFl10) of phage c2 (34% identity; 193/567). Other matches included the industrial lactococcal phage isolates 5469 (35% identity; 180/507) and 5440 (28% identity; 148/528) (64). ORF26 was also found in the structure of Q54 as revealed by SDS-PAGE and mass spectrometry analyses (Fig. 4).
(vi) ORF28.
The ORF28 gene product has similarities with ORF32 of phage bIL67 (34% identity; 172/505) and ORFl12 of phage c2 (33% identity; 171/505), both predicted to be terminases (48, 67). A conserved Terminase_1 domain (pfam03354; COG4626) was also identified in ORF28 while CDD searches were being performed. The size of the gene product also suggests that it is the large terminase subunit, although no putative small terminase subunit could be identified by bioinformatic analysis. The presence of only one identifiable subunit of the terminase is not unusual, as it was observed for various lactococcal phages such as c2 (48), 4268 (74), and
LC3 (4) as well as for the Lactobacillus phage
adh (1).
Lysis module of phage Q54. (i) ORF33. No significant BLAST hit was obtained with the deduced protein ORF33. However, the size and the location of this ORF within the genome suggest that it may encode the phage holin. Using InterProScan, we found three transmembrane-spanning segments (amino acids [aa] 10 to 30, 45 to 65, and 77 to 99) and 14 charged amino acids (including six lysine residues) within the 26 C-terminal residues of ORF33. These two elements are characteristics of holins, and the presence of three transmembrane segments would place this protein into class I of the holins (41, 77). Similar to the holins of the c2-like phages, no dual-start motif could be identified in the N terminus of ORF33, as opposed to the holin from phages of the 936 species (41).
(ii) ORF34. This protein is predicted to be the endolysin, and it is one of two gene products of phage Q54 showing an amino acid identity greater than 60%. CDD searches revealed the presence of an N-acetylmuramoyl-L-alanine amidase conserved domain (smart00644, pfam01510). BLAST searches revealed several hits with the endolysins from various phages, especially those of the lactococcal phages of the 936 species, including bIL170 (ORFl22, 61% identity; 154/250) (18) and sk1 (ORF20, 59% identity; 125/211) (12). Notably, a pairwise alignment of the orf34 of Q54 with orfl22 of phage bIL170 showed 60% nucleotide identity over the entire gene, which makes this gene the most conserved at the nucleotide level.
(iii) ORF35, ORF40, and ORF41. BLAST searches with ORF35 gave no significant result, at least based on a threshold E value of 1.0 or below. Nevertheless, a single hit was found with ORFe25 from phage bIL170 when a lower stringency was used for the search (28% identity; 14/50, E = 1.8) (18). ORF40 was found to be similar to ORFe24 from phage bIL170 (44% identity; 45/102) (18) and ORFe26 from phage sk1 (38% identity; 39/102) (12). Finally, ORF41 is similar to early gene products from 936-like phages, including ORF41 from phage sk1 (33% identity; 26/78) and ORFe7 from phages bIL66M1 (27) and bIL170 (both 36% identity; 27/74). Despite these similarities, no function could be assigned to ORF35, ORF40, and ORF41.
Comparative genomics of Q54 versus phages of the 936 and c2 species. To better visualize the similarities between phage Q54 and the lactococcal phages from the c2 and 936 species described above, a whole-genome comparison was performed. Phages c2 and sk1 are closer to Q54 because several BLAST hits (although not always the best) were associated with these phages. As shown in Fig. 6, most of the ORFs present in the morphogenesis module of phage Q54 could be aligned with those of phage c2 (blue arrows). The order, number, and size of ORFs 20 to 29 were well conserved between the two phages. However, other ORFs are more closely related to phage sk1. For example, the protein similarity of Q54 endolysin (ORF34) with that of sk1 (ORF20) suggests that it might originate from a phage of the 936 species such as sk1.
![]() View larger version (37K): [in a new window] |
FIG. 6. Comparative genomics between Q54 and phages of the 936 and c2 species. Genome alignments present similarities between proteins of Q54 and phages c2 (accession no. NC_001706) and sk1 (accession no. NC_001835). ORFs from Q54 showing similarity to ORFs from these phages are linked by gray shading. The absence of shading means there was no significant similarity. The percent amino acid identity indicated inside the shading is representative of the aligned region only, whereas values in parentheses represent the percent amino acid identity over the entire length of the smaller of the two aligned proteins. Phages bIL67 (accession no. L33769) and bIL170 (accession no. NC_001909) were added into the alignment to show the extensive homology existing between phages of the same species. Color shading was used when two genomes from phages of the same species were aligned to discriminate between similarity of >80% amino acid identity (dark color) and similarity of 79% amino acid identity (light color).
|
SDS-PAGE analysis of Q54 structural proteins. The protein composition of Q54 virions was characterized by SDS-PAGE coupled with LC-MS/MS. Since phage Q54 and the reference phage c2 share morphological and some protein sequence similarities, we included a purified sample from phage c2 for comparison purposes. Strikingly, the protein profiles from the two phages were highly similar (Fig. 4A). Mass spectrometry analysis of the nine bands detected after Coomassie blue staining of the gel enabled us to identify the corresponding coding gene in the genome of Q54 and make the correlation with the predicted functions (Fig. 4B). The molecular masses estimated by SDS-PAGE were in agreement with the masses calculated from the gene sequence for six proteins. Discrepancies were observed for proteins detected in bands A, B, D, and E (Fig. 4B).
Three bands of different sizes (A, B, and D) corresponded to the MCP. Additionally, the vast majority of peptides detected by LC-MS/MS from these three bands were mapped within the last 273 aa of the protein. These data strongly suggest that the N-terminal amino acids (250 aa) of the MCP are cleaved off during capsid assembly, as observed for phage c2 (48). Surprisingly, the molecular masses of bands A, B, and D were well over the molecular mass calculated from the gene sequence of the orf21 product, which is 56 kDa. However, inclusion of only the last 273 aa of the protein, as suggested by the LC-MS/MS analyses, would lead to a calculated mass of 29 kDa. Thus, dimers, trimers, and hexamers of this protein would correspond to masses of 59, 88, and 176 kDa, respectively, and would reasonably fit with the observed masses for bands A, B, and D on SDS-PAGE (Fig. 4A). In phage c2, two major bands of 175 kDa and 90 kDa comigrated with bands A and B from Q54 and were previously proposed to be hexamers and trimers of the MCP, respectively (48). Additionally, there was a high-molecular-mass protein complex that barely penetrated the gel in the lanes for Q54 and c2. Although we did not try to identify this protein complex here, it was previously shown for phage c2 that this band contains a higher multimeric form of the MCP (48). According to the above data, the MCP subunits of Q54 are most likely processed and cross-linked during capsid assembly.
The fourth striking difference was the LC-MS/MS analysis of band E (Fig. 4), which revealed the presence of two proteins, namely, the major tail protein (ORF23/MTP) and receptor-binding protein (ORF24/RBP). An equivalent number of peptides covering the two proteins were detected, suggesting an equimolar ratio. This finding was rather surprising as band I (Fig. 4A) also corresponded to ORF23 and the molecular mass observed on SDS-PAGE was close to the mass calculated from the orf23 sequence. Since orf23 is translated from frame 3 and orf24 from frame 1, we investigated the possibility of a translational frameshift occurring between these two ORFs.
A detailed analysis of the intergenic region between orf23 and orf24 revealed the presence of a CCC.UAA codon at the end of orf23 which could act as a putative +1 "shifty stop" (Fig. 5A). Forward slippage (+1) of the translational machinery from this location would cause translation to occur on frame 1, the same as orf24. The frame 1 translational product covering the intergenic region starting from the CCU codon was found to be highly similar to a segment upstream from the region of ORF45 from lactococcal phage 4268 that matched Q54 RBP (Table 1). Importantly, a 2 frameshift would also lead to the same result, i.e., translation in frame 1, but with the addition of a supplementary amino acid (a tyrosine) between the isoleucine (I) and the last proline (P) residue from ORF23. Obviously, this additional amino acid would not be observed in a +1 frameshift. Thus, to confirm the occurrence of a frameshift event and to discriminate between the two possible frameshift mechanisms (+1 or 2), we searched for a matching peptide in band E. Analysis of the LC-MS/MS data enabled the identification of specific peptides mapping to this region and lacking a tyrosine residue, thus corresponding to the translational product resulting from a +1 frameshift (Fig. 5B). The result is an ORF23-ORF24 fusion protein with a calculated mass of 48 kDa, which fits well with the mass observed on SDS-PAGE for band E (Fig. 4A).
![]() View larger version (30K): [in a new window] |
FIG. 5. Programmed +1 translational frameshift. (A) Programmed +1 translational frameshift identified between orf23 and orf24 leading to translation of the 48-kDa protein (band E). A potential hairpin structure, shown above the sequence, was found next to the orf23 stop codon using the MFold program. (B) LC-MS/MS peptide mapping of ORF23-ORF24 fusion protein. The sequence represented here corresponds to that resulting from a +1 translational frameshift. The peptides detected by LC-MS/MS are shaded, and the translated intergenic region between ORF23 and ORF24 is underlined.
|
5 times less frequent than the CCU codon, also encoding a proline (http://www.kazusa.or.jp/codon/index.html). According to the proposed models, the translation machinery would make a pause at the CCC codon, which eventually would force a certain proportion of the ribosomes to slip forward (+1) to the next CCU codon. Translation would then continue downstream in the overlapping frame until the next stop codon. The presence of a two-stem-loop secondary structure next to the stop codon at the end of orf23 of phage Q54 (Fig. 5A) probably favors ribosome stalling as proposed by others (43, 80). Notably, the second stem-loop structure overlaps the ribosome-binding site upstream of orf24 and this could also regulate its translation. Accordingly, we did not detect ORF24 alone on SDS-PAGE (Fig. 4A), suggesting that translation initiation upstream of ORF24 is probably infrequent or may not occur at all. |
|
|---|
Determination of the complete genomic sequence of Q54 confirmed that this phage clearly belongs to a different group, despite its morphological relatedness to the prolate-headed c2 species. The 26,537-bp dsDNA genome was shown to have cohesive termini (cos-type phage) with several features common to other lactococcal phages such as direct and inverted repeats, as well as an A/T-rich region close to the cos site (13, 19, 20, 61, 62). Nonetheless, the cos site [CCAA(N10)AACT] of phage Q54 is atypical. To our knowledge, this is the first L. lactis phage to deviate from the lactococcal phage consensus cos site [TCAN(N4-7)NACT] (62).
Another distinctive feature of phage Q54 is the very low degree of conservation of its deduced ORFs with proteins in databases. Globally, only limited protein similarity was observed with structural proteins of lactococcal c2-like phages (22 to 34% amino acid identity). Interestingly, a few early gene products from Q54 (ORF1, ORF35, ORF40, and ORF41) were similar to early expressed proteins from lactococcal phages of the 936 species. A whole-genome comparison between Q54 and c2 (Fig. 6), clearly suggests the presence of an extra module at the end of the late gene cluster of Q54. Transcriptional analyses demonstrated that this extra module is transcribed early after infection. Two diverging early modules separated by the cos site are thus present in the phage Q54 genome. This is, to our knowledge, the first report of a lactic acid bacterial phage with such a genomic organization. It is tempting to propose that phage Q54 is the result of past recombination events involving 936- and c2-like phages. Genetic drift and recombination events would have then modified the genome of Q54 to its present form. A notable supporting observation was the presence of a conserved DNA region between Q54 and a 936-like phage that is located at the boundary of the late and early module and falls within the endolysin gene (orf34). This region might have been a point of homologous recombination. Phage Q54 is clearly not adapted for optimum propagation, as suggested by its relatively long latent period (for a lactococcal phage) and small burst size. It is possible that Q54 is currently evolving to gain more fitness in order to replicate more efficiently.
Three of the putative ORFs encoded by Q54 (ORF4, ORF11, and ORF24) were more closely related to proteins from P335-like phages than to those from any other phages. ORF131 and Sak2 from the type phage P335, which are homologous to ORF4 and ORF11 of phage Q54, respectively, were previously shown to be involved in homologous recombination events with the host chromosome (7). In phage P335, sak2 is flanked by orf131 on one side and a gene encoding an SSB protein on the other side. In phage Q54, six genes showing no relatedness to any lactococcal phage or bacterial strain separate orf4 and orf11. However, a gene coding for an SSB protein from a lactococcal host immediately follows orf11 in the genome of phage Q54. These observations strongly suggest that Q54 acquired these genes through homologous recombination with prophages from the chromosome of an L. lactis host. Whether these three genes were acquired simultaneously as a functional module, followed by genetic drift and insertion of six additional genes, or whether they were acquired separately through two or more recombination events is unknown.
For strict lytic phages such as members of the 936 and c2 lactococcal phage groups, homologous recombination is less likely to occur due to the absence of temperate members (15). Consequently, finding Q54 genes homologous to bacterial or prophage sequences is intriguing. Some authors suggested that temperate/lytic phages evolve by horizontal gene transfer with prophages residing in the host chromosome, whereas strictly virulent phages evolve by point mutations and small indels but rarely exchange DNA outside their group (15, 35). It is not excluded that Q54 could have had a temperate lifestyle before losing its lysogeny module to become strictly virulent. However, no remnant of a lysogeny module could be found in Q54, which could support this hypothesis.
Posttranslational modifications of structural proteins. The SDS-PAGE and LC-MS/MS analyses strongly suggest that the MCP of phage Q54 is processed. Cleavage of the N-terminal part of the protein, in addition to cross-linking, appears necessary for building the mature viral capsid. A similar observation was made for the type phage c2 where N-terminal sequencing showed that the MCP was lacking its 205 N-terminal residues, in addition to being cross-linked (48). Cross-linking and proteolytic cleavage of the MCP subunits during assembly have been reported for a number of other phages (1, 16, 29, 34, 46, 53, 63).
Programmed +1 translational frameshifting in tail structural genes. Programmed translational frameshifting (1 type) seems to be relatively common among dsDNA phages, as revealed by a recent study (79), which identified potential signals for such frameshifting in about 70% of the analyzed phage genomes. Typical frameshift signals were found within the major tail protein and an overlapping gene. Only one example of a natural 2 frameshift event was demonstrated for phage Mu (79). Two examples of +1 translational frameshifting have been confirmed in E. coli, which includes RF2 (17), and for recombinant human transferrin (25). In the latter case, frameshifting occurred at a rate of 2 to 4% and was due to the presence of a CCC.UGA "shifty stop" at the end of the gene. This frequency was much lower than the 50% rate observed for RF2 (17). In phages, however, +1 translational frameshifts have been reported only for the temperate phage PSA from L. monocytogenes (80). In this case, the major capsid protein and major tail protein were both extended in the C-terminal ends at an estimated frequency of 25%. The CCC.UGA and CCC.UCA slippery sequences were shown to be involved in this mechanism (80). Our LC-MS/MS analysis revealed a programmed + 1 translational frameshifting event involving a "shifty stop" and secondary structures and occurring between the major tail protein (ORF23) and the receptor-binding protein (ORF24) of phage Q54. According to the recent study by Xu et al. (2004), this would represent the first case of translational coupling between an MTP and an RBP (79). Also, based on the relative band intensities on SDS-PAGE, we estimate this event to occur at a frequency of approximately 25%. Since the RBP (ORF24) alone was not detected in the virion structure, it suggests that only a fraction of the MTP is fused to the RBP. This chimeric protein would be most likely located at the distal part of the tail for host recognition, while the other single units of the MTP would probably be associated with the chimeric MTP/RBP for completion of the tail.
In summary, we have shown that the lytic phage Q54 represents a new lactococcal phage species. Despite a familiar morphology, phage Q54 has a novel genome architecture and a distinctive transcriptional map, both of which suggest past recombination events with the genomes of members of the three most common L. lactis phage groups. However, the degree of protein relatedness is very low, suggesting that the breakpoint likely occurred a long time ago. Such DNA shuffling through evolution may have led to the unusual +1 translational frameshifting in the tail structural genes of phage Q54.
This study was funded by a strategic grant from the Natural Sciences and Engineering Research Council (NSERC) of Canada.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»