Previous Article | Next Article ![]()
Journal of Bacteriology, December 2002, p. 6522-6531, Vol. 184, No. 23
0021-9193/02/$04.00+0 DOI: 10.1128/JB.184.23.6522-6531.2002
Copyright © 2002, American Society for Microbiology. All Rights Reserved.
Becky A. Bartlett, and Tina S. Goodwin
Department of Microbiology and Immunology, Virginia Commonwealth University, Richmond, Virginia 23298-0678
Received 20 June 2002/ Accepted 6 September 2002
|
|
|---|
|
|
|---|
In the course of this study, a site was discovered that results in a programmed translational frameshift. DNA sequence analysis led to the identification of an additional open reading frame between the previously identified essential tail genes E and T that overlaps the end of gene E in the -1 reading frame. Work described here demonstrates that this reading frame encodes an essential P2 function and that ribosomes translating gene E undergo a programmed frameshift near the 3' end of the gene about 10% of the time and enter the -1 reading frame. The resulting 15.4-kDa protein shares 85 N-terminal amino acids (aa) with gpE and contains a C-terminal extension encoded by the overlapping reading frame. This extended protein has been designated gpE+E'.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Bacterial and bacteriophage strains used in this study
|
|
View this table: [in a new window] |
TABLE 2. P2-containing plasmids for marker rescue, sequencing, and expression
|
|
View this table: [in a new window] |
TABLE 3. P2-specific oligonucleotide primers used for cloning and mutagenesis
|
cells. The desired mutation in the resulting plasmid, pTG502, was verified by sequence analysis prior to subsequent subcloning. Marker rescue and complementation analysis. P2 amber mutations were localized by marker rescue. Phage lysates were treated with UV light (approximately 300 erg/mm2) from a General Electric germicidal lamp. Titers of the UV-irradiated lysates were determined on a nonsuppressing strain (C-1a) containing a plasmid with a fragment of wild-type P2 DNA and on C-1a alone. An increase of at least 200-fold in the plating efficiency of the amber mutant on the plasmid-bearing strain indicated the presence of the wild-type allele on the cloned fragment. Results of the marker rescue are summarized below (see Fig. 1B).
![]() View larger version (34K): [in a new window] |
FIG. 1. Genetic map of P2 and physical map of the tail gene region reported in this paper. (A) Linear map of the P2 genome, with cosL at the left. Thin black arrows indicate the direction and extent of the known transcription units. orf, open reading frame. (B) Expanded view of the physical map of the part of the P2 genome characterized in this report and the location of amber mutations in this region. Selected restriction sites used in subcloning and marker rescue are indicated, with coordinates shown in nucleotides from the left end of the P2 genome. Boxed regions delineate the coding regions for each of the genes indicated in the physical map. The extent of DNA carried by each plasmid subclone is aligned below the relevant restriction sites or position on the physical map; gray bars indicate the intervals to which amber mutations were mapped by marker rescue. (C) DNA sequence of the region between the end of gene E and the beginning of gene T, showing the -1 (E') open reading frame between these two genes. The TGA stop codon for gene E and the two TAA stop codons defining the boundaries of the -1 reading frame are shown in boldface type. The first in-frame Met codon in the -1 reading frame is underlined, and the location of the C-to-A change introduced to create an amber mutation in E' is indicated.
|
10 promoter was assayed in strain C-2420 carrying the compatible plasmid pGP1-2 (43). This plasmid carries T7 gene 1 under control of a temperature-sensitive
repressor. Expression of T7 RNA polymerase was sufficiently leaky at 33°C to allow complementation of P2 amber mutants by plasmids expressing the corresponding wild-type genes from the T7 promoter. Plating efficiencies of P2 amber mutants on strains carrying complementing plasmids were equivalent to those obtained on a supD strain. Complementation of P2 Ets55 by plasmids carrying wild-type and mutant copies of the E+E' region was examined in strains C-6518 (Su-) and C-6519 (supD), which express T7 RNA polymerase from the lacUV5 promoter. Expression in the absence of induction was sufficiently leaky to allow complementation, which was tested at 42°C.
DNA sequence determination and computer analysis.
DNA sequences were determined by the method of Sanger et al. (40) using a Sequenase kit (U.S. Biochemicals, Cleveland, Ohio) and 5'[
-35S]thio-dATP or using a Taq DyeDeoxy Terminator Cycle Sequencing Kit (Applied Biosystems, Inc., Foster City, Calif.). Plasmids carrying cloned DNA from this region of the P2 genome (Table 2) were used as templates. The sequence changes in amber mutants were determined from cloned fragments or from PCR-amplified phage DNA of the regions identified by marker rescue. Sequencing reactions using the fluorescent dideoxynucleotides were analyzed on an ABI automated sequencer in the Nucleic Acids Core Facility, Massey Cancer Center, Medical College of Virginia, Virginia Commonwealth University. At least two independent sequence determinations were performed for each strand, and oligonucleotide primers were used to obtain sequences spanning all restriction sites used in subcloning. All sequences obtained from PCR-amplified DNA were determined from the products of two independent amplification reactions. M13 universal sequencing primers were purchased from commercial suppliers; additional oligonucleotide primers were synthesized by the Nucleic Acids Core Facility, Massey Cancer Center, Medical College of Virginia, Virginia Commonwealth University, or purchased from Oligos, Etc. (Wilsonville, Oreg.). Details about primers used in sequence analysis will be provided upon request.
Sequence data were stored and analyzed using the GCG (Genetics Computer Group) Wisconsin Package (Accelrys, Madison, Wis.). The sequence reported in this paper corresponds to coordinates 19384 to 23954 in the sequence of the entire P2 genome and is in the GenBank database.
Assay of ß-galactosidase. Cultures of strain C-2420 carrying pTG316 or pTG373 were grown overnight in LB plus ampicillin. Cultures grown overnight were diluted 20-fold in LB supplemented with ampicillin in 96-well microtiter plates and incubated for 90 min with shaking at 37°C. The cultures were then diluted into fresh medium with 1 mM isopropyl-ß-D-thiogalactopyranoside (IPTG), and incubation was continued at 37°C for another 90 min. After the optical densities at 600 nm (OD600) of the cultures were read, growth was halted by mixing aliquots (130 µl) of each culture with 30 µl of chloroform. The ß-galactosidase activity of a 20-µl aliquot of each culture was determined by the method of Menzel (31). Six independent colonies of each construct in the presence and absence of IPTG were assayed, and activities are expressed as the mean ± standard error of the induced activity in arbitrary units. Background activity in this assay was essentially undetectable.
Purification of fusion proteins. The gpE-ß-galactosidase fusion protein was prepared from strain C-2420 carrying pTG316 and the compatible lacI plasmid pRG1 (R. Garcea, unpublished data). A 1-liter culture of cells was grown at 37°C in LB supplemented with ampicillin and kanamycin to an OD600 of 0.55. IPTG was added to a final concentration of 1 mM, and incubation was continued overnight. Cells were pelleted and resuspended in 100 ml of 50 mM Tris-HCl [pH 7.5]-0.5 mM EDTA. Lysozyme was added to a final concentration of 1 mg/ml, and the cells were incubated on ice for 1 h. NaCl and MgCl2 were added to final concentrations of 50 mM and 1 mM, respectively, and the lysate was treated with 2 mg of DNase I for 30 min on ice followed by a 10-min incubation at 37°C. Cell debris was removed by centrifugation at 10,000 x g for 15 min. The gpE-ß-galactosidase fusion protein was precipitated by the addition of ammonium sulfate to 40%, followed by an additional centrifugation at 10,000 x g for 20 min. The ß-galactosidase activity, which was present in the pellet fraction, was resuspended in 20 ml of 20 mM Tris-HCl. Insoluble material was removed by centrifugation for 10 min at 10,000 rpm in a Sorvall SS-34 rotor, and the supernatant was applied to a Promega Protosorb lacZ immunoaffinity column. The fusion protein was eluted under the high pH conditions recommended by the manufacturer, dialyzed overnight against water, and dried. Protein was redissolved in Laemmli sample buffer for further analysis.
For purification of MalE fusion proteins, a 1-liter culture of DH5
cells containing pTG414 or the vector pMALc-2 was grown at 37°C in LB supplemented with ampicillin to an OD600 of 0.55. IPTG was added to a final concentration of 0.5 mM, and incubation was continued for 2 h. Cells were pelleted, weighed, and resuspended in 10 ml of lysis buffer (25 mM morpholinepropanesulfonic acid [MOPS] [pH 7.1], 100 mM NaCl, 20 µM EDTA, 5% [vol/vol] glycerol, 1 mM ß-mercaptoethanol) plus 1 mM phenylmethylsulfonyl fluoride for each gram of cell paste. Cells were lysed in a chilled French pressure cell at 12,000 lb/in2, and the lysate was centrifuged at 41°C for 30 min at 16,000 rpm in a Sorvall SS-34 rotor. The supernatant was applied to a column containing 5 ml of amylose resin (New England Biolabs) at a flow rate of 15 ml/h. The column was washed with approximately 40 ml of lysis buffer, and then the fusion protein was eluted with lysis buffer containing 10 mM maltose. One-milliliter fractions were collected, and aliquots from fractions containing the protein peak were analyzed by electrophoresis on sodium dodecyl sulfate (SDS)-polyacrylamide gels.
N-terminal sequence determination. The N-terminal sequence of purified ß-galactosidase fusion protein was obtained by automated Edman degradation on a Hewlett-Packard model G1005A protein sequencer, by Commonwealth Biotechnologies, Inc., Richmond, Va.
Nucleotide sequence accession number. The sequence reported in this paper, corresponding to coordinates 19384 to 23954 in the sequence of the entire P2 genome, was used to complete the P2 genome sequence. The complete genome has been deposited in the GenBank database under accession number NC_001895.
|
|
|---|
Identification of gene E and the -1 frameshift extension E'. Gene E was originally defined by the ts mutation 55 (26); nonsense mutation am30 lies in the same complementation group and was shown by Lengyel et al. (24) to result in a lack of tails. A polypeptide corresponding to the product of gene E has not been identified in infected cells or phage particles. Both the Ets55 (C to A at nt 19473) and Eam30 (C to T at nt 19619) mutations lie in an open reading frame just distal to FII that encodes an acidic polypeptide of 91 aa (Fig. 1 and 2). The role of gpE is unknown; Lengyel et al. (24) reported that the product of gene T appears to be unstable in the absence of gpE and suggested that gpE plays a role in stabilizing gpT.
![]() View larger version (13K): [in a new window] |
FIG. 2. Complementation of P2 E mutants by cloned fragments carrying the E-E' region. P2 Ets55 was plated at 42°C. Complementation was shown by growth of P2 mutants as follows: +, growth; -, no growth; NT, not tested.
|
10 promoter. The resulting plasmid, pTG546, failed to complement P2 Eam30 in a nonsuppressing strain expressing low levels of T7 RNA polymerase (Fig. 2). An otherwise identical plasmid carrying the corresponding wild-type fragment (pKL2) did complement P2 Eam30 in the same background. These results demonstrate that E' is essential for P2 lytic growth and indicate that the Eam30 mutation also affects expression of E'. The pTG546 plasmid also failed to complement P2 Ets55 at 42°C in a nonsuppressing strain, although complementation was obtained in an otherwise isogenic supD strain. The Ets55 missense mutation should not be polar on E', so this result is consistent with expression of the E' reading frame as a frameshifted extension of E.
Further evidence in support of an essential role for E' comes from studies of coliphage 186, a close relative of P2. The homologous 186 tail gene cluster contains a similar pair of overlapping reading frames (38); the upstream one encodes a polypeptide 67% identical to gpE, while the downstream -1 reading frame encodes a polypeptide 74% identical to gpE'. Two 186 amber mutations have been mapped to the downstream reading frame, which was identified as a tail gene, H, by Hocking and Egan (19). Lysates of 186 Ham56-infected cells showed an accumulation of apparently normal tails (19). This finding suggests that the longer frameshifted gene product either is a minor protein that must be added to completed tails, such as a collar component, or is involved directly in the process of head-tail attachment. The P2-related Pseudomonas aeruginosa phage
CTX (35) encodes a similar potential protein generated by a -1 frameshift (see Discussion), while the P. aeruginosa R2 pyocin gene cluster does not (36). This observation is also consistent with a role for gpE+E' in head-tail attachment, since the R-type pyocins are completed tail structures that are not attached to heads.
Since all mutations that affect P2 gpE also affect gpE+E', there is no direct evidence that the shorter gpE protein is in fact essential, as originally reported. A strong case can be made, however, based on the difference in reported phenotypes caused by Eam30 and the amber mutation affecting just the downstream reading frame in the homologous gene from phage 186, Ham56. If the essential role of gpE were just to permit proper expression of gpE+E', the defects conferred by Eam30 (which makes neither protein) and Ham56 (which makes the protein equivalent to gpE but not the longer frameshifted polypeptide) should be the same. However, as described above, no tails were observed in cells infected with P2 Eam30 (24), while apparently complete but unattached tails accumulated in cells infected with 186 Ham56 (19). If one accepts the hypothesis that these homologous genes encode proteins that have equivalent functions in P2 and 186, this evidence strongly supports the conclusion that both gene products are essential for tail assembly.
Confirmation of a programmed translational frameshift between E and E'. To test directly the hypothesis that the region of overlap between the gene E reading frame and the E' reading frame included determinants for a -1 translational frameshift, we employed a plasmid designed to allow expression of a reporter gene only if such a frameshift occurs. Plasmid p138, described by Weiss et al. (45), carries a modified lacZ gene in which DNA between codons 2 and 5 is replaced with a short "stuffer" region flanked by HindIII and ApaI sites. Expression of lacZ is controlled by Ptac and the E. coli lpp translation start site. A fragment containing the putative frameshift site near the end of gene E and the beginning of E' was generated by PCR using primers EH3 and EA1, which introduced HindIII and ApaI sites, respectively (Fig. 3). Insertion of this fragment between the ApaI and HindIII sites of p138 generated plasmid pTG316. Production of active ß-galactosidase from this construct can occur only if a frameshifted ribosome bypasses the termination codon at the end of E and enters the downstream lacZ coding region in the correct frame. We also created a parallel construct, pTG373, by inserting a fragment generated by PCR with primers EA1.1 and EH3. This plasmid contains an additional nucleotide just upstream of the stop codon of gene E; this places the lacZ gene in frame with the initiating AUG and allows measurement of translation that bypasses the frameshifting site without changing reading frame. A comparison of the levels of ß-galactosidase produced by these two plasmids (Fig. 3) indicates that the ratio of ribosomes that shift reading frame to those that bypass the frameshift is about 1:13. The fraction of ribosomes that shift reading frame during translation of the E gene is therefore on the order of 7 to 8%.
![]() View larger version (15K): [in a new window] |
FIG. 3. Analysis of the region containing a programmed translational frameshift. (A) The nucleotide sequence spanning the end of gene E and the beginning of open reading frame E' is shown, along with the oligonucleotides used to clone this region into the frameshift detection plasmid p138. The position where a C residue was inserted to allow translation of lacZ by ribosomes that did not shift reading frame is indicated. Below the DNA sequence the predicted amino acid sequences encoded by E and the -1 open reading frame E' are shown, along with the actual N-terminal sequence determined from the ß-galactosidase (ßgal) fusion protein purified from cells carrying pTG316. (B) Measurement of frameshifting efficiency in vivo. The ß-galactosidase (ß-gal) activity was determined in cells carrying a plasmid with the wild-type (wt) E-E' fragment, in which lacZ must be translated by ribosomes that have undergone a -1 frameshift, and from cells with a plasmid carrying the equivalent fragment in which a C was inserted just upstream of the stop codon for gene E (fs+1), so that lacZ is translated by ribosomes that did not shift reading frame into E'.
|
We wished to demonstrate that the frameshift observed in the context of the lacZ fusion also occurred during normal translation of gene E. Unlike the relatively abundant
gpG and gpG-T proteins, the products of gene E and its frameshifted extension (hereafter referred to as gpE+E') have never been identified in P2-infected cells. A number of attempts were made to overexpress these proteins from plasmid constructs using either the T7 expression system or an N-terminal hexahistidine tag. No proteins with a predicted molecular mass corresponding to that of either gpE or gpE+E' were observed, nor were there any identifiable proteins whose appearance was affected if the constructs carried the Eam30 mutation. It may be that these presumptive tail assembly proteins are unstable in the absence of other P2 tail components. As a final attempt, we constructed plasmid pTG414, which placed the coding region for genes E and E' at the end of the malE gene in plasmid pMAL-c2. Amylose affinity purification of the fusion proteins yielded a major product with an apparent molecular mass of 52.6 kDa (Fig. 4), in good agreement with the predicted size of 52.4 kDa for the MalE-gpE fusion protein. In addition, a small amount of a larger 57.8-kDa fusion protein was observed; this is the predicted size for the MalE-gpE+E' frameshifted product. Several smaller products, presumably arising from degradation, were also apparent. Densitometric analysis of the relative proportions of the two largest species indicates that the larger protein represents approximately 12% of the total, in reasonably good agreement with the frequency of frameshifting measured in the ß-galactosidase assay.
![]() View larger version (56K): [in a new window] |
FIG. 4. Synthesis of a MalE fusion protein carrying the frameshifted gpE+E'polypeptide. Protein was affinity purified as described in Materials and Methods and analyzed by electrophoresis on a 10% SDS-polyacrylamide gel stained with Gelcode blue (Pierce). Protein bands corresponding to the expected sizes for the MalE-gpE fusion protein and the MalE-gpE+E' frameshifted protein are indicated, as are the sizes (in kilodaltons) of molecular mass standards. Densitometry and digital imaging of the gel was performed on a ChemiImager 4000 (Alpha Innotech Corp.), and the figure was compiled using Microsoft PowerPoint.
|
Lengyel et al. (24) suggested that gpT was the P2 tail fiber, based on its large size and the fact that it was present in approximately six copies per phage particle. Subsequent work, however, established gpH as the P2 tail fiber protein (13). The most likely role for gpT is that of tail length determination. It is the only P2 tail protein that is large enough to span the length of the P2 tail shaft. Both
(18, 22) and T4 (1) encode proteins that have been shown to act as tape measures for tail shaft polymerization. The sizes of these proteins correspond to a fairly constant 0.15 nm of tail length per amino acid residue, suggesting that their structure is that of an extended
-helix. The secondary structure predicted for gpT (6) is also largely
-helical. In addition, a BLAST homology search revealed multiple regions of similarity between gpT and myosin heavy chains, which are proteins with known extended
-helical structure. The tail length-to-amino acid ratio for gpT is 0.17 nm per amino acid residue, suggesting that this protein might be a bit more extended than in the lambdoid phages.
A comparison of P2 gpT with the homologous proteins encoded by other P2-related phages provides further support for assigning the role of tail length determination to this gene product. The coliphage 186 gpG is 78% identical to P2 gpT and virtually the same size (812 aa compared to 814 aa [38]), and the tails of phage 186 and P2 are indistinguishable in length. The P2-related Pseudomonas phage
CTX encodes a 904-aa protein, predicted to be rich in
-helix, that is 29% identical to P2 gpT (35). The product of this gene has the predicted N-terminal sequence but apparently migrates to the position of an 67-kDa protein, rather than to the position of a 95.8-kDa protein predicted by the DNA sequence, suggesting that the C-terminal portion of the protein is removed by processing. Consistent with the proposed role of this protein in tail length determination, the
CTX tail is 105 nm in length (17) compared to 135 nm for P2. While C-terminal processing of tail proteins during the assembly process has been documented for
gpH and the analogous proteins from several other lambdoid phages (18), there is no evidence to suggest proteolytic cleavage of gpT during tail assembly, since the protein identified in phage particles is roughly the same size predicted by the coding sequence.
Identification of gene U.
Beginning with the TAA termination codon for gene T is a potential ribosome binding site with extensive homology to the 3' end of 16S rRNA, i.e., TAAGGAGGTGA. Seven base pairs beyond this is the first of two tandem ATG codons, which begins the next open reading frame. We characterized three U amber mutations: Uam25 (C to T at nt 22363) and Uam77 and Uam92 (identical mutations that change A to T at nt 22501). Both changes lie in this next open reading frame, which encodes a protein of 160 aa with a predicted molecular mass of 17.45 kDa. Although gpU was not found in phage particles (24), Ljungquist and Bertani (29) identified a protein that appeared to be synthesized in maxicells from a plasmid carrying a wild-type P2 PstI fragment, but not from pEE260, which carries the same fragment from P2 Uam25. This protein, which was 28 kDa, was thought to be the product of gene U. The open reading frame predicted by sequence determination encodes a considerably smaller protein. To ensure that we had correctly identified the entire coding sequence, a 567-bp fragment generated by cleavage with NsiI and TaqI (P2 coordinates 22286 to 22853) was cloned into pT7-7. The resulting plasmid, pTG203, complemented P2 Uam25, indicating that a functional U gene was contained within this fragment. Further confirmation is provided by the paralogous gene from
CTX, which encodes a predicted protein of similar size (16.2 kDa) that is 51% identical to P2 gpU (35). N-terminal microsequencing of
CTX virion proteins indicates that this protein is present in the phage particle in small amounts, suggesting that P2 gpU is likely to be in P2 virions as well and may have escaped detection in earlier studies (24) due to its small size and low abundance.
Identification of gene D.
The product of the D gene was also not identified in phage particles (24) but was expressed in minicells from the cloned PstI fragment examined by Ljungquist and Bertani (29). They identified a protein with a molecular mass of about 46 kDa that was replaced by a 32-kDa protein encoded by a plasmid carrying the Dam6 mutation. Overlapping the termination codon of gene U is a start codon for an open reading frame of 388 aa, encoding a protein with a molecular mass of 42.8 kDa. The Dam6 mutation (C to T at nt 23592) is in this reading frame and would generate a protein of 267 aa (29.4 kDa), consistent with the reported reduction in size (29). The gpD homologue of
CTX, 45.4% identical to P2 gpD, is present in phage virions (35). It is likely, therefore, that gpD is present in P2 virions as well. The failure to detect gpD in phage particles or infected cells (24) can be explained by the similarity in size between gpD and the relatively abundant gpFI (42.8 versus 43.1 kDa), which would make these proteins difficult to resolve on polyacrylamide gels. The polar effect of the Fam4 mutation on gene D also prevented detection of gpD in the absence of gpFI. Antibodies to gpD will be required to confirm the presence of gpD in P2 virions.
|
|
|---|
![]() View larger version (38K): [in a new window] |
FIG. 5. Potential translational frameshift sites in P2-related phages. (A) Alignment of sequences from the region corresponding to E-E' in related phages that parallel P2 in genome organization. Sequences are from the following phages (GenBank accession numbers are shown in parentheses): E. coli phages 186 (NC_001317) and W (Esposito et al., unpublished); S. enterica phages SopE (AF153829), Fels-2 (sequenced as a prophage in the S. enterica serovar Typhimurium LT2 genome; (AE006468) and PSP3 (Christie et al., unpublished); and P. aeruginosa phage 102 CTX (NC_003278). Boxes indicate the extent of the open reading frame (orf) and overlap in the genes equivalent to P2 E (white) and the -1 reading frame E' (light gray), as well as show the beginning of T (dark gray, which in all cases is in the same frame as E). The DNA sequences from this region are aligned below the sequence alignment. The codons for the last amino acid read in the gpE reading frame and the first amino acid read in the gpE+E' reading frame are indicated by an underline and overline, respectively. Nucleotides shown in boldface type are complementary to the 3' end of 16S rRNA. Arrows indicate complementary sequence encoding potential RNA hairpins predicted by the GCG program MFold. In the case of P2 and W , the most stable of several predicted structures is shown. However, the extensive similarity between these two sequences might also allow formation of a hairpin equivalent to that of P2 in W and vice versa. (B) A similar comparison of open reading frames upstream of the putative tail tape measure protein in the more distantly related P2-like phages from H. influenzae (HP1 [NC_001697] and HP2 [NC_003315]) and V. cholerae (K139 [NC_003313]) (GenBank accession numbers shown in brackets). The DNA sequence of the region of overlap between the genes in locations equivalent to E and E is shown. The TAA stop codon for open reading frame 26 (orf26) (HP1 and HP2) or orf30 (K139) is underlined, as is the TGA stop codon that defines the extent of overlap in the -1 reading frame.
|
CTX (36), Salmonella enterica phages SopE
(15, 33) and Fels-2 (30), and two additional phages whose genomes have been determined recently, W
(D. Esposito, B. J. Schmidt, F. R. Bloom, and G. E. Christie, unpublished data) and PSP3 (G. E. Christie, P. Xu, P. Vitazka, and G. A. Buck, unpublished data). As illustrated in Fig. 5A, the extent of potential overlap in the -1 open reading frame is virtually identical for P2 and 186, as well as for W
and PSP3 (which have sequences in this region identical to P2 and 186, respectively). The other three phages have a much longer region in which a frameshift could occur. The gpT homologue is encoded in the same reading frame as the gpE homologue in all cases, and its reading frame overlaps the end of the reading frame for the E' homologue by 8 nt except in
CTX, where the overlap is 11 nt. A comparison of the DNA sequence of these seven phages in the region surrounding the E-E' frameshift in P2 shows absolute conservation of the T6G at the upstream boundary of the frameshift site (Fig. 5A). Another feature shared by all of these sites is a sequence complementary to the 3' end of 16S rRNA, just upstream of the T6G. These sites all have potential hairpins 3' of the frameshift site as well, but the strength and position of each of these predicted RNA secondary structures is somewhat variable, suggesting that this feature may be less important. Genetic or biochemical evidence supporting the existence of a frameshift is available only for P2 and 186 at present, but on the basis of conservation of sequence and features in this region, it seems likely that a similar frameshift occurs at this position in all of these closely related phages. Three additional phages have been classified as P2-related phages on the basis of amino acid sequence similarities of some of their genes. These include the Haemophilus influenzae phages HP1 (9) and HP2 (B. J. Williams, M. Golomb, M. V. Olson, and A. L. Smith, GenBank entry NC_003315), and the Vibrio cholerae phage K139 (D. Kapfhammer, J. Nesper, J. Blass, and J. Reidl, GenBank entry NC_003313). The capsid gene clusters of these three phages are clearly homologous to those of the other P2-related phages, but the gene organization and amino acid sequences of the tail genes are much less similar. The tail gene clusters of these three phages are closely related to each other, however. No homologue of P2 E has been identified in these phages, but all possess a putative tail tape measure gene that does show similarity to that of P2 T. Inspection of the sequence upstream of the T gene homologue reveals two small overlapping reading frames in a location similar to that of E and E' (Fig. 5B). All three phages have an overlap of the same length between the putative E and E' equivalent genes, and the extended open reading frame that could be generated by a -1 frameshift ends just upstream of the beginning of the putative tape measure gene, which is translated in the same reading frame as the potential frameshifted polypeptide. The sequences of HP1 and HP2 in the region of overlap are identical; the sequence of K139 differs from the HP1 and HP2 sequence, except for conservation of a potential "slippery sequence," A6C, preceding the stop codon in the upstream reading frame (Fig. 5B). No potential Shine-Dalgarno sequence or RNA secondary structures are evident in this region. While there are no amino acid similarities between these small open reading frames and those in the P2 E-E' region or any genetic evidence suggesting a similar role for these genes, the parallel in gene organization and the potential for a translational frameshift are suggestive of a similar mechanism in these three more distant relatives of P2 as well. Translational frameshifting in tail assembly genes was first reported in bacteriophage lambda (25). The virion proteins of P2-related phages and lambdoid phages show little amino acid sequence similarity. The roles of the two polypeptides encoded by the overlapping reading frames have not been clearly established for lambda or P2, but in both cases they play a role in tail assembly. The apparent frequency of frameshifting in P2 is two- to threefold higher than that reported for lambda gpG-T, and features of the frameshift sites suggest differences in the mechanism involved in promoting frameshifting. Nevertheless, the arrangement of the cluster of tail assembly genes that includes the programmed translational frameshift is strikingly parallel. The overlapping open reading frames are preceded in both cases by the gene (or in the case of P2, with a contractile tail, genes) encoding the major structural component(s) of the phage tail. Distal to the frameshifted gene is the gene encoding the protein that determines tail length. A similar organization of tail genes and a putative site for a translational frameshift have recently been reported for bacteriophage Mu and several of its relatives as well (34). While it is possible that this remarkable similarity in gene organization is accidental, it is tempting to speculate that the frameshift may play a biological function (beyond regulating the relative molar ratios of the two polypeptides encoded by the overlapping reading frames) that has been conserved during phage evolution.
This work was supported in part by grants (to G.E.C.) from the National Institutes of Health (GM34651), American Cancer Society (NP869A), and the A. D. Williams Foundation, Medical College of Virginia Campus, Virginia Commonwealth University.
Present address: Department of Biology, Drew University, Madison, NJ 07940. ![]()
|
|
|---|
protein. J. Bacteriol. 177:3743-3751.
tail assembly protein. J. Mol. Biol. 234:124-139.[CrossRef][Medline]
CTX, a cytotoxin-converting phage of Pseudomonas aeruginosa: implications for phage evolution and horizontal gene transfer via bacteriophages. Mol. Microbiol. 31:399-419.[CrossRef][Medline]
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»