Paul Hyman,1,
Timothy Harrah,1,2 and
Edward Goldberg1*
Department of Molecular Biology and Microbiology, Tufts University School of Medicine, Boston,1 Department of Biomedical Engineering, Tufts University, Medford, Massachusetts2
Received 9 July 2004/ Accepted 15 September 2004
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
|
The folding and oligomerization of the TFs is regulated by chaperones encoded by genes 57A (23) and 38 (9). Chaperone P57A participates in the assembly of P34 and P37 and also in the assembly of the short TF, P12 (15, 19). P38 is necessary (in addition to P57A) for the trimerization of gp37 to form the mature P37 rod, which initiates the extension of LTF assembly (9, 13). The order of assembly of the distal-half fiber starts with the maturation of three gp37 monomers into the P37 trimer. Then, three gp36 monomers bind to the N terminus of the P37 trimer and mature into the P36-P37 rod. This event is followed by the addition of one gp35 molecule to the N terminus of the mature trimeric P36 LTF segment to complete the distal-half fiber. The P34 trimer can then join to the gp35 end of the distal-half fiber at an angle to form a complete LTF (20). Although it is certain that chaperones P57A and P38 are involved in distal-half fiber assembly, the binding sites, order of interactions, and mechanisms of their functions are still not clear.
A temperature-sensitive mutation (ts3813) that suppresses the requirement for P38 function at 30°C but not at 42°C was found in gene 37 (which codes for gp37) (2). The authors initially speculated that P38 may catalyze the noncovalent association of two gp37 molecules. A comparison of the wild-type T4 sequence of gene 37 to the mutant sequence (12) showed a 21-bp duplication of gene 37 inserted between nucleotides 3328 and 3329, near its C terminus (Table 1, first and second rows). They speculated that the extended sequence is responsible for the maturation of gp37 to P37 in the absence of P38. We recognized that the extended 21-nucleotide sequence in the ts3813 mutant had a coiled-coil (CC) motif within a CC diheptad present in wild-type T4. When we tested the peptide sequence by using COILS (http://www.ch.embnet.org/software/COILS_form.html), the extended 21-nucleotide sequence in the ts3813 mutant was predicted to have a 98.4% CC-forming probability (when the window size was 21 residues), while in wild-type gene 37 protein, the same region of a 15-amino-acid (aa) sequence exhibited much lower CC formation probabilities (0.714 with a window size of 14 residues and 0.020 with a window size of 21 residues). Cerritelli et al. also examined the protein sequence of all of the segments of wild-type T4 TF (4) and, although 2.5 "possible" heptads were identified in this region, concluded that
-helical coiled coils were "at most a minimal element in the LTF."
|
In this paper, we report the design and construction of a series of T4 phages with longer or putatively more stable CC-like heptads in gp37 that enable P38 chaperone bypass at temperatures ranging from 30 to 50°C in vivo. From the phenotypic differences of the T4 series and the fact that all of the bypass mutants that we found had 7-, 14-, or 21-residue inserts, we conclude that the extended CC region is responsible for the initiation of P37 trimeric assembly and acts as an artificial cis chaperone to bypass the need for P38.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Construction of a plasmid encoding an extended CC. In order to add an extended CC to T4 phage P37, we began with the plasmid pT7-5/37ts3813 (P. Durham and E. B. Goldberg, unpublished data), a clone of the T4 gene 37ts3813 mutant placed in the T7 expression construct pT7-5 (34). Additional heptad-encoding DNA segments were designed with the MultiCoil and COILS programs (see below), and oligonucleotides were synthesized at the Tufts University Core Facility. Equimolar amounts of the 5'-phosphorylated oligonucleotides BlBDZ-F (5'-ATTGAATCACTTAAGACAGAA-3') and BlBDZ-R (5'-TTCTGTCTTAAGTGATTCAAT-3') were mixed, boiled, and slow-cooled to anneal the two oligonucleotides. The double-stranded oligonucleotide was then ligated to the EcoRV-digested pT7-5/37 ts3813 mutant (EcoRV fortuitously cuts uniquely at the 3' end of the CC-encoding segment between the final heptad g and a positions, encoding residues D and I). Following ligation, the new construct was introduced into E. coli MC1061 by using heat shock according to standard methods (28). The resulting plasmids were screened for multiple inserts in the correct orientation by sequencing (Tufts University Core Facility). The final construct encodes gene 37 with a pentaheptad CC (35 aa) and is designated pT7-5/37hep5 (whose protein sequence is shown in the last row of Table 1).
Recombination of plasmid DNA and T4 phage genome. We transformed E. coli B40suI cells with pT7-5/37hep5 by using routine protocols and antibiotic screening. From an overnight culture, we made a fresh log-phase culture in the same medium (L broth with 100 µg of ampicillin per ml) supplemented with 0.2% glucose at 30°C, and cells were grown to a density of 2 x 108 cells per ml. We infected the plasmid-carrying cells with T4 37ts3813 38amB262amC290 phage at a multiplicity of infection of 5. After incubation for 1 h, 50 µl of chloroform was added to lyse the cells, and the culture was incubated for an additional 5 min. The lysate was clarified by centrifugation at 5,000 x g for 10 min at 4°C and stored at 4°C.
Selection and screening of phages. We determined the titers of phage from the lysates of the recombination on a lawn of E. coli B40suI grown at 37°C overnight. From these plates, we chose individual plaques at random and spotted them in duplicate on lawns of E. coli BB and B40. One plate of each host bacterial strain was incubated at 30°C, and the other was incubated at 37°C overnight. In this way, we identified phage that formed plaques at both temperatures on both hosts. These plaques were resuspended and used as templates from which to amplify the CC region of gene 37 by PCR according to the protocol of Jozwik and Miller (18). The primers 37RV-1F (5'-GTTCTGGTAATTTTGCTAAC-3') and 37RV-1R (5'-AACAGCTAACTTTGGATATG-3') were used to amplify short segments that included all of the CC. In order to identify the heptad number in each mutant phage amplicon, we ran 2.2% agarose gels in Tris-borate-EDTA running buffer. These gels were used to screen for different CC lengths. To determine the precise CC nucleotide composition, we selected amplified fragments for DNA sequencing. To confirm the retention of the two amber mutations within gene 38, we also amplified gene 38 with primers gp38F (5'-AGCATAAGGAGAGGGGCTTC-3') and gp38R (5'-CTAGGTGCTGCCATAGACCC-3') and sequenced the product.
Temperature testing of phage mutants. To determine the range of in vivo temperature stability of multiheptad mutants, resuspended plaques were plated on E. coli BB and B40 at 30, 37, 42, 45, and 50°C overnight and counted. Serial plating at various temperatures with identical methods was used to investigate heptad stability.
Secondary-structure prediction and sequence alignments. COILS version 2.2 (22) was employed to predict the probability of CC structures in protein sequences. This program uses a single sequence-based prediction algorithm for CC regions by using statistical patterns of CC proteins within a database of known CC sequences. For all COILS analyses reported here, the MTIDK matrix was used for scoring. No weighting of positions was applied, and window sizes of 14 and 21 residues were used. Related analysis was also performed using a 28-residue window on MultiCoil (38; http://multicoil.lcs.mit.edu/cgi-bin/multicoil), which models probabilities of both dimeric and trimeric CC structures. Here, the CC probability cutoff was set at 0.5. TF sequences were analyzed as described above. Predicted CC sequences were aligned and phylogenetic trees were generated by using the CLUSTAL W version 1.75 (37) and the GeneBee multiple alignment program (25; www.genebee.msu.su).
| RESULTS |
|---|
|
|
|---|
|
We next tested the novel phages for growth at elevated temperatures. gp38 chaperone bypass was observed in cultures at nonpermissive temperatures of up to 50°C (Table 1, 2nd column) for all of the extended CC mutants, while ts3813 controls were unable to form plaques at 37°C and above. All of the phages grow at all of the temperatures in permissive cells. Sequencing of gene 38 confirmed that the two amber mutations which prevent the expression of gp38 were unchanged. It also showed that both amber mutations were G-to-A point mutations at bases 159846 and 159933 (using numbering conventions from the NCBI database, accession number NC_00086). These mutations correspond to stop codons introduced into gp38 amino acids W66am and W95am.
Although the CC was generally stable, a recombination that reduced its length continued to occur during serial plating. This continuation was primarily observed in the case of the pentaheptad mutant, SQ5A, where tetraheptad plaques were occasionally found after several rounds of serial plating. We did not conduct a systematic study of heptad reduction rates. In no case, however, did we find spontaneous reversion to the wild-type diheptad CC even after three rounds of plating. Importantly, in all plaques sequenced, the CC region contained integral multiples of heptad repeatsno partial heptads were observed. This finding supports the hypothesis that the CC motif is required.
COILS and MultiCoil predictive modeling. COILS analysis of the entire wild-type gene 37 protein sequence suggested that the 15-aa segment FANLNSTIESLKTDI (aa 795 to 809) exhibited some CC formation probabilities (0.714 with a window size of 14 residues and 0.020 with a window size of 21 residues). No other CC regions were predicted in the remainder of gp37. The one heptad elongation of this region found in the mutant phage 37ts3813 has an even higher CC-forming probability (0.806 with a window size of 14 residues and 0.984 with a window size of 21 residues), suggesting significant homology to other known CC structures. MultiCoil analysis of the sequence yielded lower CC probabilities for the 37ts3813 mutant, with a maximum overall probability of 0.124 occurring in an 18-residue segment largely overlapping the triheptad region (residues 793 to 810). Interestingly, MultiCoil predicts very low but nearly equivalent dimer and trimer probabilities of 0.055 and 0.070, respectively.
We also analyzed the novel CC sequences found in the recombinant phages without the surrounding gp37 sequence (data not shown). All of the sequences have a high probability of forming a CC for both window sizes. Furthermore, we analyzed the longer sequences by using MultiCoil to see how they compare to other dimeric and trimeric CC proteins in its database. Surprisingly, although P37 is known to be a homotrimeric protein, all of the sequences are predicted to be homodimeric CCs.
CCs in other T-even phages. A GenBank search identified 21 protein sequences of gp37 equivalents from bacteriophages that are morphologically similar to bacteriophage T4. These phages are classified as Myoviridae but often called T-even-like phages (36). Of the 21 sequences, 15 are complete protein sequences, while the remaining 6 sequences comprise only the C terminus of the protein. The sequences represent 19 different phages, among which two phages, Aeh1 and RB43, each have two genes encoding analogues of T-even-like gp37. In both cases, these genes appear to be fusions of gp36 and gp37. It is not known if both or only one is expressed. Each of the 21 sequences was analyzed with COILS. All but one of the sequences (phage 44RR2.8t, GenBank accession number AAQ81539) contained a sequence with homology to known CC segments. Interestingly, eight of the sequences contained more than one potential CC sequence, although in most of these sequences, one of the segments had a much higher probability of forming a CC than the others. These putative CC sequences are summarized in Table 2.
|
Finally, alignment of each protein sequence in the region with the highest probability of forming a CC was conducted using CLUSTAL W 1.75 on the GeneBee server. The resulting sequence alignment and phylogram diagrams are shown in Fig. 3. In general, these relationships parallel those seen in other phage genes, although there are a few exceptions (see below).
|
| DISCUSSION |
|---|
|
|
|---|
We hypothesized that by extending the CC one more heptad, we could stabilize the clamp to create a temperature-independent suppressor. This proved to be the case. The observed enhancement of temperature stability further supports the idea that the mechanism of P38 function is to enhance the parallel noncovalent association of the CC segments of gp37 monomers. This association leads to the formation of the trimeric CC and, we assume, to alignment of upstream and downstream regions leading to efficient maturation of the trimeric P37 TF segments. The assembly of P37 also requires P57A. P57A protein is also required for the maturation of P34 and P12 into trimers, even though no CC is apparent (4). Thus, gp34 and gp12 may have other means for the initiation of homotrimeric maturation. However, even in the absence of both P38 and P57A, T4 matures the various TF components to about 10% (8). The mechanism of initiation of maturation is still an unsolved problem.
In keeping with this proposed role for the CC in P37 maturation, we also examined 19 sequences of gp37 equivalents in other T-even-like bacteriophages whose sequences are known. All of the sequences contained one, two, or, in one protein, three potential CC segments that contain two to five putative CC heptads (Table 2). In 12 of the sequences, the segment with the highest probability was nearest to the C terminus of the protein.
The phylogenetic tree produced by aligning the CCs of the different phage species (Fig. 3B) is, in general, in agreement with those produced by other phage genes, notably genes 23, 18, and 19 (36). One exception is the identical sequences of the CCs of SV14 and T4; the other genes of these phage have only 84% identity, and morphology indicates significant divergence (36). Interestingly, while the sequences with the higher probability of forming CCs from each of the two RB43 genes are clearly related, the CC from the second Aeh1 gene (AAQ17987) appears to be unrelated to all of the other CCs. This finding agrees with the general model that has been proposed for modular swapping of phage genome segments by Sandmeier and others (29, 35).
The novel CCs (Table 1) allowed P37 assembly at temperatures up to 50°C. This finding raises the question of why T4 mutants possessing similar CC duplications (and that have lost gene 38 function) do not arise spontaneously. It seems advantageous for phage to minimize the number of chaperones needed for assembly. Three possibilities merit consideration. First, the extended CC may not be stable over multiple generations without the positive selection of a gp38 mutation. Because phage T4 uses homologous recombination to initiate replication during part of its life cycle (21), duplications may be quickly removed before sequence divergence or before a gp38 mutation can stabilize them. We did observe occasional heptad loss changing a pentaheptad to a tetraheptad. Second, a model of gene exchange between species of bacteriophage has been suggested (29, 35). Such an exchange at the very end of gene 37, which encodes the primary host ligand, would be of obvious advantage in expanding the phage host range. Deletion of the gene 38 chaperone, which is often found adjacent to gene 37 in bacteriophage species, might decrease the rate of exchange.
The third possibility derives from the role of the CC. Exact duplications in the CC could lead to misalignment of the CCs between the different monomers that make up the trimer. This misalignment might delay or even prevent proper P37 maturation. Interference with P37 maturation would lead to a reduction in the total number of properly assembled TFs and therefore lower numbers of progeny phage with sufficient TFs for infection. Thus, there may be selection for CCs that can align in only one register.
CCs as intrinsic chaperones. While the precise mechanism in T4 LTF folding remains unclear, it is interesting that predictive algorithms such as MultiCoil predict dimeric and not trimeric CC regions based on protein sequence. However, since MultiCoil uses modeling based on homology to other known dimeric or trimeric proteins, two potential conclusions can be drawn. First, the TF's CC protein regions may in fact be trimeric and may simply contain sequences not recognizable as trimeric by the MultiCoil algorithm and its present structural database. Second, the CCs in the TF may not act to bring all three monomers together into a trimer but, instead, may be needed only to bring the first two together to stabilize the transient dimer until the third oligomer can associate and begin folding the highly stable trimer. Such transient dimer formation has been previously identified and investigated with model trimeric CC peptides (3, 6).
A critical step in the folding of many multicomponent fibrous proteins is the initial alignment of the monomeric proteins. Our results suggest that, in wild-type T4 phage, the folding of three copies of gp37 is initiated by a diheptad CC and the P38 chaperone (and possibly P57) but that phage with extended CCs are able to bypass P38 function at temperatures up to 50°C. CC structures have an alignment role in many proteins and can even promote folding of synthetic peptides and recombinant chimeric proteins (24, 27). We speculate that a CC, especially one designed to optimize the initiation of proper ß-structural trimer formation, could, in a proper medium, stabilize the association of three gp37 molecules long enough for nonchaperoned trimer folding to occur. We believe that this scenario points to the feasibility of recombinant monomer production and simplified in vitro trimeric ß-structure formation (16).
| ACKNOWLEDGMENTS |
|---|
We thank Amy Keating for early discussions and help, Debabrata Raychaudhuri for ongoing helpful discussions, and Sankar Adhya for encouragement. We also thank N. Madias of Tufts University Medical School for supplementary support that enabled us to complete this work.
| FOOTNOTES |
|---|
Present address: Altus Biologics, Inc., Cambridge, MA 02139. ![]()
Present address: The Ohio State University, Mansfield, OH 44906. ![]()
| REFERENCES |
|---|
|
|
|---|
. J. Bacteriol. 178:6258-6265.
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Appl. Environ. Microbiol. | Infect. Immun. | Eukaryot. Cell |
|---|---|---|
| Mol. Cell. Biol. | J. Virol. | Microbiol. Mol. Biol. Rev. |
| ALL ASM JOURNALS |