Previous Article | Next Article ![]()
Journal of Bacteriology, May 2008, p. 3606-3612, Vol. 190, No. 10
0021-9193/08/$08.00+0 doi:10.1128/JB.00095-08
Copyright © 2008, American Society for Microbiology. All Rights Reserved.

Department of Biomolecular Engineering, Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 4359-B39 Nagatsuta-cho, Midori-ku, Yokohama 226-8501, Japan,1 National Institute of Advanced Industrial Science and Technology, AIST Tsukuba Central 2, Umezono 1-1-4, Tsukuba Science City, Ibaraki 305-8568, Japan2
Received 18 January 2008/ Accepted 28 February 2008
|
|
|---|
|
|
|---|
The gp5 protein of bacteriophage T4 is an essential structural component of the phage baseplate. In the T4 phage, the gp5 protein possesses a lytic activity (8-10, 13, 14, 19), and because of this, it was originally referred to as a lysozyme that caused "lysis from without" (6, 7, 22). During the tail assembly of phage T4, gp5 first interacts with gp27 and forms a heterohexameric complex, (gp5)3(gp27)3, (1, 4, 8-10). X-ray crystallography of the complex in combination with electron microscopy (EM)-based three-dimensional image reconstruction has unambiguously localized the gp5-gp27 complex to the central region of the baseplate at the tip of the tail tube (2, 4, 13). The gp5 protein has three distinct domains, namely, the N-terminal domain (gp5N), the lysozyme domain (gp5Lys), and the C-terminal domain (gp5C). The gp5C domain is responsible for forming the extraordinary triple-stranded β-helix, which plays a major role in both puncturing the outer membrane of the host, Escherichia coli, and locally degrading the peptidoglycan layer. The gp27 trimer in the complex forms a cuplike structure, together with gp5N at the base (cup inner and outer diameters,
30 and 80 nm, respectively). Trimeric gp27 has a pseudo-sixfold symmetry. It connects the threefold-symmetrical tail lysozyme complex with the sixfold-symmetrical baseplate (4). The upper part of the cup is thought to connect the tail lysozyme complex with the tail tube via two tail-associated proteins, gp48 and gp54 (2, 11, 13).
The gp5 protein of KVP40 has a high degree of sequence similarity in the N-terminal and C-terminal domains to the T4 gp5 protein (46% and 35%, respectively). It also possesses the multiple-repeat VXGXXXXX sequence in the C-terminal domain; however, it lacks the lysozyme domain that the T4 gp5 protein possesses (Fig. 1). No KVP40 ORF product having homology to the T4 gp27 has been detected, in spite of the fact that a number of other baseplate proteins in the two phages share significant homology (21). We surmised that there exists a gene encoding a T4 gp27 homolog in the KVP40 genome and that the three-dimensional structure is better preserved than the amino acid sequence. We chose ORF334 as a candidate for the T4 gp27 homolog, as the gene product is of a size similar to that of T4 gp27 and it is located in the baseplate gene cluster (the arrangement of baseplate genes in KVP40 is different from that of phage T4, a point taken up in Discussion below). In the present study, we expressed and analyzed KVP40 gp5 and ORF334. It was shown that KVP40 gp5 forms a trimer in solution and that it forms a heterohexamer with KVP40 ORF334. We propose that KVP40 ORF334 is the homolog of T4 gp27.
![]() View larger version (80K): [in a new window] |
FIG. 1. Sequence comparison of gp5 proteins from phages T4 and KVP40. Sequences of gp5 proteins from six T4-related phages, T4 and RB69 (T-even), RB49 and 44RR2.8t (Pseudo), and Aeh1 and KVP40 (Schizo), were aligned using ClustalW. Only the results for gp5 proteins from T4 and KVP40 are shown. Shading indicates the N-terminal, lysozyme, and C-terminal domains of T4. The asterisks indicate identical amino acids in the two phages, and the arrowheads denote V and G in the 8-residue repeats, VXGXXXXX, in the C-terminal domain of T4.
|
|
|
|---|
![]() View larger version (20K): [in a new window] |
FIG. 2. Alignment of six T4-related phage genomes. Tail and head gene alignments among six T4-related phages, T4, RB69, RB49, 44RR2.8t, Aeh1, and KVP40, are shown. The color definitions are as follows: light blue, tail protein (g5 and ORF334 are shown in sky blue); dark blue, head and neck proteins; yellow, DNA association protein; and green, chaperonin and assembly catalysis. The lines among phages show the orthologs: red is baseplate-related proteins, black is others. The arrows indicate the positions of gene 5 and gene 27, and gene 5 and ORF334, of T4 and KVP40, respectively.
|
Amino acid sequence alignment of gp5 proteins from T4-related phages. The amino acid sequences of the gp5 proteins from six T4-related phages, T4, RB49, RB69, 44RR2.8t, Aeh1, and KVP40, were aligned using ClustalW (30). In Fig. 1, only the sequences from T4 and KVP40 are shown.
Expression and purification of gp5. For expression of gene 5, E. coli BL21(DE3) cells containing pMNK were cultivated in LB medium with 200 µg/ml ampicillin at 37°C. When the optical density of the culture at 600 nm was 0.4 to 0.5, protein expression was induced by 1 mM IPTG (isopropyl-β-D-thiogalactopyranoside). The cells were pelleted at 2,140 x g for 20 min 4 hours after induction.
The purification steps for gp5 tagged with His6 at the C terminus were based on those for T4 (5) and were modified as follows. Harvested cells were resuspended in a 10x volume of buffer 1 (50 mM Tris-Cl, 5 mM imidazole, pH 8.0) and sonicated with phenylmethylsulfonyl fluoride at a final concentration of 1 mM. After centrifugation at 20,000 x g for 20 min, the supernatant was loaded onto a HiTrap chelating column charged with nickel (GE Healthcare) that had been equilibrated with buffer 1, and the proteins were eluted by a linear gradient of 5 to 500 mM imidazole in EDTA-containing solution. The fractions containing the desired proteins were collected and applied to a HiTrap Q HP column (GE Healthcare) equilibrated with buffer 2 (50 mM Tris-Cl, pH 8.0). The targeted proteins were eluted at an NaCl concentration of 0.40 to 0.45 M by use of a linear gradient of 0 to 1 M. The fractions containing the pertinent proteins were collected and concentrated to 2 to 5 ml by Amicon Ultra 50K (Millipore) and then loaded onto a Hiload 16/60 Superdex 200-pg column (GE Healthcare) that had been equilibrated with buffer 3 (50 mM Tris-Cl, 100 mM NaCl, pH 8.0). The proteins were collected after elution with buffer 3.
Expression and purification of the gp5-ORF334 complex. For coexpression of gene 48, gene 53, orf334, and gene 5, E. coli BL21(DE3) cells containing the pMNC plasmid were cultivated in LB medium in the presence of 50 µg/ml kanamycin at 37°C. Expression was induced by the addition of 1 mM IPTG when the culture reached an optical density of 0.4 at 600 nm, and the cells were then incubated at 20°C for 1 h. The cells were harvested at 2,140 x g for 15 min after overnight incubation at 20°C.
The purification steps were the same as those for gp5 except that (i) buffer 1 was 50 mM Tris-Cl, 5 mM imidazole, 50 mM NaCl; (ii) buffer 2 was 50 mM Tris-Cl, 50 mM NaCl, pH 7.5; and (iii) the gradient of NaCl was 0.05 to 1 M (eluted at 0.4 to 0.5 M).
SDS-PAGE and N-terminal amino acid sequence determination. Sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) was carried out according to the method of Laemmli (15) with a vertical minislab gel (9 by 7.5 cm). The gel was stained with staining buffer (0.1% Coomassie brilliant blue, 10% acetic acid). Proteins separated on SDS-PAGE were transferred to polyvinylidene difluoride membranes electrophoretically. After proteins on the membranes were visualized by Coomassie brilliant blue, bands were cut out and the sequences were confirmed by a protein sequencer (PPSQ-21 protein sequencer; Shimadzu).
CD spectrum. The far-UV circular-dichroism (CD) spectrum of gp5 was measured at 20°C with a J-720 spectropolarimeter (Jasco) in a 1-mm-path-length cell. The protein concentration of gp5 was 0.26 mg/ml. The reference solvent was buffer 3, which was also used for prior exhaustive dialysis of the protein sample. The CD spectrum obtained between 198 and 240 nm was analyzed using the program CONTINLL (24) in order to estimate the secondary structure.
Analytical ultracentrifugation. Sedimentation velocity and equilibrium experiments were conducted with an Optima XL-I (Beckman-Coulter) using a four-hole An60Ti or an eight-hole An50Ti rotor at 20°C. gp5 was dialyzed against buffer 3, and the dialysate was used as the reference solution. For the gp5-ORF334 complex, the equilibration buffer for gel filtration was used as the reference solution, because extended periods of dialysis tended to result in nonspecific protein aggregation. Sedimentation velocity data were acquired at a rotor speed of 40,000 rpm for gp5 and 35,000 rpm for the complex without specification of time intervals between successive scans. The sedimentation coefficient distribution function, c(s) was obtained using the SEDFIT program (26, 27). The molecular mass distribution c(M) was obtained by converting c(s) on the assumption that the frictional ratio f/f0 was common to all the molecular species (as implemented in SEDFIT).
Sedimentation equilibrium was carried out at starting absorbances at 280 nm (A280) of 0.15, 0.3, and 0.5 at rotor speeds of 6,000, 8,000, and 10,000 rpm for gp5 and at A280 of 0.2, 0.3, and 0.4 at rotor speeds of 4,500, 7,000, and 8,500 rpm for the complex. For each experiment, the data were globally fitted to a single-species model to determine the molecular weight. The protein partial specific volumes (
) were determined based on the amino acid sequence by the program SEDNTERP (16; J. Philo, unpublished data). The
value of the complex was calculated using the amino acid composition of a hypothetical tandemly connected ORF334 with gp5 (as the complex contains an equal number of moles of gp5 and ORF334, as determined by SDS-PAGE [see Results]). The buffer density (
) and viscosity (
) were also calculated using the SEDNTERP program.
EM. A solution of the gp5-ORF334 complex with approximately 0.03-µg/ml total protein concentration in buffer 3 (defined above) was adsorbed onto a thin carbon film (supported on top of copper mesh grids) that had been previously rendered hydrophilic by glow discharge in a partial vacuum. Samples were washed with 5 drops of double-distilled water, negatively stained twice with 2% uranyl acetate solution for 30 s each time, blotted, and then dried in air. Micrographs of negatively stained particles were recorded in a JEOL 100CX transmission electron microscope at x53,000 magnification using a 100-kV acceleration voltage. The images were recorded on SO-163 films (Eastman Kodak), developed with a D19 developer (Eastman Kodak), and digitized with a Scitex Leafscan 45 scanner (Leaf Systems Inc.) at a pixel size of 1.82 Å at the specimen level.
|
|
|---|
Purification of gp5. gp5 expressed in BL21(DE3) cells was purified with Ni affinity, anion-exchange, and gel filtration chromatography to homogeneity (Fig. 3a) (see Materials and Methods). In order to confirm that the purified protein was gp5, the N-terminal amino acid sequence was determined by protein sequencing. The N-terminal 7 amino acid residues were MFMGLDG, which confirmed that the protein was indeed gp5.
![]() View larger version (36K): [in a new window] |
FIG. 3. SDS-PAGE of purified gp5 and the gp5-ORF334 complex. The gels (12.5%) were stained with Coomassie brilliant blue. (a) Purification of gp5 and change in the migration pattern in the presence of urea. Lane 1, standard molecular mass marker; lane 2, soluble fraction of the cell that overexpressed gp5 using pMNK; lane 3, purified gp5 boiled in the presence of urea; lane 4, purified gp5 boiled in the presence of urea; lane 5, same sample as lane 4 boiled in the absence of urea. (b) Purification of the gp5-ORF334 complex. Lane 6, standard molecular mass marker; lane 7, soluble fraction of cells that overexpressed gp48, gp53, ORF334, and gp5 using pMNC; lane 8, purified gp5-ORF334 complex boiled in the presence of urea.
|
gp5 forms a trimer rich in β-structure. In order to establish the nature of the quaternary state of gp5, analytical ultracentrifugation (AUC) and far-UV CD measurements were carried out.
Sedimentation velocity experiments indicated that gp5 exists in solution in a single quaternary state that has a sedimentation coefficient of 6.53 ± 0.14 S. The molecular weight corresponding to this s value, 134,000, is 2.8 times the value calculated based on the amino acid sequence, indicating that it is a trimer (Fig. 4A). Complementary sedimentation equilibrium experiments gave a molecular weight of 145,000 ± 3,000, which is 3.03 times that of the molecular weight of the monomer (data not shown). From these measurements, it was concluded that gp5 is a trimer in solution.
![]() View larger version (56K): [in a new window] |
FIG. 4. Sedimentation velocity. Moving boundaries were measured using A280 (20°C). The sedimentation coefficient at the peak top of c(s) was obtained by SEDFIT analysis, and the molecular mass at the peak top of c(M) was converted from c(s). (Top) Raw data on moving boundaries. (Middle) Residuals between raw data points and the fitted theoretical curve. (Bottom two panels) c(s). (A) gp5. The rotor speed was 40,000 rpm. The sedimentation coefficient was 6.53 ± 0.14 S, and the molecular weight was 134,000 ± 4,000. (B) gp5-ORF334 complex. The rotor speed was 35,000 rpm. The sedimentation coefficient and the molecular weight were 10.1 ± 0.4 S and 297,000 ± 14,000, respectively.
|
-helix, 23%; β-structure, 69%; turn, 15%) (Table 1). As KVP40 gp5 shares significant sequence similarity with T4 gp5 (except for the lysozyme domain [Fig. 1], which is absent in KVP40 gp5), the secondary-structure contents of the N- and C-terminal domains of T4 gp5 were calculated based on the reported three-dimensional structure of T4 gp5 (4). This result was used for comparative analysis (Table 1). As can be seen, T4 gp5 and KVP40 gp5 showed similar secondary-structure contents, except for the lysozyme domain. |
View this table: [in a new window] |
TABLE 1. Estimated secondary structure of gp5
|
The oligomerization state of the gp5-ORF334 complex. In order to determine the association state of the gp5-ORF334 complex, sedimentation velocity experiments were carried out using the peak fraction isolated with the combination expression system. The results of the data analysis using the program SEDFIT are shown in Fig. 4B. There is some shoulder to the left, but the median peak s value was 10.1 ± 0.4 S. When c(s) was converted into c(M), the peak molecular weight was 297,000 ± 14,000, which is close to the expected molecular weight of the heterohexamer, (gp5)3(ORF334)3, namely, 287,000. In order to confirm the molecular weight of the complex, sedimentation equilibrium experiments were carried out. The result of the analysis using a single-species model based on nine datasets (see Materials and Methods) gave the molecular weight of 301,000 ± 12,000, which added further weight to the heterohexamer model (data not shown).
Observation of the complex by EM. The gp5-ORF334 complex was negatively stained and examined by EM (Fig. 5A). The gp5-ORF334 complex appeared similar in shape to the gp5-gp27 complex of phage T4 (Fig. 5B), in which the total length of the globe and the rod is about 253 ± 27 Å. Figure 5C and D very likely represent the dissociation products of the globe and the rod, respectively, where the diameter of the globe and the length of the rod are about 110 ± 10 Å and 190 ± 20 Å. The longer rod measurement is consistent with the fact that ORF334 is predicted to have 17 repeats of VXGXXXXX in the β-helix motif compared with 12 such repeats in T4 gp5, which is 110 Å (Fig. 1).
![]() View larger version (117K): [in a new window] |
FIG. 5. EM images of the gp5-ORF334 complex. (A) A typical image of the gp5-ORF334 complex. (B to D) Magnified images. (B) Combination of a globe (upper arrow) and a rod (lower arrow). (C) Globe. (D) Rod. The scale bars in panels A and B are 100 Å.
|
|
|
|---|
In the T4 phage, gp5 associates with gp27 to form a heterohexamer, (gp5)3(gp27)3 (2). The baseplate assembly in KVP40 has not been investigated, but due to their relatedness, it is expected to be similar to that of phage T4. A BLAST search did not find any ORF in KVP40 sharing significant homology to T4 gp27. On the assumption that such a gp27 homolog existed in KVP40, we chose a number of possible candidates based on the likely ORF size and direction of transcription. From our initial candidate pool (ORFP1sit, ORF339, and ORF334), ORF334 was chosen as the most likely candidate. ORF334 alone was cloned, expressed, and purified. In this work, we demonstrated that ORF334 interacted with gp5 to form a heterohexamer (Fig. 3 and 4). Furthermore, in a similar fashion to gp27, purified ORF334 tends to form aggregates upon standing in solution, suggesting it has somewhat similar properties.
AUC experiments indicated that the gp5-ORF334 complex was not as stable as the gp5-gp27 complex derived from phageT4. This result may indicate that the heterohexamer requires further stabilization by other baseplate proteins involved in subsequent stages of baseplate assembly. The EM images revealed that the KVP40 gp5-ORF334 heterohexameric complex formed a globe (ORF334) binding a rod (gp5), similar to the structures observed in EM micrographs of the gp5-gp27 complex derived from phage T4. The top view of the globe-like structure is shown in Fig. 5C. At 110 Å, the diameter of the globe is slightly larger than that of the corresponding structure in T4 (4). Upon closer examination, the globe appears to be a hexagon containing a central hole. We suggest that this hole is likely to be the center of the ORF334 trimer, as is the case with the T4 gp27 trimeric complex (2, 4). Based on the above observations, we concluded that ORF334 of KVP40 is very likely a homolog of T4 gp27. The amino acid homology between KVP40 ORF334 and T4 gp27 reveals only 15% identity. Such a low value is commonly taken as showing no relatedness; however, such an assessment based on the linear sequence information gives no consideration to the existence of possible three-dimensional structural similarities. Recently, the crystal structure of gp44 from phage Mu was reported (12). This gp44 three-dimensional structure was extremely similar to that of T4 gp27, and the root mean square deviation of the distances between equivalent C
atoms was 2.7 Å. It is also known that both proteins form trimers. Such a tertiary- and quaternary-based structural similarity was unexpected, because (i) Mu and T4 belong to different subgroups of the family Myoviridae, (ii) Mu does not have a gp5-like protein, and (iii) the amino acid homology between the two phage proteins is only 14%. This finding reinforces the importance of comparing the three-dimensional structures rather than just the amino acid sequences. In future, we plan to apply X-ray crystallography to solve the structure of the gp5-ORF334 complex so that we may further prove that ORF334 is indeed the homolog of T4 gp27.
Phage KVP40 is categorized as a Schizo T-even phage (28). Although about 30% of all 386 ORFs in the KVP40 genome have some similarity to those of phage T4, the homology at the amino acid level is less than 50%. The remaining 70% of KVP40 ORFs have no significant homology at the amino acid sequence level with any estimated gene products in the database (21, 23). With regard to the baseplate genes, there is some variety in the locations of gene clusters in the genomes of the KVP40 and T4 phages. In general, functionally related genes form a cluster in the genome. In the case of T4, clusters of related genes are apparent, and the baseplate genes indeed form clusters; however, the cluster which encodes wedge proteins and that which encodes hub proteins are separated by head gene clusters (20) (Fig. 2). It has also been noted that gene 5, which encodes one of the hub proteins, is located in the wedge cluster, and gene 25, a gene that encodes a wedge protein, is located in the hub cluster. Such an arrangement of genes is conserved among the phages closely related to T4, such as RB69 (T-even), RB49, and 44RR2.8t (Pseudo T-even). On the other hand, distantly related phages, such as KVP40 and Aeh1 (Schizo T-even) (23), have only a single cluster of the baseplate genes, and the swapped gene cluster arrangement for gene 5 and gene 25 as seen in phage T4 is not observed in KVP40 or Aeh1 (For T-even, Pseudo T-even, Schizo T-even, and Exo T-even, see the introduction). Both KVP40 and the Aeh1 phage have a larger genome than T4, namely, 245 kbp and 233 kbp, respectively, whereas that of T4 is 169 kbp. Exo T-even phages are more distant from T4 than other T4-related phages and up to now have only included the cyanophages. The inclusion of this group in the T4-related phages is solely based on the similarities of gp23, the head major protein, and gp18, the contractile tail sheath protein. A noticeable difference between these Exo T-even phages and other T4-related phages is that the head is not elongated but rather icosahedral. The homolog of T4 gp5 in these phages is not clear, but orf211 of S-PM2 encodes a protein whose N-terminal domain shows significant homology to T4 gp5. It is a matter of considerable interest whether ORF211 plays the role of the structural protein in the baseplate complex in a manner analogous to the role of T4 gp5. We have cloned orf211 into an expression vector and plan to isolate the gene product for crystallization and subsequent X-ray-based structural analysis.
Recently, Pukatzki et al. reported that the type 6 secretion system of Vibrio cholerae secretes (extracellularly) three related proteins, VgrG-1, VgrG-2, and VgrG-3, and that they are structurally related to the gp5-gp27 cell-puncturing device of bacteriophage T4 (25). In these VgrG proteins, the N-terminal domain resembles gp27 and the middle domain resembles the C-terminal β-helix domain. These proteins do not have the corresponding oligonucleotide/oligosaccharide binding fold in the N-terminal domain or the lysozyme domain of gp5 (25). In this regard, it is interesting that the KVP40 gp5-ORF334 complex resembles the T4 gp5-gp27 complex yet the lysozyme domain is missing in the KVP40 gp5 protein. V. cholerae is one of the hosts of phage KVP40, and the VgrG proteins secreted from the type 6 secretion system are highly conserved in many pathogenic gram-negative bacterial species, including E. coli, which is the host of phage T4. In light of the work presented in this paper, it is fascinating to speculate that the cell-puncturing devices of T4-related bacteriophages and the type 6 secretion system of the host bacteria might in some way be related through the course of evolution. We plan to continue investigating this intriguing possibility.
In summary, we have shown that KVP40 gp5, like T4 gp5, forms a trimer in solution, despite the fact that it lacks the lysozyme domain. No T4 gp27 homologs were detected in KVP40 in a BLAST search; however, KVP40 ORF334, an ORF resident in the baseplate gene cluster, was shown to form a heterohexameric complex with KVP40 gp5 in a manner highly similar to the T4 gp5-gp27 system. Recent discoveries, such as the finding (12) that Mu phage gp44 has a quaternary structure nearly identical to that of T4 gp27 despite sharing no significant sequence homology and the finding (25) that the V. cholerae secretion system is structurally similar to the gp5-gp27 cell-puncturing device of bacteriophage T4, further reinforce the importance of structure-based comparative studies (such as the present effort) for the investigation of the phylogeny and origin of bacteriophages.
We thank Damien Hall for assistance with improving the clarity of expression.
Published ahead of print on 7 March 2008. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»