Microbiology/Membrane Physiology, University of Tübingen,1 Max Planck Institute of Developmental Biology, D-72076 Tübingen, Germany2
Received 29 September 2005/ Accepted 14 November 2005
| ABSTRACT |
|---|
|
|
|---|
J protein, which determines
host specificity.
Since the two proteins differ most in the C-proximal area, where the
binding site to the LamB receptor resides in the J protein, we propose
that Rtp43 contributes to Rtp host specificity. Lipoproteins similar to
the predicted lipoprotein Rtp45 are found in a number of phages
(encoded by cor genes) in which they prevent superinfection by
inactivating the receptors. We propose that, similar to the proposed
function of the phage T5 lipoprotein, Rtp45 prevents inactivation of
Rtp by adsorption to its receptor during cells lysis. Rtp52 is a
putative transcriptional regulator, for which 10 conserved inverted
repeats were identified upstream of genes in the Rtp genome. In
contrast, the much larger E. coli genome has only one
such repeat
sequence. | INTRODUCTION |
|---|
|
|
|---|
Escherichia coli and a selected group of E. coli phages paved the way into molecular biology. Among the phages was T1 (1, 29), whose genome sequence was, however, not determined until 2004, over 60 years after its discovery (75). The phage T1 genome encodes 77 open reading frames (ORFs), 37 of which are homologous to described phage ORFs. Functions have been assigned to 27 ORFs. The largest functional group consists of tail proteins. Twenty-two ORFs encoding putative proteins of fewer than 100 amino acid (aa) residues lie predominantly close to the termini. These putative proteins display no sequence similarity to known phage and prophage proteins.
We report on a novel E. coli phage with a genome similar in gene order and sequence to phage T1. This phage, named Rtp, lysed an E. coli production culture in an industrial fermentation plant. The bacterial resistance spectrum of phage Rtp was different from that of known E. coli phages that use outer membrane proteins as receptors or coreceptors. This finding prompted an electron microscopy study of the phage which revealed a rosette-like morphology at the tip of the tail. To our knowledge, such a morphology has not been hitherto described.
Here we report our analysis of the morphology, receptor specificity, and genome sequence of phage Rtp and its similarities to and differences from phage T1. Unlike phage T1, which uses the FhuA outer membrane protein as a receptor and depends on TonB function for infection, we show here that phage Rtp infection is independent of FhuA and TonB and requires a rough lipopolysaccharide (LPS).
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
SM buffer
(8 mM MgSO4, 100 mM NaCl, 25 mM Tris, pH 7.5) with
106 PFU of phage Rtp for 10 min at 37°C and then for
10 min at 0°C. Cells were pelleted at 12,000 x
g for 1 min, and the PFU counts in the supernatant were
determined. Outer membranes were prepared as described previously (60). Exponentially growing cells in 100 ml of tryptone-yeast extract were harvested at a density of 5 x 108 cells/ml and suspended in a solution containing 2.15 ml of 0.2 M Tris-HCl (pH 8), 0.2 ml of 10 mM EDTA, 0.845 ml of 80% sucrose, 0.2 ml of lysozyme (4 mg/ml), 6 µl of phenylmethylsulfonyl fluoride (37 mg/ml), and 6 µl of aminobenzamidine (43 mg/ml). The cells were lysed after a 10-min incubation; 0.2 ml each of DNase (1 mg/ml) and RNase (1 mg/ml) and 6.4 ml of double-distilled water were then added. Ten milliliters of buffer containing Triton X-100 was added, and the suspension was centrifuged for 1 h at 30,000 x g at 4°C. The outer membrane fraction in the sediment was washed once with 0.2 ml of water and then with 0.4 ml of 0.2 M Tris-HCl (pH 8) and finally suspended in 0.4 ml of buffer. One sample was supplemented with 10 µl of proteinase K (10 mg/ml), and the control sample received 10 µl of 0.2 M Tris-HCl (pH 7)-0.1% sodium dodecyl sulfate. Both samples were shaken for 1 h at 40°C. The outer membranes were pelleted, washed four times with 0.1 ml of water, and suspended in 0.1 ml of 0.2 M Tris-HCl (pH 7). The membranes were incubated with 106 PFU of phage Rtp for 10 min at 37°C with gentle shaking. The control contained the same concentration of phage Rtp in the same buffer without outer membranes. The sample was centrifuged to pellet the outer membranes, and the phage titer in the supernatant was determined by plating on E. coli W3110.
Electron microscopy. Phage morphology was examined by electron microscopy of high-titer phage samples prepared as described above and adsorbed to freshly glow-discharged grids coated with a carbon support film. The mounted phages were washed with water and negatively stained with 1% aqueous uranyl acetate. Micrographs were taken at a primary magnification of x52,000 with a Philips CM10 transmission electron microscope.
Isolation of phage Rtp DNA.
Phage Rtp DNA was
prepared by the protocol of QIAGEN (Hilden, Germany) for
DNA.
A purified phage preparation (0.2 ml) was diluted with 100 ml of buffer
containing 100 mM Tris-HCl (pH 7.5), 10 mM EDTA, 300 mM NaCl, 20 mg/ml
RNase A, 6 mg/ml DNase, and 0.2 mg/ml bovine serum albumin. After
incubation for 30 min at 37°C, the mixture was cooled to
0°C and mixed with 20 ml of ice-cold 30% polyethylene glycol
6000-3 M NaCl. After incubation overnight, the suspension was
centrifuged for 30 min at 38,000 x g. The sediment was
suspended in 6 ml of 100 mM Tris-HCl (pH 7.5)-100 mM
NaCl-25 mM EDTA. Sodium dodecyl sulfate (6 ml, 4%) was added,
and the mixture was incubated for 10 min at 70°C. Then, 2.4 mg
of proteinase K was added and the incubation continued for 10 min at
60°C. The mixture was cooled to 0°C and mixed with 6 ml
of ice-cold 3 M sodium acetate (pH 5.5). After 15 min of incubation on
ice, the mixture was centrifuged for 30 min at 38,000 x
g. During all incubation steps, the mixtures were gently
shaken. The supernatant was filtered through a QIA filter and then
placed on a QIAGEN-tip 100 equilibrated with 50 mM
3-(N-morpholino)propanesulfonic acid (MOPS; pH
7.0)-750 mM NaCl-15% ethanol-0.15% Triton
X-100. The tip was washed twice with 10 ml of 50 mM MOPS (pH
7.0)-1 M NaCl-15% ethanol; the DNA was then eluted with
10 ml of 50 mM Tris-HCl (pH 8.5)-1.25 M NaCl-15%
ethanol. The DNA solution was collected in 7 ml of isopropanol, and the
precipitated DNA was centrifuged for 30 min at 38,000 x
g. The DNA pellet was washed with 70% ethanol, dried in vacuo
for 10 min, dissolved in 80 µl of 10 mM Tris-HCl (pH
8.0)-1 mM EDTA, and used for DNA
sequencing.
DNA sequencing. The genome of phage Rtp was sequenced by MWG (Ebersberg, Germany) by the dideoxy chain termination method. Sequencing involved generation of a random genomic library in the plasmid sequencing vector pBluescript SK+, bulk sequencing of 384 shotgun clones, and closure of the remaining gaps. The average size of the hydrodynamically sheared DNA inserts was 1.4 kb. A single gap of approximately 5.0 kb remained. This region was amplified by using PCR with the 24-mer oligonucleotides 5'-GAGGTTCAGATTCAGTCAGGTGAG and 5'-TGCACTGAATGTAACCTCTAGGGC and the Roche Expand High Fidelity PCR kit according to the manufacturer's instructions. The sequence of the PCR product on both strands was determined by GATC (Constance, Germany) by primer walking with the purified PCR fragment as a sequencing template. PhredPhrap and Consed (39) were used to assemble the sequencing reads into a single circular contig.
DNA manipulations and restriction mapping.
Standard methods were used for DNA
manipulations and restriction mapping
(78). To test for
cohesive end sites in the chromosome of phage Rtp, one sample of a
complete restriction enzyme DNA digest was heated to 80°C for
10 min to disrupt the annealing of potential complementary overhangs
and then rapidly cooled on ice prior to gel electrophoresis. A second
sample was heated to 55°C and allowed to cool slowly to room
temperature to promote annealing of potential complementary ends before
being loaded onto a gel next to the heated sample. HindIII-digested
DNA was used as a positive control to detect annealing of
cohesive ends. The 23.1-kb and 4.4-kb fragments annealed at the
12-nucleotide (nt) cohesive ends and together formed a 27.5-kb
fragment.
Sequence analysis software
Protein coding
genes were predicted with GeneMarkS
(8) and Zcurve
(42). The predicted
translational start sites were corroborated by inspection of
E. coli ribosome-binding sites (consensus sequence,
AAGGAGGT;
62,
99). tRNAscan-SE was used
to search for tRNA genes
(61). BLAST, PSI-BLAST,
and Pfam (2,
6) were used for
similarity searches. Multiple-sequence alignments were generated with
ClustalW (83;
http://www.ebi.ac.uk/clustalw).
Protein motifs were scanned at the Prosite server
(50;
http://www.expasy.org/prosite),
and DNA motifs were examined with a local installation of PatScan
(34). DNA was restriction
mapped in silico at
http://www.restrictionmapper.org/.
BetaWrap (10;
http://betawrap.lcs.mit.edu)
was used for the prediction of ß-helix folds in proteins, and
3D-PSSM (58;
www.sbg.bio.ic.ac.uk/
3dpssm)
was used for protein fold recognition and remote homolog detection.
DNA-binding helix-turn-helix motifs were predicted by using the service
for the Dodd-Egan method at the Pôle Bioinformatique Lyonnais
(32;
http://pbil.univ-lyon1.fr).
Relevant genomic data of completely sequenced phages mentioned in this
report were inspected with the Protein View facility of the Genome
database section at the National Center for Biotechnology Information
(NCBI;
http://www.ncbi.nlm.nih.gov/).
Nucleotide sequence accession number. The complete nucleotide sequence of the phage Rtp genome has been deposited in the EMBL nucleotide sequence database under accession number AM156909.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
|
The receptor specificities of phage Rtp and phage T1 differ. The unusual morphology of the phage Rtp tail tip prompted us to identify the E. coli receptor. We tested whether Rtp, like phage T1, uses the FhuA protein as a receptor and requires TonB for infection and ExbB and ExbD for energization of FhuA (Table 1) since the sequence of the Rtp genome had high similarity to that of phage T1 (see below). Three fhuA mutants of E. coli K-12KO483, HK97, and H1880were examined and found to be fully sensitive. Three tonB mutants derived from three different parental strains and exbB and exbD mutants were also fully sensitive. The functions of ExbB and ExbD can be partially replaced by TolQ and TolR (14). However, tolQ and tolR mutants were as sensitive as parental strain GM1 (Table 1), which for unknown reasons poorly supported Rtp propagation. These results showed that the receptor specificity of phage Rtp differs from that of phage T1.
A series of other E. coli K-12 mutants with defects in defined, well-studied outer membrane proteins and LPS were tested (Table 1). All of the protein mutant strains were sensitive to phage Rtp. In contrast, E. coli B was completely resistant. Since E. coli B lacks the OmpC porin, which is present in the K-12 strains tested, OmpC was a candidate for the phage Rtp receptor. However, the E. coli B derivative BL21(DE3) and the E. coli K-12 ompC mutants KB429, KB5, and JWC30 were sensitive, showing that OmpC did not serve as the phage Rtp receptor. As will be shown below, resistance of E. coli B is probably caused by restriction of the phage genome.
To test whether LPS serves as a receptor, the sensitivity of rfa and rfb mutants of E. coli D21 was determined. The rfa-1 mutant D21e7 was 3 orders of magnitude less sensitive, and the rfa-1 rfa-21 mutant D21f1 was 4 orders of magnitude less sensitive than parental strain D21 (Table 1), which suggests that LPS contributes to binding of phage Rtp to E. coli. The major core sequence of LPS in strain D21 has been proposed to be GlcNAc-(Glc)2-Glc(Gal)(Hep)4-KDO-lipid A (9, 46), whereKDO is 2-keto-3-deoxyoctulosonic acid. Strain D21e7 lacks GlcNAc-(Glc)2, and strain D21f1 lacks, in addition, Glc(Gal). Additional evidence for involvement of LPS in phage Rtp infection came from the partial sensitivity of E. coli F464 and the complete resistance of its R1 core LPS mutant F470 (Table 1). Because of the lack of chemically well-defined E. coli LPS mutants and the variations in LPS structure, e.g., heterogeneous glycoforms in the core sequence and side chains with various amounts of phosphate and ethanolamine, we did not further test which LPS substructure contributes to phage Rtp binding.
Attempts to determine phage Rtp binding to viable cells failed under various experimental conditions. Exponential-phase cells (108 cells/ml) and stationary-phase cells in nutrient broth and in phage adsorption buffer were incubated at 37°C for 15 min with freshly prepared phage suspensions (106 PFU). No reduction in the number of PFU was observed. The experiments were repeated with outer membrane preparations of sensitive strains W3110 and MH225 and their spontaneous Rtp-resistant derivatives. Tenfold fewer PFU formed with the outer membranes of both the sensitive and resistant strains. Degradation of outer membrane proteins by proteinase K did not decrease, but rather increased, phage inactivation. These results are consistent with LPS playing a role in phage Rtp binding; i.e., presumably, proteinase K increased access of the phage to LPS. Since proteins embedded in the outer membrane are largely resistant to proteases, the Rtp receptor can be a protein. We propose that LPS is not the only receptor but facilitates adsorption, as has been shown for other phages, e.g., phage T5, where the L-shaped tail fibers bind to the LPS O9 antigen (47) and protein pb2 binds to the FhuA protein (48). Reversible binding to O9 accelerates T5 phage infection 15-fold, but infection occurs only through FhuA, which determines host specificity. Similarly, phage Rtp binding may be facilitated through LPS, to which the phage binds in wild-type and mutant outer membrane preparations. The receptor required for infection would then be another outer membrane component, presumably a less-well-characterized protein of which no mutant form was in our collection. Apparently, phage Rtp adsorbs poorly to cells, like phage T5 lacking the L-shaped tail fibers or when incubated with cells lacking the O antigen.
The Rtp genome is linear but circularly permutated. Restriction mapping of phage Rtp DNA revealed a double-stranded molecule of 46 to 48 kb. Despite the high coverage of the shotgun library (11.8-fold), commercial shotgun sequencing of the Rtp phage DNA and additional genome walking yielded a contig of only 41,234 bp. The remaining gap was covered through PCR amplification with primers placed close to the ends of the known sequence. The amplified DNA fragment of approximately 5.2 kb was sequenced directly on both strands by primer walking. The sequence obtained overlapped the ends of the 41,235-bp sequence and contained an additional 4,985 bp. The final assembly of the Rtp genome produced a 46,219-bp circular sequence. Restriction sites in the genome predicted in silico agreed with the experimentally determined restriction sites (data not shown), confirming correct genome assembly and supporting an apparently circular physical map. In addition to the expected fragments of the circular map, a single DNA fragment was observed in each digest that was present in submolar concentrations. The size of the fragment was dependent on the restriction enzyme used. These observations were consistent with a linear but circularly permutated genome of phage Rtp generated by a headful packaging mechanism (17, 52; see the later section on terminase and DNA packaging).
Overall features of the genome Phage Rtp multiplied on E. coli K-12 host strains but not on the restriction-proficient, hsdS+ E. coli B strain. Restriction-deficient E. coli B strain BL21(DE3) hsdS was sensitive. A computer search for strain-specific restriction sites in the Rtp genome sequence (76; http://rebase.neb.com) provided a rationale for this observation. We found 10 target sites for the EcoBI restriction system [TGA(N8)TGCT] but none for the EcoKIrestriction system [AAC(N6)GTGC]. Apparently, the genome was degraded by the EcoB1 endonuclease in E. coli B hsdS+.
The G+C content of the genome is 44.3 mol%, which is lower than the 51 mol% of the E. coli host. Phage Rtp does not carry any tRNA genes, as determined by tRNAscan analysis (61).
Seventy-five ORFs were identified with the complementary gene prediction programs GeneMarkS and Zcurve1.0 (8, 42). Agreement between the two programs was high; 70 of 75 genes found by GeneMarkS were also predicted by Zcurve 1.0, albeit sometimes with an alternative start codon. A large proportion of the predicted genes, i.e., 34 of 75 (45%), mostly located near the ends of the genome map as defined in Fig. 2, code for proteins of fewer than 100 aa. Only eight of these short gene products are significantly similar to proteins in the NCBI nonredundant protein database or could be predicted as translation products by TBlastN searches of the GenBank nucleotide database.
|
Rtp is closely related to coliphage T1 Genome comparison of phage Rtp with T1 showed that 47 (63%) of the Rtp genes were homologous to genes of phage T1, covering almost entirely the functionally annotated regions of the phage T1 genome, extending from genes encoding the terminase, head, tail, recombination, replication, and lysis proteins (Fig. 2 and Table 2). There is extensive synteny between the two genomes, and with the exception of three genes (rtp6, rtp44, and rtp59; discussed below), the closest homologs of Rtp genes are all found in phage T1.
|
In their discussion of the phage T1 genome, Roberts et al. (75) mentioned that the genome of a very close relative of phage T1, phage TLS (38), has been completely sequenced by Gregory German and Rajeev Misra at the University of Arizona, but this genome sequence has not been deposited in a public database. There are clear differences that distinguish phages Rtp and TLS from each other. (i) The genome of TLS contains a cytosine methylase (dcm), which is lacking in Rtp. (ii) TLS lacks ORFs 30, 31, and 32 (cor) found in T1, whereas Rtp carries three genes in that position (rtp44, rtp45, and rtp46), although these are not homologous to the T1 genes. (iii) TLS infection occurs through the TolC protein (5, 38, 66), in contrast to Rtp, which is independent of TolC. Therefore, the Rtp genome represents the second published sequence of three very closely related T1-like phages with three distinct receptor specificities.
Terminase and DNA packaging
The rtp19 and rtp21 genes code for the typical small and large subunits,
respectively, of a phage terminase protein complex that initiates,
drives, and terminates translocation of phage DNA into proheads
(17,
21,
37). The small subunit
has DNA-binding activity, and the large subunit provides ATP-binding,
prohead-binding, and DNA cleavage activities
(18,
20,
40,
41). Usually, the two
subunits are encoded by adjacent genes, as in phage T1. The two
predicted Rtp terminase genes show high end-to-end similarity to their
T1 homologs (Table 2), but
they are separated by rtp20, which encodes an HNH-type
endonuclease (24,
27,
54). Casjens et al.
(20) found a strong
correlation between the phylogenetic relationships of phage terminase
large subunits and the structure of the corresponding genome ends. In
their comparison of 114 large terminase subunits, they distinguished
the following types of phage chromosome packaging: (i)
5'-extended cos ends generated by terminase subunits related to
the
gene A product; (ii) 5' cos ends generated by
P2-like terminase subunits; (iii) 3' cos ends (e.g.,
E. coli phages HK022 and HK97); (iv) T7-like
packaging, which generates terminal direct repeats but no circular
permutation; (v) P22-like headful terminases; (vi) T4-like headful
terminases; (vii) Mu-like headful terminases, and (viii) GTA-like
terminases of transduction-deficient prophages. The comparative
analysis also shows that headful packaging terminases constitute a
diverse group of sequences for which there is no evidence of a
monophyletic origin. The closest homolog of the large terminase
subunits of Rtp and T1 in a free-living phage is found in phage Aaphi23
of the periodontal pathogen Actinobacillus
actinomycetemcomitans (Table
2). Phage T1 and Aaphi23
DNAs are packaged by the headful mechanism, resulting in terminal
redundancy and circular permutation
(72,
95). Restriction analysis
of Rtp DNA argues against the presence of cohesive ends and for
packaging of circularly permutated genome units. The agarose gel
electrophoresis pattern of single digests of Rtp DNA with PstI (two
recognition sites), BamHI (three sites), XbaI (seven sites), EcoRI
(eight sites), or XhoI (nine sites) was not affected by heating to
80°C or by promoting annealing of potential complementary
overlaps of terminal fragments prior to electrophoresis. Under the same
conditions, annealing of the 12-nt-long complementary 5'
extensions in HindIII-digested
DNA was readily detected (see
Materials and Methods). The following Rtp restriction fragments that
are not predicted by the in silico restriction map were found at
submolar concentrations as sharp bands on agarose gels: XbaI (3.6 kb),
XhoI (5.6 kb), EcoRI (6.0 kb), and PstI (10.2 kb). Undigested Rtp DNA
produced only a single high-molecular-weight band. An additional
restriction fragment of reduced concentration, the pac
fragment, is a hallmark of the headful packaging mechanism. Therefore,
packaging of Rtp appears to be initiated at a precise location on a
concatemeric substrate
(17,
52). This pac
site is only used once in a packaging series, so that only the first
packaged DNA of the series has a distinctive left end. The positions of
the XbaI, XhoI, EcoRI, and PstI sites permitted unambiguous
determination of the location of pac. The Rtp pac
site is located approximately 3.6 kb (± 0.2 kb) to the left of
the XbaI restriction site found at positions 3689 to 3694 of the genome
sequence (Fig. 2). By
pairwise alignment searches, we failed to identify any obvious DNA
sequence conservation in the regions of the Rtp and T1 genomes that are
predicted to contain the pac sites. The pac site of
T1 was mapped 1 kb to the left of an EcoRI
restriction site (EcoRI site at nt 1323 to 1328 in
the T1 genome sequence; GenBank accession no.
NC_005833)
(71). The five repeats of
the ATATA sequence mentioned by Roberts et al.
(75) as a possible motif
in the pac region of phage T1 are not present in phage Rtp.
The finding that the virion DNA of phage Rtp is generated by the
headful mechanism supports the hypothesis of Casjens et al.
(20) that the packaging
type can be predicted from the phylogeny of the large terminase
subunit. In their analysis, the terminase of phage Aaphi23 formed a
weakly supported group with only one other sequence from lactococcal
phage TP901-1. The present data expand the cluster to four sequences
(from T1, Rtp, Aaphi23, and TP901-1) that appear to form a subfamily of
headful terminases distinct from the currently more numerous P22-like
large subunits.
Head genes The region from rtp23 to rtp43 contains genes for head and tail morphogenesis; this region is highly syntenic between phages Rtp and T1. The precise location of the border between the head and tail genes is difficult to predict from the available database information. Rtp23 is a putative portal protein. Close homologs in free phages are only found in phages T1 and Aaphi23. Rtp24 is related to the Pfam family phage_Mu_F (6). Another member of this family, gp7 of Bacillus subtilis phage SPP1, is a minor head protein that interacts with portal protein gp6 (51, 81). Rtp25 is a possible prohead protease. Rtp25 is similar to products of conserved gene cluster COG3566, which was shown by computational analysis to belong to a superfamily of prohead proteases that includes proteins of double-stranded DNA bacteriophages and herpesviruses (23). The downstream rtp26 gene is duplicated in phage T1 as orf49 and orf48, and these are the only clear homologs. Proteome analysis of phage T1 has shown that both gp49 and gp48 are abundant proteins (75), but their mutual similarity has not been noted previously. Rtp27 displays high end-to-end similarity to the putative major T1 head protein gp47.
Tail genes
Differences in an otherwise highly
conserved region of tail genes may localize the distinct specificities
of phages Rtp and T1 for cellular surface receptors used for infection.
The gene order from rtp30 to rtp50 is the same as in
phage T1, with two gene insertions in Rtp (rtp34,
rtp40) and a potential module replacement where Rtp genes
rtp44, rtp45, and rtp46 replace T1 genes 32,
31, and 30. rtp33 codes for a homolog of the putative major
tail protein gp41 of T1. For the genes rtp30, rtp31,
and rtp32, there is no sequence-based evidence that they are
involved in tail formation. The 3' end of the rtp35
coding sequence contains a slippery sequence motif
(AAAAAAC-stop codon) that might permit a
translational 1 shift, thus leading to the translation of an
Rtp35-Rtp36 fusion protein across the 8-nt intergenic gap in addition
to the expression of proteins Rtp35 and Rtp36. In tailed phages, such
programmed translational frameshifting seems to be typical for genes
corresponding to the
G/GT region upstream
of the gene encoding the tape measuring protein
(97).
Rtp37 is a
tail tape-measuring protein with many homologs in phages and prophages
of gram-negative bacteria, including the
gene H product. The
rtp38 homolog orf37 in phage T1 and the three
downstream T1 genes are conserved tail genes also found in
-like phages and prophages of various members of the
subgroup of proteobacteria, including T1, N15, HK022, HK97,
So, PhiE125, Gifsy-1 and -2, Fels-1, CP933-O, CP933-K, and
(genes M, L, K, and I;
see reference 75 for a
phylogenetic analysis). Notably, in phage Rtp, this run of four genes
is separated by the insertion of a second HNH-type endonuclease gene
(rtp40). This endonuclease gene overlaps downstream tail gene
rtp41 by 20 bp without changing the amino acid conservation
that exists between the N termini of Rtp41 and T1-gp35.
Sequence
similarity searches for Rtp41 revealed not only homology to many
putative tail assembly proteins of phages and prophages, including
tail protein K, but also similarity to more distantly related
bacterial endopeptidases and cell wall-associated hydrolases. In silico
domain analysis suggests that the 252-aa Rtp41 protein is composed of
two domains, both of which are related to proteases. The Conserved
Domain search service integrated into NCBI BlastP searches
(63) indicated the
presence of an N-terminal COG1310 domain (predicted metal-dependent
protease of the PAD1/JAB1 superfamily) and a C-terminal NlpC/C60
domain. The latter was also clearly identified in a Pfam search
(6). NlpC/C60 is part of a
superfamily of cysteine/histidine-dependent amidohydrolases/peptidases
(4,
7,
74). Since the
protease-defining residues of the CHAP family are well conserved in
Rtp41 and its homologs in
and T1, one can envisage an
enzymatic role for this widespread and highly conserved tail protein of
lambdoid phages, for example, in the assembly of the tail. In fact,
tape-measuring protein H of
is cleaved toward the end of tail
assembly by an unidentified protease
(55,
85). Alternatively, Rtp41
and related proteins may be involved in cell wall degradation during
infection of the host.
Rtp42 is a homolog of conserved
tail assembly protein I, and Rtp43 is homologous to the tail fiber J,
which determines
host specificity
(36,
94). The presence of
several
-related genes in the tail region of Rtp prompted us
to determine how many conserved genes are shared between Rtp and model
phage
. A BlastP search (E-value limit, 10E-5) with all of the
predicted Rtp gene products against a
protein database
revealed that sequence similarity between the two genomes was limited
to adjacent tail genes rtp37, -38, -39,
-41, -42, and -43 and
H,
M, L, K, I, and J. These
gene products and
G form the tail assembly
initiation complex, which precedes the polymerization of major tail
protein V up to a shaft length that is determined by the sequence
length of protein H (reviewed in references
55 and
56). It appears that
phages Rtp and T1 have a conserved
-like initiation mechanism
for the assembly of their lambdoid tails and that the amino acid
sequences of the major subunit proteins can be completely
unrelated.
The putative tail fiber proteins Rtp43 and T1-gp33 are
probably involved in phage adsorption to their E.
coli outer membrane receptors. Differences between these
highly similar proteins may indicate the differing receptor
specificities. Sequence comparisons of the C-terminal part of Rtp43
with closely related tail fibers are of particular interest since the
last 249 residues of
J bind specifically to the cognate
receptor LamB (90). A
comparison of Rtp43 with its closest homologs is summarized in Table
3. The lengths of the related tail fibers from phages Rtp, T1, HK97,
HK022, N15, phi1026b, phiE125, and ES18 vary from 1,061 to 1,296
residues. The very long J fiber homolog of phiKO2 (3,433 aa) is an
exception. This protein can be viewed as an approximately 1,300-aa
sequence with a long insertion made up of eight evenly spaced imprecise
repeats of 130 to 180 aa
(19). Multiple-sequence
alignment of these tail fibers revealed that the N-proximal 800 to 900
aa residues are more strongly conserved than the C-proximal part, whose
length also varies more than that of the N-proximal portion. Four of
the nine closest fiber homologs in free phages belong to FhuA-dependent
phages T1, HK022, N15, and ES18 (Table
3). The amino acid
sequence conservation in the C-proximal part of the various
J-like proteins is reduced to different degrees. The T1 fiber is very
similar to Rtp43, whereas
J has the lowest score. The fiber
protein of ES18 has an overall score slightly above that of
J, but it aligns much better with Rtp43 in the C-proximal region. We
propose that Rtp43 has not undergone a major domain exchange in the
C-proximal region to yield a new receptor specificity. If Rtp43
determines the host range of Rtp, which differs from that of T1, then
the difference is likely caused by minor changes in the amino acid
sequence, probably in the variable C-terminal region. This conclusion
is supported by the finding that small sequence changes lead to new
receptor specificities. For example, mutations in the hypervariable
region of the long tail fibers of phage Ox2 change the receptor
specificity from the wild-type OmpA protein to the OmpC and OmpX
proteins, and even to LPS
(33). Similar results
were obtained with host range mutations in T-even-type phage M1, which
changed specificity from OmpA in the wild type to OmpC and subsequently
to OmpT
(45).
|
J homolog. In phage HK97,
the additional tail fiber gene, stf or orf23, is
located immediately downstream of the
J homolog. In
HK022, the tail fiber genes are separated by three genes that are
conserved among phages N15, ES18, and HK022. One of these conserved
genes is the lysogenic conversion gene cor
(88).
Are rtp45, cor, and the T5 gene llp functional analogues?
The most notable difference between the central regions of the Rtp and T1 genomes is the
lack of sequence similarity of the rtp44, -45, and
-46 genes in Rtp and orf32, -32, and
-30 (cor) in T1 downstream of the tail gene operons.
Homologs of orf32, -33, and -30
(cor) are found at the same genome location and in the same
order in five phagesT1, HK022, N15, ES18, and
80all of which use FhuA as a receptor. This set of
three genes is strictly limited to these phages, as shown by
comprehensive BlastP and TBlastN searches of all of the phage and
bacterial sequences in GenBank. Lambdoid FhuA- and TonB-dependent phage
mEp167 also contains a cor homolog, but sequence information
is limited to this gene only
(86). Related phages with
different receptor specificitiesHK97 (receptor, LamB;
31), TLS (receptor, TolC;
38), and Rtp (receptor
unknown)lack sequence similarities. The small cor
gene was originally identified in
80 as a lysogenic conversion
gene that prevents adsorption of superinfecting
80 to FhuA in
E. coli lysogens carrying a
80 prophage
(65,
88; note: the correct ORF
for cor of
80 and N15 was identified by Vostrov and
colleagues [88], but Cor
database entry BAA00267 still contains the incorrect protein sequence).
We noticed a number of similarities between the Cor family and the
llp gene product of phage T5, which inactivates the phage T5
FhuA receptor (15,
28), a receptor-blocking
protein from T5-related phage BF23 (GenBank accession number
AAZ03643), and the predicted Rtp45 gene product.
The proteins are putative outer membrane proteins (shown for Llp of
phage T5). Rtp45 and the other lipoproteins contain an N-terminal
sorting signal (positions 2 and 3 of the mature protein; Fig.
3) that favors translocation into the outer membrane
(84). Their N termini
conform to the Prosite consensus for prokaryotic lipoprotein lipid
attachment sites (50;
http://www.expasy.org/prosite).
The genes are located next to tail fiber genes (Fig.
4). The lengths of the putative lipoproteins (a lipid has been shown for
Llp of T5) are very similar, between 76 and 80 residues. Only Llp of
phage BF23 is slightly longer, with 88 residues (Fig.
3). Although the amino
acid sequences of the mature Cor proteins differ from the lipoproteins
of Rtp, T5, and BF23, the hydrophobicity profile of Rtp45 is similar to
those of the Cor proteins
(60). In the cases
studied, genetic and biochemical evidence demonstrates inactivation of
the host phage receptor. Receptor inactivation is not only advantageous
to prevent superinfection of lysogenic bacteria carrying prophages but
may also serve to prevent phage inactivation by receptors when phages
are released from lysing bacteria
(28). This explains why
nonlysogenic phages like Rtp, T1, and T5 synthesize
receptor-inactivating lipoproteins. We propose that all known Cor
homologs are outer membrane lipoproteins that inactivate receptors and
that Rtp45 and Llp of BF23 share this property.
|
|
Recombination, replication, and lysis modules. The genes rtp47, rtp48, and rtp49 show 44, 43, and 45% sequence identities, respectively, to the genes encoding the putative T1 recombination module (Table 2). Rtp47 is a homolog of RecE, a 5' exonuclease (25). Rtp48 is a member of the Erf (essential recombination function) family. rtp49 and orf27 of T1 both encode putative single-stranded DNA-binding (Ssb) proteins. Roberts et al. (75) identified the biological function of the Ssb protein as replication. However, as Ssb of phage T7 is required for both replication and recombination (49), this might also be the case for T1 and Rtp. Rtp51 (DNA primase) and Rtp53 (ATP-dependent helicase) share 42 and 51% sequence identity, respectively, over their entire length with the predicted replication proteins gp24 and gp22 of T1. Similarity to other bacterial and phage primase and helicase homologs further supports these functional assignments. gp23 of T1, whose function is unknown, is also conserved in Rtp and T1. rtp52 and its homolog orf23 code for putative DNA-binding transcriptional regulators (see below).
Like T1, Rtp contains two closely linked lysis genes, coding for a holin and an endolysin. The first genes of the lysis cassette (encoding those for Rtp63 and T1-gp13) code for holins that form pores in the cytoplasmic membrane. The products of the downstream genes are phage endolysins (Rtp64 and T1-gp12) that cleave the ß-1,4 linkages between N-acetyl-D-glucosamine and N-acetylmuramic acid in peptidoglycan heteropolymers of prokaryotic cell walls (Pfam PF00959, phage lysozyme) (75). The function of the surrounding genes that are conserved between Rtp and T1 is unknown (Table 2).
Identification of a putative transcriptional regulator in Rtp and T1. The predicted proteins Rtp52 (157 aa) and T1-gp23 (150 aa) are 51% identical over 133 residues, and no other closely related sequences were detected by BlastP. In both phages, the transcription polarity of these genes is the opposite of that of the adjacent DNA helicase genes. The length of the intergenic regions, putative divergent promoter regions, is approximately 100 bp in both phages. Analysis of Rtp52 and T1-gp23 for the presence of a DNA-binding domain yielded helix-turn-helix motifs with confidence levels of close to 100% for Rtp52 and approximately 50% for T1-gp23 (32) (Fig. 5). The helix-turn-helix motifs were in the same location in the two homologs. Other sequences that are distantly related to Rtp52 and T1-gp23 were detected with significant E values by a PSI-BlastP search. The result of the fifth iteration is shown in Fig. 5. The best hits of the search are all transcriptional regulators, followed by a mixture of transposases and more transcriptional regulators. All of the homologs tested contain a predicted helix-turn-helix motif, with various degrees of confidence, in a position equivalent to that of the Rtp and T1 proteins. Conservation of certain amino acid residues was limited to the helix-turn-helix region. The data suggest that both Rtp52 and T1-gp23 are transcriptional regulators with DNA-binding activity.
|
70-dependent promoters.
The multiple-sequence alignment shows that hexamers 22 bases downstream
from the inverted repeat motif conform well to the 10 region
of
70 promoters (12 to 7 region),
especially at functionally important and highly conserved positions
12, 11, and 7
(35,
44,
64). The distance from
predicted 10 boxes to the predicted start codons is in most
cases between 20 and 40 nt, often close to 30 nt (Fig.
6B). In contrast, there
are no obvious 35 boxes when promoter spacer regions of 16 to
18 nt are allowed. The right unit of the inverted repeats is close to
the 35 box recognized by region 4.2 of
70.
It seems possible that the TAGCA inverted repeats
with a 5-nt intervening sequence might be binding sites for a
transcriptional activator. There are additional highly conserved bases
outside the TAGCA repeats (marked with an asterisk
in Fig. 6) which were not
included in our
interpretation.
|
Concluding remarks. Comparison of the genomes of phages Rtp and T1 shows (i) a remarkable overall similarity of gene content and synteny; (ii) diverged genome endsas defined in the chromosome maps in Fig. 2with limited gene conservation to the left of the terminase genes (5.5 kb at the left end) and no sequence conservation across the rightmost 1.7-kb segment of the Rtp genome; (iii) distinct numbers of HNH-type endonuclease genes (two in Rtp, three in T1); (iv) modular exchanges of rtp44, -45, and -46 and orf32, -31, and -30 of phage T1; (v) putative secondary tail fibers Rtp50 and T1-gp26; and (vi) a single conserved putative transcriptional regulator in both phages.
Rtp shows a novel tail tip morphology which displays distant similarity to phage T1. The greater thickness and rigidity of the four leaf-like structures protruding at the tail tip of phage Rtp in comparison to the tail tip of phage T1 might be caused by amino acid substitutions in the FibJ homolog Rtp43. We favor the idea that the nonhomologous replacements involving rtp44 and rtp45, as well as the approximately 800-residue unique C-terminal domain of Rtp50, are responsible for the morphogenetic changes in the tail tip which probably cause the different receptor specificity of Rtp. The conservation of a putative three-gene cor module in FhuA-dependent phages (orf32, -31, and -30 of phage T1) warrants further studies.
Both Roberts et al. (75) and we found signs of multiple promoter regions. Studies of gene expression in T1 date back to 1977 (89) and were only directed at the level of translation. These analyses revealed three stages of T1 gene expression: early, early-late, and late. It will be interesting to find out where transcription start points are located, whether Rtp52 and T1-gp23 are indeed novel DNA-binding transcriptional regulators, whether they function as activators or repressors, and where their target sites might be. So far, T1-like phages could only be classified on the relatively weak basis of morphology and virulence. Now there are molecular criteria available for the classification of new members of the T1 group.
| ACKNOWLEDGMENTS |
|---|
This work was supported by the Fonds der Chemischen Industrie.
| FOOTNOTES |
|---|
| REFERENCES |
|---|
|
|
|---|