A1122 Reveals an Intimate History with the Coliphage T3 and T7 Genomes
Biology and Biotechnology Research Program, Lawrence Livermore National Laboratory, Livermore, California,1 Division of Vector-Borne Infectious Diseases, National Center for Infectious Diseases, Centers for Disease Control and Prevention, Fort Collins, Colorado,2 Molecular Genetics and Microbiology, Institute for Cell and Molecular Biology, University of Texas, Austin, Texas3
Received 3 April 2003/ Accepted 27 May 2003
| ABSTRACT |
|---|
|
|
|---|
A1122 has been determined.
A1122 grows on almost all isolates of Yersinia pestis and is used by the Centers for Disease Control and Prevention as a diagnostic agent for the causative agent of plague.
A1122 is very closely related to coliphage T7; the two genomes are colinear, and the genome-wide level of nucleotide identity is about 89%. However, a quarter of the
A1122 genome, one that includes about half of the morphogenetic and maturation functions, is significantly more closely related to coliphage T3 than to T7. It is proposed that the yersiniophage
A1122 recombined with a close relative of the Y. enterocolitica phage
YeO3-12 to yield progeny phages, one of which became the classic T3 coliphage of Demerec and Fano (M. Demerec and U. Fano, Genetics 30:119-136, 1945). | INTRODUCTION |
|---|
|
|
|---|
| Y. pestis bacteriophages. |
|---|
|
|
|---|
The most detailed studies of Y. pestis phages have been made by investigators in the former Soviet Union. The d'Herelle phage and many other strains isolated in the former USSR comprise four serovars. Serovar 1 is by far the most common and is represented by the d'Herelle and Pokrovskaya lytic phages; both of these phages exhibit T7-like growth curves and morphology (4, 74) and are probably members of the T7 phage family. Serovar 2 phages are P2-like (E. Garcia et al., unpublished data); they were originally defined by phages H (46) and L-413 C (34). (This phage H is not identical to the phage H described by Molnar and Lawton [41], which is discussed below.) Serovar 3 is represented by lysogenic phage
(47), and serovar 4 is represented by the lysogenic phages Tal and 513 (48). The three lysogenic phage serovars are said to be heteroimmune.
Although early work of d'Herelle was directed towards curing bubonic plague by administration of phage (phage therapy), which was, by his own account, a success, it did not become the focus of subsequent studies (68). Rather, most attention concentrated on the isolation of phages that could specifically type all natural isolates of Y. pestis and on the adaptation of phages to grow more efficiently on other enteric bacteria (see, e.g., reference 35). However, it is often difficult to determine from the old literature exactly which phage was being used; "the bacteriophage" is not an uncommon description. In addition, the specific protocols for phage propagation and its use by different workers are not always clearly described. In some cases descriptions of phage growth are accompanied by efficiencies of plating and plaque morphologies determined by using soft agar overlays, whereas in others phage growth is scored by the routine test dilution method (1), as plaques on the surface of agar plates, or even by simple spot tests (which measure only cell killing and not phage growth).
History of bacteriophage A1122.
|
|---|
|
|
|---|
A paper strip test containing phage was subsequently developed by Cavanaugh and Quan as a test for the rapid identification of Y. pestis (10). Curiously, given the importance of that communication, the authors referred to this diagnostic agent only as "the bacteriophage." The uncertain provenance of the Cavanaugh and Quan phage, which was said to be isolated from sewage at the Hooper Foundation, University of California, Berkeley, was presented by Molnar and Lawton (41). Those authors arbitrarily named the Cavanaugh and Quan phage "phage H." Although phage H had properties similar to those of phage P (25), which was being used at the Hooper Foundation at the same time that phage H was isolated, there was no proof that the two phages were identical. Like phage P, phage H is used at 20°C for diagnosing Y. pestis (10, 26) and was used by the U.S. Army Medical Research Institute of Infectious Diseases as their standard phage for identifying Y. pestis (see, e.g., reference 26). In recent years, the U.S. Army Medical Research Institute of Infectious Diseases has been using the
A1122 stock of the Centers for Disease Control and Prevention (CDC) for lysis of Y. pestis cultures (Pat Worsham, personal communication). The stock of phage whose genome sequence is presented here also originated from the Hooper Foundation. It was transferred to the Vector-Borne Diseases Division of the CDC in Fort Collins, Colo., in 1968. The documentation that accompanied this stock strongly implies, although it does not explicitly state, that it was the Gunnison P phage. Because of its uncertain history, we have renamed the phage
A1122, reflecting the host Y. pestis strain used for routine propagation. Strain A1122 is used by the CDC as the reference Y. pestis strain (12). Lysates derived from the original Hooper Foundation stock have been in continual use and widely distributed since 1968 by the CDC as one of the diagnostic tests for the identification of Y. pestis.
A1122 grows on all but two natural isolates of Y. pestis of the thousands tested by the CDC (M. C. Chu, unpublished data). The former Soviet Union (now Russian Federation and Central Asian Republics) antiplague networks similarly use the T7-related Pokrovskaya phage and the P2-related L-413 C phage for diagnostic purposes.
The
A1122 genome sequence determined as part of this study shows that
A1122 is very closely related to T7, although a significant portion of the genome is almost identical to the equivalent part of coliphage T3. Analysis of the sequence data suggests that T3, one of the classic set of T phages described by Demerec and Fano (14), was derived from a recombination event involving two different yersiniophages.
| MATERIALS AND METHODS |
|---|
|
|
|---|
A1122 is the standard phage used by the CDC in identifying wild or clinical isolates of Y. pestis. T7 and T3 are the laboratory stocks of I. J. Molineux. Y. pestis A1122, the CDC laboratory standard avirulent, Y. pestis bv. orientalis strain lacking the 70-kb pCad virulence plasmid, was used for
A1122 propagation. Host range mutants of
A1122 were selected on strain IJ511 (E. coli K-12
lacX74 supE44 galK2 galT22 mcrA rfbD1 mcrB1 hsdS3) or BL21 (E. coli B hsdS Gal-). DH5
and DH5
/F' lacIq, used for phage library propagation, were purchased from (Gibco-BRL Life Technologies). Other bacterial strains used were from the collection of I. J. Molineux and are described in Table 2. The plasmid pAR3685 (a gift from A. H. Rosenberg and F. W. Studier, Brookhaven National Laboratories) contains T7
17 and gene 17 cloned at the BamHI site of pBR322 in the opposite orientation to Ptet.
|
Phage DNA library and DNA sequencing.
For genome sequence determination,
A1122 was grown on A1122 by using a single plaque derived from the original stock provided to the CDC. A large-scale lysate was clarified by centrifugation and then concentrated by precipitation with 7.5% polyethylene glycol 8000 (Sigma). The pellet was resuspended and purified by CsCl density gradient centrifugation. DNA was extracted with phenol-chloroform (24:1, vol/vol) and precipitated with ethanol.
Phage DNA was subjected to hydroshearing (Gene Machines; Genomics Instrumentation Services, Inc., San Carlos, Calif.) to yield 0.5- to 1 kb fragments, which were cloned into dephosphorylated, SmaI-digested pUC18 and M13mp18 vectors. A 5-times-redundant M13 library (four 96-well plates) and a
12-times-redundant pUC-based library (five 96-well plates) were constructed. The majority of sequencing used Big Dye chemistry (PE Applied Biosystems), ABI PRISM 373 or 377 sequencers, and primers -21m13 and m13rp1 (Operon Technologies, Inc.). Approximately 1,300 reads were sequenced and assembled, resulting in a final coverage of about 16x. Base calling and assembly of sequences used Phred/Phrap (20, 21) and Consed (24). Ambiguities were resolved and the genome ends were determined directly from phage genomic DNA by cycle sequencing with appropriately designed primers.
Sequence annotation.
The final sequence was searched against the current protein and nucleotide databases (http://www.ncbi.nlm.nih.gov/) by using BLAST (3). Alignment of open reading frames to T7, T3, and
YeO3-12 sequences was done with Lasergene (DNAstar, Madison, Wis.), GeneWorks 2.45 (IntelliGenetics Inc., Mountain View, Calif.), the Artemis Comparison Tool (http://www.sanger.ac.uk/Software/Artemis/), and Alion:pairwise alignment (http://motif.stanford.edu/alion/). Computer-based predictions were then evaluated manually.
Nucleotide sequence accession number.
Nucleotide sequence data for
A1122 have been deposited in GenBank under accession no. AY247822.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
A1122 genome consists of 37,555 bp of DNA that includes direct terminal repeats of 148 bp. A cursory inspection of the sequence reveals that the phage belongs to the T7 family and that the genome is colinear with those of T7, T3, and the Yersinia enterocolitica phage
YeO3-12. The G+C content of the
A1122 genome is 48.3%, and that of its host Y. pestis is 47.6% (15, 51). For comparison, the G+C contents of the coliphages T7 and T3 are 48.4 and 49.9%, respectively, while that of E. coli K-12 is 50.8%. In total, 51 gene products are predicted in the
A1122 genome. The genome contains 46 distinct open reading frames plus three predicted overlapping genes in a second reading frame (Table 1; Fig. 1). In addition, and as in the T7 genome sequence, an internal initiation site provides the C-terminal, helicase-only (gp4B) portion of the
A1122 gene 4 helicase-primase, and a translational -1 frameshift near the end of the coding sequence for the major capsid protein gp10A allows synthesis of the minor form gp10B. The known regulatory nucleotide sequences allowing synthesis of T7 gp4B and gp10B are totally conserved with those in the
A1122 genome, and it can thus be assumed that the latter phage also makes two products from both genes 4 and 10. In contrast, a homolog to the putative T7 gene 4.1, which overlaps gene 4, is unlikely to be expressed in
A1122, as the open reading frame lacks a convincing ribosome-binding site. Neither T3 nor
YeO3-12 is thought to code for gene 4.1, and it may not be a real gene in the T7 phage family.
A1122 also lacks both an appropriate Shine-Dalgarno sequence and a start codon for a homolog to T7 gene 4.2, another putative gene that overlaps gene 4. It is unlikely that
A1122 gene 4.2 is expressed. However, homologs of T7 gene 4.2 may be expressed in T3 and in
YeO3-12.
|
|
A1122 genome is
2.5 kb shorter than that of T7; the major differences in genetic content are due to a 1.75-kb deletion that fuses the counterparts of T7 genes 0.3 and 0.7 and to the lack of genes 1.4, 2.8, and 5.3 and the first 120 codons of gene 1.7. Partially countering these deletions (still relative to T7) are two genes, 1.75 and 3.3, that have no positional T7 counterparts. Small deletions also affect the
A1122 terminal repeats and genes 1.3 and 7.3. However, genes 1.3 and 7.3 should both retain activity; the host range mutants of
A1122 described below plate on E. coli strains deficient in DNA ligase, which renders the phage ligase essential (Table 2). Furthermore, the T7 gene 7.3 protein is an essential component of the phage particle (P. Kemp and I. J. Molineux, unpublished data) and likely has a similar role in
A1122. Table 1 summarizes the regulatory and coding elements of the
A1122 genome. Aside from the putative
A1122 gene products that have no counterparts in the T7 genome, the predicted sequences of most other proteins exhibit >80% identity with the homologous T7 protein. Three, gp1.5, gp3.8, and gp4.3 (all of unknown function), are 100% conserved. In contrast, gp5.5 retains 46% identity, and gp1.7, which is present only in a truncated form, retains only 44%. Although the similarity to most T3 proteins is generally much lower than that to T7 proteins, four
A1122 proteins (gp5.7, gp17.5, gp18, and gp18.5) are 100% identical in sequence to their T3 counterparts.
Regulatory elements.
The
A1122 genome contains the same three major early promoters for E. coli RNA polymerase (RNAP) as T7 (Table 1); the minor C promoter and the leftward-facing A0 (or D) promoter are also present. The minor B promoter of T7 is missing as a consequence of the fusion of
A1122 genes 0.3 and 0.7. The 17 promoters for
A1122 RNAP are found in positions equivalent to those in T7. Several are identical in sequence to their positional T7 counterparts, but the two promoters suggested to have a role in DNA replication,
OL and
OR, are predicted to have significantly different activities. Unlike that of T7,
A1122
OL contains the consensus nucleotide -11G in the polymerase recognition domain and should thus be a stronger promoter (31) than T7
OL. In contrast,
A1122
OR is not the typical strong promoter found in the class III region of the T7 genome.
A1122
OR contains a -5G change from the consensus, one that is also found in the weak class II promoter
1.3. Whether
A1122
OR is also weak in vivo has not been determined experimentally, but in vitro the -5G change from the consensus reduces T7 promoter activity threefold (31).
The early transcriptional terminator TE, which defines the end of the region transcribed by the host RNAP, and the two phage polymerase terminators T
and CJ are all highly conserved in the T7 and
A1122 genomes. All are likely to function in vivo at an efficiency in
A1122 comparable to that during a T7 infection. Most of the RNase III recognition sites found in the T7 genome are also highly conserved in
A1122. No counterpart to T7 R0.5 is present because of the deletion that fuses
A1122 genes 0.3 and 0.7 but R0.3 and R1.1 are identical in the two genomes, and R18.5 is identical to T3 R18.5. In addition, relative to their T7 counterparts, R1 and R3.8 have only a single change in the top loop of the predicted imperfect hairpin that characterizes an RNase III recognition site. There are also minor differences between the R4.7 elements of the two genomes; only R1.3, R6.5, and R13 warrant additional mention.
Alone among the RNase III recognition structures, T7 R1.3 contains two sites, separated by 29 bases, for cleavage by the enzyme (56). Both sites are used in vivo; the downstream site produces a longer gene 1.1-1.2 RNA whose 3' end can base pair to the ribosome-binding region of gene 1.1 and thereby retroregulate translation (59). Although
A1122 R1.3 has a significantly shorter bottom duplex stem than T7 R1.3, the predicted overall structure of the enzyme recognition site suggests that both cleavage sites are retained (Fig. 2). Furthermore, retroregulation of gene 1.1 expression is likely conserved, as the 3' end of the longer 1.1 mRNA retains base-pairing properties with the 1.1 ribosome-binding site.
|
A1122 R6.5 differs from its T7 counterpart at two positions; it lacks a base in the 5' part of the upper duplex stem, creating an additional bulge loop, and it also contains the more canonical sequence CUUUAU|G for the predicted site of cleavage by RNase III. The predicted structure of
A1122 R13 differs from that of T7 in that it can be drawn as a perfect stem-loop (albeit with a large loop) rather than the bulged structures common to all T7 and other predicted
A1122 RNase III sites. RNase III cleaves T7 R6.5 efficiently but is much less effective at cleaving R13 (18), but any effects on cleavage by RNase III caused by the sequence differences between R6.5 and R13 of T7 and
A1122 cannot be predicted without experimental data. However, it should be noted that T7 grows essentially normally in E. coli strains that lack RNase III (17) and that E. coli RNase III is 92% identical to the Y. pestis enzyme.
A1122 host range mutants plate with normal efficiency in E. coli rnc mutants (Molineux, unpublished observations), and thus altered cleavage patterns between T7 and
A1122 at R6.5 and R13 are unlikely to be highly significant in vivo.
Genetic elements known or suggested to be involved in DNA replication include the terminal repeats TR, which, aside from a 13-bp deletion, are 95% identical to T7; the hairpin structure proposed to enable duplication of TR during packaging (13) is completely conserved. The primary origin of replication ori is identical in sequence to T7 ori, and both SRL and SRR, the arrays of imperfect repeats adjacent to each TR that contain the heptamer sequence CCTAAAG (18), are also conserved in the
A1122 genome.
Except for gene 13, all predicted
A1122 genes have easily recognizable Shine-Dalgarno sequences that at a minimum are GAGG or GGAG, or, for gene 1.8, AGGA (Table 1). Twenty-nine of the 51 predicted genes can make six base pairs with the 3' end of the 16S rRNA. However, only three consecutive base pairs can form between the gene 13 RNA and the 16S rRNA (Table 1; Fig. 2). An alignment of the
A1122 and T7 sequences 100 nucleotides upstream and downstream of the Shine-Dalgarno sequence revealed only four differences. Two are noncoding and do not affect any known regulatory element, and one is the change in R13 described above; the fourth is in the Shine-Dalgarno sequence itself. In T7 this sequence is the normal GGAGG, but in
A1122 it has changed to GGATG. The presence of the T residue has been confirmed by sequencing directly from the phage genome. T7 gene 13 is essential, and it is expected that
A1122 gene 13 is also essential; presumably sufficient gp13 is made to support phage growth, despite the poor Shine-Dalgarno sequence.
Early proteins.
The 0.3-0.7 fusion protein is active in preventing restriction by type I restriction enzymes, even though the annotations of the Y. pestis genome sequences (15, 51) do not refer to a type I restriction enzyme. However, the fusion does not have the 0.7 activity necessary to grow on the rpoC mutant strain BR3 (7, 45, 63). Host range mutants of
A1122 grow in E. coli containing EcoKI or EcoBI but not when RNAP contains the ß'E2258K change present in BR3 (Table 2). These properties of the phage are not surprising; it is known that only the first 94 residues of T7 gp0.3 are necessary for antirestriction activity (5), and the 0.3-0.7 fusion protein contains the N-terminal 108 amino acids of gp0.3. However, there are only 35 residues that correspond to the C terminus of the 359-residue T7 gp0.7. T7 gene 0.7 is essential for growth on BR3 (63). Inspection of the nucleotide sequences on each side of the
A1122 fusion and T7 DNA shows that recombination between repeated GAGGAA sequences at T7 positions 1243 and 2996 would result in the synthesis of a gp0.3-0.7 fusion protein exactly equivalent to that encoded by
A1122.
Deletions inactivating 0.7 function are commonly found in T7 strains that have been passaged extensively under laboratory conditions (64). Interestingly, the apparent deletion in
A1122 DNA has a 5' flanking sequence similar to those of the T7 mutants H1 and H1S (16, 63, 66), which are the most frequently isolated deletions affecting gene 0.7. The T7 gene 0.7 mutants H1 and H1S were both isolated on the type I restriction-positive strain E. coli B and recognized by their failure to grow on strain BR3 (63). Host range mutants of
A1122 have the same properties (Table 2).
The
A1122 gene 1 product is 98% identical to T7 RNAP (Table 1). The
A1122 RNAP contains asparagine at position 748, which in T7 RNAP is responsible for differentiating between T7 and T3 promoters (55). As expected, therefore, all
A1122 phage promoters resemble their T7 counterparts.
A1122 gp1.2 is 88% identical to its T7 homolog and contains a similar anti-dGTPase activity, as judged by the plating efficiency of
A1122 (Table 2) on strains containing the optA1 mutation (which increases host dGTPase levels), rendering 1.2 essential (58).
T7 gene 1.2 is toxic in cells that express the F plasmid gene pifA, and a 1.2 mutation is necessary for phage growth in the presence of F. The 1.2Y28N change not only abolishes toxicity but is one of three mutations (the other two affect gene 10) necessary for phage growth in the presence of F (40, 61).
A1122 gp1.2 contains glutamine at position 28. Consistent with the idea that tyrosine 28 in T7 gp1.2 is partially responsible for exclusion by F and that changes at that site help avoid exclusion, the relative plating efficiency of
A1122 host range mutants on isogenic E. coli strains with and without F is comparable to that of T7 1.2Y28N and significantly higher than that of wild-type T7 (Table 2).
Nonessential late proteins.
A1122 gp1.7 contains only 76 residues, about 100 fewer than its T7 or T3 counterparts. However, the predicted amino acid sequence of
A1122 gp1.7 is clearly related to the C-terminal residues of T7 and T3 gp1.7. The latter proteins also differ significantly in length, unlike most counterpart proteins from close T7 relatives. The function of gp1.7 is unknown, although a T7 1.7 mutant has a reduced burst size (19).
Positional counterparts to T7 genes 2.8 and 5.3 are missing in the
A1122 genome. Conversely,
A1122 contains two genes, 1.75 and 3.3, that are not present in T7. Gene 1.75 overlaps the 3' end of gene 1.7 by 20 bp and overlaps the 5' end of the putative gene 1.8 by 50 bp. The putative gene 1.75 codes for an HNH endonuclease family protein commonly found in this group of phage genomes but displays no sequence similarity to any thus far described. The C-terminal two-thirds of gp1.75 is most closely related to gp179 of the mycobacteriophage Bxz1 and to an HNH endonuclease family protein in the Streptococcus agalactiae putative prophage
Sa2. The predicted gene 3.3 product is yet another member of the HNH endonuclease family and, interestingly, shows sequence similarity to gp7.7 of both
A1122 and T7, to T7 gp2.8, and to gp5.3 of both T3 and
YeO3-12. Thus, the
A1122 genome contains two variants (gp3.3 and gp7.7) of the same group of HNH endonucleases. All of these putative endonucleases are likely expressed during phage infection, but it is not clear whether they play an important role in phage development. None of the counterpart T7 proteins is essential for growth, and no phenotype has yet been ascribed to their loss in mutant phages.
T7 gene 5.5 is normally nonessential, although it is required for growth on
rex+ lysogens (36, 65). Although both the Y. pestis CO92 and KIM10+ genomes contain lambdoid and other prophage sequences (51, 15), annotations have not identified homologs to
RexA or RexB. T7 gp5.5 also binds to the host DNA-binding protein H-NS, and an E. coli hns mutant that makes 5.5 essential has been isolated (37). Y. pestis contains hns, with the protein being about 85% identical to E. coli H-NS.
A1122 gp5.5 is 46 and 50% identical to, respectively, gp5.5 of T7 and
YeO3-12, but is 90% identical to T3 gp5.5. It is likely that the
A1122 and T3 genes 5.5 have the same activity, but whether T7 or
YeO3-12 gene 5.5 is directly equivalent is much less certain.
The
A1122 gene 3.5 product is 93% identical to its T7 counterpart, usually called T7 lysozyme. T7 lysozyme has two major functions: (i) inhibitingT7 RNAP activity from weaker promoters and increasing termination at the CJ terminator and (ii) facilitating lysis and release of progeny phage (38, 71, 73). The N-terminal residues of lysozyme that interact with RNAP (11) are fully conserved between
A1122 and T7, and as
A1122 RNAP is 98% identical to the T7 enzyme, it is probable that
A1122 gp3.5 has the same effects as T7 lysozyme on transcription during phage infection. Amino acid residues important for the amidase activity of T7 lysozyme are also conserved, and the
A1122 protein therefore should also contain this activity. Access of lysozyme to the cell wall requires the holin gp17.5; the
A1122 protein is 100 and 94% identical, respectively, to the T3 and T7 holins.
A1122 contains homologs to the T7 overlapping genes 18.7, 19.2, and 19.3. Of these, gene 18.7 has an activity predicted to function during cell lysis when divalent cations are present (9). No function has been predicted for the other two genes, although T7 gene 19.3 is thought to be expressed (18).
Essential late proteins.
The
A1122 essential protein that is least similar (76% identity) to its T7 counterpart is the tail protein gp7.3. With only 79 amino acids, the
A1122 protein is also considerably shorter than that in T7. T7 gene 7.3 is essential (Kemp and Molineux, unpublished data), although its function has not yet been clearly defined. The sequences of T3 and
YeO3-12 gp7.3 have also diverged from that of T7 gp7.3, sharing only 57 and 66% amino acid identity, respectively. However, the T7, T3, and
YeO3-12 proteins are comparable in length (106, 106, and 99 residues, respectively) The C-terminal 49 residues of T7 gp7.3 are identical to those of
A1122 gp7.3. Similarly, the C-terminal 33 residues of T3 and
YeO3-12 gp7.3 are 94% identical. Presumably it is the C termini of these proteins that contain the activity that renders the protein essential.
All other essential late proteins of T7 are more than 85% identical to their T7 counterparts, and it is thus reasonable to expect that all of the
A1122 homologs have equivalent properties (Table 1). For example, the active-form T7 DNA polymerase is a complex of gp5 and the host thioredoxin (39).
A1122 gp5 is 98% identical to T7 gp5; not surprisingly, therefore,
A1122 host range mutants fail to grow in E. coli lacking thioredoxin (data not shown). In addition, the
A1122 major capsid protein gp10 is 94% identical to T7 gp10, and in particular, residue 61 of the mature form of both proteins is glutamine. This residue can cause the abortive infection of T7 in Shigella sonnei D2 371-48 (29, 53). As expected, therefore,
A1122 host range mutants do not form plaques on S. sonnei D2 371-48 (Table 2), although they do adsorb (Kemp and Molineux, unpublished observations). The efficiency of plating is, however, substantially lower than that of T7, suggesting that
A1122 may exhibit an additional growth defect in S. sonnei D2 371-48. However, we have no information on what that second defect may be. The abortive infection of T7 is due to endonucleolytic breakdown of the phage genome (27), a consequence of capsid protein synthesis in the infected cell (6). T7 DNA ligase is unable to prevent or repair the damage to the genome, although T3 DNA ligase possesses that capacity. One difference between T3 and T7 gp1.3 is a stretch of 19 amino acids that is missing in the T3 protein. However,
A1122 gp1.3 also contains a comparable "deletion"; thus, this region of nonhomology is unlikely to account for the difference in activity between the T3 and T7 enzymes in S. sonnei D2 371-48.
The primary determinant of phage host range is the tail fiber protein gp17. Both T3 and T7 tail fibers adsorb to the bacterial surface lipopolysaccharide, and the close similarity of
A1122 gp17 indicates that this phage also binds the surface lipopolysaccharide of Y. pestis.
A1122 infects only Y. pestis; the relative efficiency of plating on E. coli K-12 or B strains at temperatures of between 25 and 37°C is less than 10-4 (Table 2 and data not shown). An alignment of the gp17 proteins from
A1122 and T3 reveals that the
A1122 and T3 proteins are 98.9% identical; only 6 residues of 558 are different (Fig. 3). T3 does not plate on A1122 at detectable frequencies (Table 2); it does not adsorb efficiently. We have not yet isolated host range mutants of T3 that do grow. This is surprising, since one of the
A1122 mutants described below that grows almost equally well on Y. pestis and E. coli contains a tail fiber that has only five differences from T3 gp17. Perhaps two or more of these five residues must be changed before T3 can adsorb to Y. pestis. In contrast, T7 mutants that plate on Y. pestis arise at a frequency of about 10-6, even though the
A1122 and T7 proteins are only 86.4% identical, and include two short gaps necessary to maintain alignment.
|
A1122.
A1122 plates on the E. coli K-12 strain IJ511 at an efficiency of 5 x 10-5 relative to the Y. pestis strain A1122 (Table 2). Plaques on IJ511 were picked and purified; none had switched host range specificity from Y. pestis to E. coli, and they had instead acquired an expanded host range that allowed growth on both bacterial species. All mutant phages plated on IJ511 at the same efficiency as on A1122. The gene 17 region of several independent mutants was sequenced, and all but one (which contained two changes) were shown to contain a single mutation affecting the amino acid sequence of gp17 (E. Ramanculov, unpublished data). Although mutations were found throughout gene 17, the mutant K18, containing the gp17L523S change, was chosen for further study described above. This change makes the amino acid at position 523 the same as T3 gp17 and is near the C terminus of the protein, which is the region of the tail fiber that interacts with the bacterial cell surface (62).
Host range mutants of
A1122 were also obtained after recombination with the T7 gene 17 plasmid pAR3685. The plasmid was electroporated into Y. pestis A1122 by using a selection for Ampr, and the resulting strain was used as a host for
A1122. The titers of the resulting lysate on A1122 and IJ511 were then determined. The titers were approximately equal, indicating that at least half of the progeny phage had recombined with the multicopy T7 gene 17 plasmid. Such a high frequency of phage-plasmid recombination is not uncommon with T7 and T7 plasmids. Recombinants that grew on E. coli strain IJ511 were purified on the same strain, and all of those analyzed were found to have exchanged the 5' portion of
A1122 gene 17 with T7 DNA from plasmid pAR3685. The overall nucleotide similarity between
A1122 and T7 gene 17 is 85.3%, but the 5' 500 nucleotides are 93.6% identical, whereas the 3' 500 nucleotides are only 73.4% identical. Thus, the pattern of recombinants was not unexpected from nucleotide similarities, but it was surprising that recombination over only the 5' portion of gene 17 could result in a change in phage host range. The N-terminal portion of the T7 gp17 tail fiber is thought to interact with the phage tail (62) and is well conserved in other members of the T7 family with different adsorption specificities (e.g., Y. enterocolitica phage
YeO3-12 [49] and E. coli K1 capsule-specific phage K1F [52]).
Two recombinants from this experiment were selected; these contained the largest and the smallest replacements of
A1122 DNA with that of T7. E11 contains T7 codons 1 through 190 derived from T7; E6 contains only T7 codons 118 through 125. E6 gp17 has only a single amino acid change from the wild type, affecting codon 118; E11 gp17 contains the same change at codon 118 but also contains four other amino acid replacements (Fig. 3). Both E6 and E11 have an expanded host range and form plaques on E. coli at an efficiency comparable to that on Y. pestis A1122 (Table 2). Interestingly, changing part of the N-terminal region of
A1122 gp17 to the same sequence as T7 gp17 results in the same phenotype as the single amino acid change L523S near the C terminus of
A1122 gp17.
Nucleotide sequence similarities between
A1122 and coliphages T7 and T3.
When the sequences of genes that have no positional counterpart (indels) in the other phage are removed, an alignment of the
A1122 and T7 genomes shows that they share about 89% identity; those of
A1122 and T3 share about 73% identity (Fig. 1). The latter value is essentially the same as that for the T3 and T7 genomes, which are about 74% identical (50). Thus, at the level of complete genomes,
A1122 is much more closely related to T7 than it is to T3.
Only two small regions of the T7 genome show less than 80% nucleotide identity with
A1122; one is centered on the normally nonessential gene 5.5 (37, 65), and the other is centered on genes 18 and 18.5/18.7 (Fig. 1). The longest stretch of identity between the
A1122 and T7 genomes is 482 nucleotides (containing parts of genes 7.7 and 8), and there are 5 stretches of >200 identical nucleotides and 30 stretches of >100 identical nucleotides.
A similar comparison between the
A1122 and T3 genomes is, however, considerably more revealing. Although there are only 13 stretches of sequence identity longer than 100 nucleotides across the entire genome, 5 are longer than 900 nucleotides (Table 3). Significantly, these five, plus several shorter stretches, lie in a single contiguous region whose ends lie in genes 15 and 19 (Fig. 1). Almost one-quarter of the
A1122 genome, a stretch of 9,188 nucleotides that codes for about half of the morphogenetic functions, is 99.8% identical between the
A1122 and T3 genomes (
A1122 positions 26066 to 35253; T3 positions 26657 to 35844). This degree of nucleotide conservation between the
A1122 and T3 genomes is quite remarkable. It is most unlikely that primary stocks of
A1122 have been exposed to E. coli since 1943 (33), and they perhaps have not been exposed since the first probable isolation of the phage (2). Wild-type
A1122 does not adsorb efficiently to the E. coli K-12 and B strains that we have used (although Hausmann [28] refers to a coliphage named A1122); conversely, T3, a phage first defined by Demerec and Fano (14), does not adsorb efficiently to Y. pestis A1122 (Ramanculov, unpublished observations). The two phages have thus been isolated in different laboratory collections for more than half a century, and they have been propagated on different hosts and most likely under different environmental conditions. Nevertheless, the two phages have retained an extremely high degree of sequence identity over a quarter of their genomes.
|
A1122 and T3 must be related through a recombination event that included the gene 15-gene 19 region, and it is tempting to consider
A1122 a simple T7-T3 hybrid phage that acquired a host range mutation. However, this is a biased point of view, which arises because T3 has been more intensively studied genetically and biochemically than
A1122 and thus is more easily thought of as a parent phage. It is more logical to propose that
A1122 is a parent and T3 is the recombinant. Supporting this line of reasoning are the observations that the T3
17 promoter has T7 promoter sequence and specificity (57), and T3
17 is not used during infection. In contrast, the
A1122
17 promoter (and all other
A1122 promoters) has the specificity of the
A1122 RNAP, which is presumed to be the same as that of T7. Thus,
A1122
17 is more closely related to the remainder of the
A1122 genome than T3
17 is to the remainder of the T3 genome. It follows, therefore, that
A1122 would be a parent phage and T3 would be a recombinant. The
17 promoter is contained within the gene 15-gene 19 region of
A1122 that we suggest became part of the T3 genome.
Assuming that a single recombination event occurred, the 287-bp region of identity within gene 15 and the 1,351-bp region ending in gene 19 was donated from
A1122 to the genome that became defined as T3. The G+C content of this region is 48.8%, which is significantly less than that of the remainder of the T3 genome (50.3%) and much closer to that of the entire
A1122 genome (48.3%) and of its host Y. pestis (47.6%). Aside from the changes required for a different host range specificity, the majority of the mismatches between T3 and
A1122 in the gene 15-gene 19 region presumably represent adaptation of T3 to growth in E. coli (G+C content, 50.8%).
A1122 may also have been the donor of a second region in the T3 genome. The products of
A1122 genes 5.5 through 5.9 are more similar to T3 than to T7 (Table 1), and a region encompassing the 3' end of gene 5 (DNA polymerase) through gene 6 (exonuclease) has a higher level of sequence identity (Fig. 1). Inspection of the nucleotide sequences within this region reveals a 107-bp stretch of nucleotide identity in gene 5 and a 96-bp stretch within gene 6 (Table 2), which, ignoring the insertion of T3 gene 5.3, provides a 3,156-bp segment that shows 95.0% nucleotide identity. This calculation assumes that T3 gene 5.3 entered the progenitor T3 genome after acquisition of
A1122 DNA. However, it is perhaps more likely that more than one recombination event occurred in this region. A genetic replacement of 1,772 bp may have occurred within DNA polymerase (gene 5), using the 107-bp region of identity (Table 2) and a 77-bp stretch containing one mismatch (
A1122, 14277 to 14353; T3, 14915 to 14991) for recombination. This segment remains 97.1% identical in nucleotide sequence between
A1122 and T3. A second recombination event may have donated an 1,195-bp segment, now 95.1% identical, that spans the 5' end of gene 5.5 through the 5' half of gene 6, using the 94- and 96-bp identical stretches (Table 3) for homology.
Ignoring regions of unequal genetic content, which correspond mainly to putative HNH or other endonucleases, the first 26 kb of the T3 genome, from the genetic left end through the beginning of gene 15, is about 90% identical to that of the Y. enterocolitica phage
YeO3-12. Much of the remaining 12 kb of the T3 genome, from gene 15 through gene 19, was found to be more similar to T7 (50). Consequently it was suggested that T3 may be the product of recombination between a T7-like coliphage and a YeO3-12-like yersiniophage. Indeed, recombination between T7 and the Y. enterocolitica phage
YeO3-12 in vivo was demonstrated experimentally; it allowed the isolation of a coliphage possessing many of the properties of T3 that distinguish it from T7. However, the level of sequence identity (91.0%) between the T3 and T7 genomes in the gene 15-gene 19 region is significantly lower than that between the T3 and
A1122 genomes (98.4%) (also see Fig. 1). This suggests the alternative possibility that T3 is a host range variant of a recombinant between two yersiniophages. One phage is closely related to the Y. enterocolitica phage
YeO3-12, and the second is either the Y. pestis phage
A1122 itself or an extremely close relative (Fig. 4).
|
A1122 and phage H.
A1122 (Gunnison P phage) is used by the CDC as a diagnostic agent for Y. pestis. The U.S. Army still used phage H for similar purposes. H was the name given by Molnar and Lawton (41) to the phage described by Cavanaugh and Quan (10), as there was no evidence that it and the Gunnison P phage were identical. Some properties of
A1122 and of phage H, as described by Cavanugh and Quan (10), are the same. Both are specific for Y. pestis at low temperatures but grow on Y. pseudotuberculosis at 37°C. However, at 37°C phage H attacks all strains of Y. pestis, about half of the Y. pseudotuberculosis strains, and all of the E. coli F- strains tested (41). This host range is different from that of
A1122, which does not plate on E. coli, suggesting that phage H and
A1122 may be different. The host range of H, however, is comparable to those of the
A1122 host range mutants isolated as part of this study. Host range mutants of all members of the T7 phage family are readily isolated. For example, the original isolate of
II was described as specific for E. coli B (44, 69), but many widely distributed variants of
II plate normally on some K-12 strains (8, 42, 43, 64). Likewise it is possible that phage H is merely an expanded host range mutant of the original Gunnison P phage (which was renamed
A1122 for this study).
Heteroduplex mapping with phage H and
II DNAs (7) showed that the phages differ only in a single region, one that by its position likely carries gene 7.7. The positions of nonhomologous regions in heteroduplexes of
II and T7 DNAs (30) are, with one exception, consistent with nonhomologous regions of the
A1122 and T7 genomes as shown here by nucleotide sequence comparison. The one exception is gene 7.7, which is 99.7% identical between
A1122 and T7. Therefore, aside from gene 7.7, the
A1122 and
II genomes are likely to be very similar. Since
II and phage H are so closely related and since
A1122 was in use at the Hooper Foundation when phage H was said to be isolated, it seems likely that
A1122 and phage H were originally the same phage. Now that the genome sequence of
A1122 has been determined, and because this phage retains its absolute specificity for Y. pestis at temperatures below 28°C and for Y. pestis and Y. pseudotuberculosis at 37°C, we suggest that it should be used as the primary diagnostic phage in the identification of Y. pestis both in the United States and elsewhere.
| ACKNOWLEDGMENTS |
|---|
Work at the Lawrence Livermore National Laboratory was performed under the auspices of the U.S. Department of Energy by the University of California, Lawrence Livermore National Laboratory, under contract no. W-7405-Eng-48. Support for Erlan Ramanculov was provided by Biotechnology Enhancement Program Project K584, Department of Health and Human Services. Work at the University of Texas was supported by Public Health Service grant GM32095.
| FOOTNOTES |
|---|
| REFERENCES |
|---|
|
|
|---|
II are nearly identical. J. Virol. 11:306-313.
II and between T3 and
II as determined by heteroduplex mapping in the electron microscope. J. Mol. Biol. 77:189-196.[CrossRef][Medline]
II. Biochem. Biophys. Res. Commun. 39:1017-1020.[CrossRef][Medline]
Ye03-12 of Yersinia enterocolitica serotype O:3. J. Bacteriol. 183:1928-1937.