Previous Article | Next Article ![]()
Journal of Bacteriology, November 2002, p. 6026-6036, Vol. 184, No. 21
0021-9193/02/$04.00+0 DOI: 10.1128/JB.184.21.6026-6036.2002
Copyright © 2002, American Society for Microbiology. All Rights Reserved.
Nestlé Research Center, Nestec Ltd., Vers-chez-les-Blanc, CH-1000 Lausanne 26, Switzerland,1 Department of Microbiology, University College, Cork, Ireland,2 Area de Microbiologia, Universidad de Oviedo, Oviedo, Spain3
Received 20 May 2002/ Accepted 1 August 2002
|
|
|---|
|
|
|---|
Comparative phage genomics opened the perspective of a natural taxonomy of Siphoviridae, i.e., implementation of a phage systematics that reflects (at least partially) the phylogenetic relationship between phages. To investigate this possibility in more detail for Sfi21-like Siphoviridae, we included further Sfi21-like phage genome sequences (S. thermophilus phage 7201 [51], Lactobacillus casei phage A2 [27, 28, 33, 46], and three L. lactis prophages [16]) into the comparative analysis. The analysis corroborated the proposal of a new Sfi21-like phage genus in lactic acid bacteria, yet the trend for a coevolution of phages and their host bacteria was not confirmed. Graded relatedness ranging from DNA, protein, and only functional gene map similarity for the structural genes was even observed for phages infecting a single host species, L. lactis. Comparative genomics of lactococcal phages also demonstrated a dilemma of phage taxonomy. On the basis of DNA homology, temperate lactococcal phages can be regarded as one or four phage species depending on whether the analysis is based on the nonstructural gene cluster or the structural gene cluster, respectively. On the basis of the mode of head morphogenesis, phage genomics can differentiate temperate lactococcal phages into at least two distinct phage genera widely distributed in low-GC-content gram-positive bacteria (Sfi21- and Sfi11-like phages). The reasons for this taxonomic dilemma and the prospect of a phage genomics-based taxonomy are discussed.
|
|
|---|
Phage A2 was propagated on L. casei ATCC 393 growing in MRS supplemented with 10 mM CaCl2 and MgSO4. Phage particles were purified by isopycnic centrifugation on a continuous CsCl gradient. DNase I-generated fragments were cloned in pUC18, and the DNA sequence was determined on both strands by the PCR cycle sequencing method (47). Gaps in the sequence were filled using PCR-amplified segments or direct sequencing using the whole phage DNA as a template.
Table 1 provides the pertinent characteristics of the phages investigated in this comparative study for an easier orientation.
|
View this table: [in a new window] |
TABLE 1. Characterization of the bacteriophages discussed in the current study
|
|
|
|---|
![]() View larger version (50K): [in a new window] |
FIG. 1. Alignment of the genetic maps from the temperate cos site S. thermophilus phage Sfi21, L. casei phage A2, and S. aureus phage PVL, all members of the proposed Sfi21-like genus of Siphoviridae. The ORFs are color coded according to their predicted function deduced from a set of criteria consisting of either their database matches, the synteny argument with phage lambda (18), protein analysis (38), transcription analysis (55), or a mixture of these criteria. Green, predicted DNA packaging and head morphogenesis genes; brown, predicted head-to-tail joining genes; dark blue, tail morphogenesis genes; light blue, tail fiber genes; mauve, lysis genes; black, predicted lysogeny conversion genes; red, lysogeny genes; orange, DNA replication genes; yellow, transcription regulation genes (this region is relatively undefined in A2: ORFs with database match are denoted with a green +, and those lacking a match are denoted with a red -). Selected genes from the A2 phage genome are denoted. Genes encoding proteins that showed significant amino acid sequence similarity are linked by grey shading, and the percentage of amino acid identity is indicated. The phage genomes are depicted following the convention of placing the DNA packaging genes at the left end.
|
Further deviations from the Sfi21 gene map were observed in A2. Of note are an insertion/deletion in the replication module (ORF 36 in Sfi21) and the insertion of the small ORF 21 upstream of the integrase gene in A2. The latter, ORF 21, is interesting in the controversy about the location of the excisionase gene in dairy phages, since in two lactococcal phages the excisionase gene was located one or two ORFs downstream of the cro-like gene (8; D. van Sinderen, unpublished data). However, no excisionase activity could be associated with this gene in vector integration experiments (4), and the location of the A2 excisionase is still unknown. Downstream of the replication module, two endonuclease genes (ORFs 48 et 57) are notable in A2. Related endonucleases are intron-associated in a number of phage systems (24). Also interesting is a methyltransferase gene in A2. Related methyltransferases were observed in Lactobacillus phage phig1e and Listeria phage A118, and they might be involved in the modification of phage and host DNA, but to our knowledge they are not involved in restriction-modification systems.
Also notable is an A2 gene (ORF 22) with links to the pathogenicity island of Clostridium difficile encoding toxin A. This gene is transcribed in the A2 prophage (33). Interestingly, the A2 gene is found at a position occupied by the superinfection exclusion gene (sie) in Sfi21. The sie gene is actually the most prominent transcript from the Sfi21 prophage (54). A sie function is not excluded for ORF 22 by the Clostridium link, since the predicted protein showed two membrane-spanning domains, which concurs with the data reported for the sie functions in lactococcal phages encoded at a corresponding genome position (44). ORF 19 without a database match is located between phage lysin and integrase genes. This region is occupied by toxin genes (ORF 27 and ORF 28) in phage PVL (32), and it is also one of the few transcribed regions in prophages from lactic streptococci (54). A2 ORFs 19 and 22 are, therefore, candidates for lysogenic conversion genes. While this position encodes superantigens, mitogenic factors, and toxins in pathogenic streptococci (S. pyogenes, Streptococcus equi) (21), no lysogenic phenotype has ever been demonstrated or searched for in dairy phages.
S. thermophilus phage 7201 shows graded relatedness with Sfi21-like streptococcal and lactococcal phages. S. thermophilus phage 7201 is not inhibited by the Sfi21-type origin of phage replication (51). Since the DNA replication module is highly conserved in S. thermophilus phages (19), a distinct genome organization was suspected for phage 7201. The sequencing of the 35,466-bp-long phage 7201 genome revealed a distinct replication module in an otherwise standard Sfi21-like phage genome. The predicted genetic map is depicted in Fig. 2 and is compared to that of cos site virulent S. thermophilus phage Sfi19, an established member of the Sfi21 phage group (41). Note that the two virulent streptococcal phages are depicted in a hypothetical prophage configuration to facilitate the comparison with the L. lactis prophage bIL286 (top of Fig. 2). The database matches with phage DT1 and an S. thermophilus prophage remnant (54) identified the first three 7201 genes (ORFs 1, 2, and 3) as the lysogeny replacement module (Table 2). Over the next eight predicted genes (ORF 4 to 10), phage 7201 shared no similarity with any described S. thermophilus phages. However, with the exception of ORF 6a and 6b, the products of a DNA duplication event, all predicted proteins matched likely DNA replication proteins identified on plasmids or phages and prophages from pathogenic streptococci (51).
![]() View larger version (41K): [in a new window] |
FIG. 2. Alignment of the genetic maps from the L. lactis prophage bIL286 with the virulent S. thermophilus phages 7201 and SfiI9. The open reading frames are color coded according to their predicted function as in Fig. 1. Selected genes or genomic features are denoted. Genes encoding proteins that showed significant amino acid sequence similarity are linked by red shading, and the percentage of amino acid identity is indicated. The degree of amino acid identity (>90, >80, >70, and <70%) is reflected in the color intensity of the red shading. Note that the genomes of the virulent phages were rearranged to allow an easier comparison with the prophage bIL286. The natural ends of the streptococcal phage DNA flank the indicated cos sites.
|
|
View this table: [in a new window] |
TABLE 2. Selected database matches for predicted genes from S. thermophilus phage 7201
|
Sequence identity between S. thermophilus phages 7201 and Sfi19 was high and exceeded 80% at the amino acid level for proteins encoded by the head, tail, and part of the tail fiber genes (Fig. 2). At the DNA level, both phages could be aligned with few interruptions over three-quarters of the respective genomes (Fig. 3A). Proteins specified by the entire structural gene cluster of phage 7201 also displayed significant amino acid sequence similarity to the proteins encoded by the equivalent region of the Sfi21-like L. lactis prophage bIL286. However, the level of protein sequence identity between 7201 and bIL286 was lower than that between any two S. thermophilus phages belonging to the Sfi21 group. DNA sequence similarity between 7201 and bIL286 was limited to one DNA replication gene, several head genes, the genes involved in DNA packaging, the major tail gene, and two tail fiber genes (Fig. 3B). Due to the high level of sequence identity (85 and 80% base pair and amino acid identity, respectively, between 7201 ORF 36 and bIL286 ORF 55), this tail fiber gene is a candidate for lateral gene transfer between streptococcal and lactococcal phages.
![]() View larger version (22K): [in a new window] |
FIG. 3. Dot plot analysis for the S. thermophilus phage 7201 genome versus the S. thermophilus phage Sfi19 genome (A) and the S. thermophilus phage 7201 genome versus the L. lactis prophage bIL286 genome (B). The comparison window was 50 bp, and the stringency was 30 bp. Genes sharing nucleotide sequence similarity are highlighted in black on the gene maps (see Fig. 2 for an expanded view of the maps) of the phages depicted at the sides of the dot plot.
|
Dot plot matrix for lactococcal phages. In apparent contradiction to the widespread distribution of Sfi21-like phages for S. thermophilus, Sfi21-/BK5-T-like L. lactis phages have only rarely been isolated from the cheese factory. However, Sfi21-like lactococcal phages were detected as prophages during the sequencing of the L. lactis strain IL1403 (16), increasing the number of complete lactococcal phage genomes to 11. It is, therefore, appropriate to study the DNA sequence relationships between all currently available L. lactis phages in the dot plot matrix displayed in Fig. 4. This analysis allows investigation into the genetic differences underlying the lactococcal phage species and genus classification established by phage taxonomists (7). Several observations are noteworthy. The virulent lactococcal phages represent two clearly separated groups with respect to DNA sequence relationships: c2-like phages (an International Committee on Taxonomy in Virology (ICTV)-accepted genus in Siphoviridae [43]) and sk1-like phages. Each group is represented by two sequenced members that share close sequence relationships with each other (c2 and bIL67 on one side and sk1 and bIL170 on the other side) but practically no DNA sequence relationships with any other group of lactococcal phages (Fig. 4). Both c2- and sk1-like phages are abundant for L. lactis, while phages with similar genome organization have, until now, not been isolated from any bacterial species other than L. lactis. Temperate lactococcal phages like r1t (53), Tuc2009 (J. F. M. L. Seegers, M. van de Guchte, M. Creaven, G. F. Fitzgerald, and D. van Sinderen, unpublished data), and TP901-1 (9) (currently classified as a P335 species of lactococcal phages) are also prominent factory isolates in contrast to temperate phages from the lactococcal BK5-T species (42). All temperate lactococcal phages are linked by DNA sequence relationships. This is most evident for the lysogeny, DNA replication, and putative transcriptional regulation genes, which are shared to a large extent by all seven temperate lactococcal phages (Fig. 4, diagonal line in top seven squares of leftmost column). However, with respect to DNA sequence homology over the structural genes, several groups could be differentiated within the temperate lactococcal phages. The first group is represented by BK5-T-like phages. This group contains three sequenced members: phages BK5-T, bIL286, and bIL309. They are clearly members of the proposed Sfi21-like genus of Siphoviridae. The second group is represented by phages Tuc2009 and TP901-1, while a third group is represented by a single phage isolate, r1t (Fig. 4). In the current taxonomic system of lactococcal phages, the latter two phage groups are classified together as P335 species. This classification leads to problems, however. Tuc 2009 and TP901-1 are pac site phages which are members of the recently proposed Sfi11-like genus of Siphoviridae (10, 11), which are as widely distributed in low-GC-content gram-positive bacteria as phages from the proposed Sfi21-like genus. In contrast, phage r1t is a cos site phage with a clearly distinct DNA sequence and genome organization of the structural gene module. Closely and distantly related phages were found in S. pyogenes and mycobacteria, respectively (20, 21), indicating that we might have here a further widely distributed structural gene module. Prophage bIL285 constitutes a fourth DNA homology group for the structural genes in temperate lactococcal phages. Therefore, at least six distinct structural modules can be defined for Siphoviridae from L. lactis phages, and more are likely to exist (e.g., in the 949 lactococcal phage species). What might be the reason for this much broader genetic diversity for phages from L. lactis than for those from S. thermophilus, which showed only two distinct structural modules for both virulent and temperate phages? Since L. lactis phages have been investigated for a longer time and by more laboratories, it might represent an observational bias. We judge this unlikely, since more than 200 S. thermophilus phage isolates have been investigated in several laboratories, revealing just two major phage types (12, 35). We prefer an alternative interpretation. Numerous plasmid-encoded phage inhibition systems have been described for L. lactis, while plasmids are rare for S. thermophilus, and even rarer are S. thermophilus plasmids that encode phage resistance mechanisms (26). Interestingly, one of the rare S. thermophilus plasmids (30) showed sequence similarity to the origin of replication of S. thermophilus phages (25). However, in contrast to the situation for E. coli, where phages can replicate as extrachromosomal elements (phage P1), no good evidence for prophage-like plasmids exists for lactic acid bacteria (15). L. lactis phages are therefore under a much greater selection pressure to diversify within the species or to accept "foreign" phage modules from the environment than S. thermophilus phages. We speculate that the latter process might be favored by the broader ecological range of L. lactis compared to S. thermophilus, which appears to be restricted to milk.
![]() View larger version (35K): [in a new window] |
FIG. 4. Dot plot matrix calculated for the complete genomes of the L. lactis prophages bIL285, bIL286, and bIL309, all from the genome sequence of L. lactis strain IL1403 (13), of the temperate L. lactis phages Tuc2009 (P335 lactococcal species, proposed Sfi11-like genus of pac site Siphoviridae), BK5-T (BK5-T lactococcal species, proposed Sfi21-like genus of cos site Siphoviridae), TP901-1 (P335 lactococcal species, proposed Sfi11-like genus of pac site Siphoviridae), r1t (P335 lactococcal species, new r1t/LC3-like genus (?) of cos site Siphoviridae) and of the virulent L. lactis phages bIL67 and c2 (both c2 lactococcal species, c2-like genus of Siphoviridae) and sk1 and bIL170 (936 species of lactococcal phages, new sk1-like genus (?) of Siphoviridae). The prophages and temperate phages are aligned so that the phage integrase is at the utmost left (top) of each indicated square, followed by the lysogeny and DNA replication modules; in the center are the structural genes (DNA packaging, head, and tail genes); at the right (bottom) of each indicated square are tail fiber and lysis genes. The virulent phages are given with the late structural genes at the right and the early nonstructural genes at the left. The BK5-T and c2 sequences were rearranged or inversed, respectively, with respect to the database entry. The left y axis provides a scale in kilobases; the right y axis and the bottom x axis identify the phage genomes that were compared in the corresponding square. The dot matrix was calculated using Dotter (50). The comparison window was 50 bp, and the stringency was 30 bp.
|
Prophages bIL286 and bIL309 are further examples of Sfi21-like L. lactis phages. Over the entire late gene cluster, the L. lactis prophage bIL286 exhibited a high degree of protein sequence identity with phage BK5-T, an established member of the proposed Sfi21-like Siphoviridae genus (Fig. 5). In particular, the proteins encoded by the DNA packaging, major head and tail, and lysis genes were nearly identical between these two phages. Major differences between the late genes were limited to three regions. The first is the additional ORF 14 in BK5-T, located in a region where phage lambda shows frameshifting between the tail genes G and T. The putative tail fiber genes ORF 54 and 55 from bIL286 have no counterpart in BK5-T, while a gene specifying a closely related protein was detected in the L. lactis prophage bIL309. This is actually the second example where a single ORF in one phage has similarity to multiple ORFs in another phage (see also ORF 35 and 38 from 7201 compared to ORF 1291 in Sfi19) (Fig. 2). The large gene in phage Sfi19 encodes the likely receptor-recognizing tail fiber gene (39) and represents a multidomain protein, which is due to internal collagen-like repeats that are the target of recombination events (23, 39). It cannot be decided whether the fused or split genes represent the ancient gene constellation. A third difference was observed in the lysis module: bIL286 possesses a relatively large anonymous gene upstream of the holin gene (ORF 59), while a similarly sized gene was located downstream of the lysin gene in BK5-T (ORF 30). The latter gene is one of the few transcribed genes of the BK5-T prophage (6) and thus a candidate for a lysogenic conversion gene (54). Over the nonstructural gene region, only isolated genes shared sequence similarity between bIL286 and BK5-T.
![]() View larger version (61K): [in a new window] |
FIG. 5. Alignment of the genetic maps from the L. lactis prophage bIL309 with the L. lactis prophage bIL286 and temperate L. lactis phage BK5-T. The open reading frames are color coded according to their predicted function as in Fig. 1. Selected genes or genomic features are denoted. Genes encoding proteins that showed significant amino acid sequence are linked by red shading, and the percentage of amino acid identity is indicated. The degree of amino acid identity (>90, >80, >70, or <70%) is reflected in the color intensity of the red shading.
|
Prophages bIL285: a distant member of the Sfi21-like phage group. An interesting case is presented by L. lactis prophage bIL285. Despite its lack of any DNA or protein sequence similarity with the structural gene cluster from any other sequenced lactococcal phage, the gene map suggested an Sfi21-like phage (Fig. 6). Diagnostic for this group is the protease/major head gene constellation presented by ORF 43 and ORF 44 in bIL285. Notably, the L. lactis prophage bIL285 and the Listeria innocua prophage 5 are closely related at the protein level for most of the predicted structural proteins. Exceptions were the bIL285 ORF 45 insertion/deletion and ORF 55, a probable tail fiber gene which we predict to be involved in host cell recognition since it is closely related to a corresponding gene in phage Tuc2009 (Fig. 6). The similarity between bIL285 and the Listeria prophage extended to the DNA sequence level (Fig. 7). This is unusual, since up to now DNA sequence similarity between phages from low-GC-content gram-positive bacteria have been limited to phages infecting the two closely related bacterial genera Streptococcus and Lactococcus (20, 21). Listeria, in contrast, is a more distant member of this branch of bacteria. Notably, lactococcal Sfi21-like phages cover genetic relationships ranging from close similarity at the DNA sequence level over relatedness at the protein sequence level to the sharing of a comparable gene map in the absence of any sequence similarity. This observation confirms the previously described graded relationships between phages in low-GC-content gram-positive bacteria and thus provides evidence for shared evolution of these phages that may be deciphered by comparative genomics. Apparently, at some point the Lactococcus, Listeria, and Streptococcus phages have been in the same host (which may be none of the above bacteria) and have shared their DNA. Since sequence relatives of the structural modules identified in all temperate lactococcal phages have also been found in Streptococcus and Listeria phages, it is unlikely that these structural modules have specifically evolved in the L. lactis species.
![]() View larger version (45K): [in a new window] |
FIG. 6. Alignment of the genetic maps from the temperate L. lactis phage Tuc2009 with L. lactis prophage bIL285 and L. innocua prophage 5. The open reading frames are color coded according to their predicted function as in Fig. 1. Selected genes from bIL285 are annotated. Genes encoding proteins that showed significant amino acid sequence similarity are linked by red shading, and the percentage of amino acid identity is indicated. The degree of amino acid identity (>90, >80, >70, or <70%) is reflected in the color intensity of the red shading. The brackets under the bIL285 gene map demarcate the patchwise sequence sharing of bIL285 genes with other lactococcal phages (B, BK5-T, T, TP901-1; 286, bIL286), possibly identifying genes whose proteins have to interact in a common function (see the text).
|
![]() View larger version (24K): [in a new window] |
FIG. 7. Dot plot analysis for the L. innocua prophage 5 (29) genome versus the L. lactis prophage bIL285 genome. The comparison window was 50 bp, and the stringency was 30 bp. Genes sharing nucleotide sequence similarity are highlighted in black on the gene maps (see Fig. 6 for an expanded view of the prophage maps) of the phages depicted at the sides of the dot plot.
|
Outlook for phage taxonomy. The current system from the ICTV classifies bacteriophages according to the type of genetic material (viruses with double- or single-stranded DNA or double- or single-stranded RNA) (43). This classification most likely represents phylogenetical distinct groups of phages. Within the group of bacteriophages with double-stranded DNA genomes a further subdivision is done according to morphology of the phage particle. A major group is the tailed phages of the order Caudovirales (1). This group represents the vast majority of all phages and is so characteristic that it is seen by many phage researchers as a monophyletic group. However, the contentious issue starts when the group of tailed phages is further subdivided into phage families. Currently, this division is mainly based on tail morphology (Myoviridae, Siphoviridae, and Podoviridae; phages with long contractile tails, with long noncontractile tails, and with short or no tails, respectively). Phage genomics has started to undermine this morphological subdivision, e.g., P22-like Podoviridae and lambda-like Siphoviridae share too similar a genome organization to justify their separation into distinct phage families (52).
The current taxonomic system has a hierarchical organization. Under the family classification come different phage genera (e.g., in Siphoviridae there are 6 genera accepted by ICTV: lambda-, T1-, T5-, L5-, c2-, and psiM1-like phages) and then phage species. It seems fair to say that the current phage classification system represents more our limited knowledge on the diversity of phages in nature than their true genetic diversity. Our limited knowledge is apparent at two levels. First, only a few detailed studies have been performed on the ecological complexity of phages in their natural environment (14), and second, we possess only a very small database of complete phage sequences. In addition, most sequences are from just three bacterial host systems: dairy phages, coliphages, and mycobacteriophages (13).
The present work suggests that a new phage genus should be added to the Siphoviridae family: the Sfi21-like phage genus. This group of phages is widely distributed in low-GC-content gram-positive bacteria, and it is characterized by a uniform genome organization (20). Sfi21-like phages infecting bacteria as different as lactic acid bacteria, staphylococci, listeria, bacilli, and clostridia are linked by numerous sequence similarities. The proposal of this new genus thus seems well founded.
However, the presented analysis goes beyond the proposal of a new phage genusin fact, it questions some phage taxonomy principles. One of these principles is the species level of phage taxonomy. For example, the lactococcal phage species P335 (Table 1) and the BK5-T species share DNA sequence identity over the nonstructural genes (Fig. 4), which would justify their unification into one species. At the same time, similarities with streptococcal phages over the structural genes suggest a division of these temperate lactococcal phages into two distinct genera: according to their structural gene module phages, Tuc2009 and BK5-T should be classified into the proposed Sfi11-like and Sfi21-like genera, respectively, of Siphoviridae. We propose to replace the current lactococcal phage taxonomy (31) by one that is based on the genomic analysis of the structural gene cluster (Table 1).
This concurrent lumping and splitting of the same phage set is real and not compatible with a hierarchical taxonomical system. Phage species concepts should be the domain of geneticists and evolutionary biologists but not a prime concern of taxonomists (except for the notorious problem of weird naming of phage isolates). As already predicted by the modular theory of phage evolution (5) and vindicated by phage genomics, phages do not evolve along linear lines of descent (13). This is best illustrated by phages that are hybrids of currently distinguished phage families. For example, Shigella flexneri phage SfV is a lambda-like siphovirus with respect to its head genes but a Mu-like myovirus with respect to its tail genes (2). E. coli phage phi P27 is another example of such a lambda/Mu chimera (48). Comparative genomics has revealed prophages sharing head genes of a lambda-like siphovirus and tail genes of a P2-like myovirus (Xylella fastidiosa prophages XfP1 and XfP2). Comparative phage genomics has just started to delineate the evolutionary history of some individual phage modules, such as the structural gene module in Siphoviridae (11). The peculiar mode of phage evolution precludes depicting even in principle the genetic relationships between phages by a tree-like phylogeny (13). The development of web-like phylogenies will thus become necessary to illustrate evolutionary relationships between phages. However, the current phage taxonomy system is still based in its terminology and conception on Linnean thinking, and any hierarchical taxonomy will be unsuitable for biological systems demonstrating web-like phylogenies.
Some authors have argued that one should make a separation between taxonomy and phylogeny. The former revolves around the naming and does not necessarily have to deal with the phylogeny, although it traditionally does. One can therefore have taxonomy in the absence of phylogeny. Two solutions to phage taxonomy independent of phage phylogeny have been proposed. One is the phage proteomic tree, which classifies phage into taxa based on the overall similarity of about 100 completely sequenced phage genomes (49). The tree is the result of an objective calculation process conducted by a computer program and is applicable to any completely sequenced phage. Interestingly, the resulting taxa still matched relatively well the phage families and phage genera established by phage taxonomists. In another system, phages are classified into domains, divisions, families, and modi. The last are in some ways a shorthand description of the phage. For example, the above-mentioned SfV phage chimera is in this system characterized by the modi (i) HK97-like head genes, (ii) genes for Mu-like contractile tail, and (iii) integrase-mediated temperate phage (34). The usefulness of this approach will depend on the number of modi we will need to describe phages when we are confronting in the near future a thousand sequenced phage genomes. We propose a third approach to taxonomy based on the genomics analysis of a single, but prominent, phage module, which allows a rational hierarchical system (phages linked by different degrees of DNA sequence and then protein sequence identity and then sharing of a gene map in the absence of sequence similarity). There are some arguments in favor of basing a phage taxonomy system on the comparative genomics analysis of the DNA packaging-head gene cluster. The evolutionary analysis of the morphogenesis module is the most advanced of the different phage modules, and the structural gene order is apparently very conserved (11). Alternatively, one could base a phylogeny-inspired taxonomy system on phage tail genes (excluding tail fiber genes). This choice will assure a maximal overlap with the current phage taxonomy, which differentiates the majority of the isolated phages according to phage tail morphology. In fact, the hybrid phage SfV would have been classified as a myovirus by electron microscopy and a Mu-like myovirus by genomic analysis of the tail genes. The real taxonomic value of the different proposals will depend on the diversity of the bacteriophage genomes encountered in the environment. A systematic survey of prophages in genomes from low-GC-content gram-positive bacteria has up to now revealed only rather limited genetic diversity of the phage genomes (H. Brüssow, C. Canchaya, and C. Proux, submitted for publication). With respect to the structural genes, most could be classified as either Sfi21-like or Sfi11-like Siphoviridae, a few were r1t-like Siphoviridae, and further types were up to now limited to a single species (e.g., SPß in Bacillus subtilis). In the sequenced genomes of gram-negative bacteria also, only a few prophage types could be differentiated with respect to the structural genes. Lambda-like, P2-like, P4-like, Mu-like, M13-like phages account for the majority of them (Brüssow et al., submitted). The taxonomy of temperate phages might thus not be a problem; however, only a minority of the phages have adopted the temperate lifestyle. In contrast, the sequencing of virulent phages still surprises us with genomes which have few if any database matches (45). Only the sequencing of more and ecologically representative virulent phages will show us if we reach some saturation of the sequence space or if we are confronted with a bewildering diversity of phage genomes which will complicate all approaches to phage taxonomy.
|
|
|---|
KZ of Pseudomonas aeruginosa. J. Mol. Biol. 17:1-19.
P27 is not related to other STx phage genomes, but the modular genetic structure is conserved. Infect. Immun. 70:1896-1908.
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»