Journal of Bacteriology, May 2004, p. 3097-3107, Vol. 186, No. 10
0021-9193/04/$08.00+0 DOI: 10.1128/JB.186.10.3097-3107.2004
Copyright © 2004, American Society for Microbiology. All Rights Reserved.
Department of Biochemistry, McMaster University, Hamilton, Ontario, Canada L8N 3Z5
Received 5 January 2004/ Accepted 4 February 2004
σ70), a 2-aa insert in seryl-tRNA synthetase (SerRS), a 1-aa insert in ribosomal protein L1, and a 2-aa insert in UvrA homologs. By using PCR primers for conserved regions, fragments of these genes were amplified from a number of Deinococcus-Thermus species, and all such fragments (except SerRS in Deinococcus proteolyticus) were found to contain the indicated signatures. The presence of these signatures in various species from all three known genera within this phylum, viz., Deinococcus, Thermus, and Meiothermus, provides evidence that they are likely distinctive characteristics of the entire phylum which were introduced in a common ancestor of this group. The signature in SerRS, which is absent in D. proteolyticus, was likely introduced after the branching of this species. Phylogenetic studies as well as the nature of the inserts in some of these proteins (viz., σ70 and SerRS) also support a sister group relationship between the Thermus and the Meiothermus genera. The identified signatures provide strong evidence for the monophyletic nature of the Deinococcus-Thermus phylum. These molecular markers should prove very useful in the identification of new species related to this group.
The Deinococcus, Thermus, and Meiothermus genera have been grouped together as a distinct phylum within Bacteria based on their close clustering in 16S rRNA trees, despite morphological and physiological dissimilarity (2, 25, 31, 42, 44). With the rapid increase in sequence database entries, it is becoming increasingly imprecise to assign species to different taxonomic groups based on branching patterns alone (25). Unfortunately, there are presently no other criteria or molecular means by which species belonging to this phylum can be unambiguously distinguished from other bacterial phyla (2, 9, 31, 33). We have described a new approach based on conserved indels (i.e., inserts or deletions) found in different proteins that is helpful in distinguishing the major bacterial phyla and in understanding the interrelationships among them (13-15, 17). Recently, a large number of conserved indels (or signature sequences) which provide distinctive molecular markers for the identification of proteobacteria, chlamydiae, and cyanobacteria have been described (12, 14, 19).
The present communication describes for the first time a number of conserved indels in widely distributed proteins that are distinctive characteristics of the Deinococcus-Thermus phylum. The identified signatures include a 7-amino-acid (aa) insert in Thr-tRNA synthetase (ThrRS), a 5-aa deletion in the signal recognition particle protein Ffh, a 1- and a 3-aa insert in the β' subunit of RNA polymerase (RpoC), a 1-aa insert in the ribosomal protein L1, and 2-aa inserts in major sigma factor 70 (σ70), seryl-tRNA synthetase (SerRS), and UvrA homologs. The sequence information for these proteins was previously available from only a limited number of Deinococcus and Thermus species. As the Meiothermus genus has only recently been established, there is little sequence information currently available for this group (33). We have tested the specificity of the identified signatures by PCR amplifying and sequencing fragments of these genes from additional Deinococcus and Meiothermus species for which no sequence information was available. The presence of these signatures in all of the species examined (with a single exception) provides evidence that they are likely distinctive characteristics of the entire phylum and might be used as molecular markers for this group of species.
PCR amplification and sequencing. Cultures of Meiothermus ruber (ATCC 35948), Meiothermus silvanus (DSMZ 9946), and Deinococcus grandis (DSMZ 3963) were generously supplied by Peter Gogarten and Lorraine Olenzenski (36). Deinococcus proteolyticus (ATCC 35074) high-molecular-weight DNA was prepared as previously described (6, 16). Oligonucleotide primers, in opposite orientations, were designed for conserved regions in the protein sequences that flanked these signatures based on sequence information from available Deinococcus-Thermus and other species. Degeneracy was incorporated into the primers to account for differences in codon usage among different species. The primers were synthesized at the Molecular Biology Central Facility (MOBIX) of McMaster University, Hamilton, Ontario, Canada.
PCRs. PCR was performed in a Techne Techgene thermocycler. The PCRs had a final volume of 10 µl, and all primer sets were optimized for Mg2+ concentration (in the range of 1.5 to 4 mM) for each DNA strain tested. PCR amplification was carried out over 30 cycles (15 s at 94°C, 15 s at 55 or 45°C, 1 min at 72°C) with an initial 1-min hot start at 94°C and a final extension step (15 s at 94°C, 15 s at 55°C, 7 min at 72°C) (12). The reaction mix also contained 2% dimethyl sulfoxide, which improves PCR performance by lowering the melting temperature of DNA. DNA fragments of the expected size were purified from 0.8% (wt/vol) agarose gels (using a GENECLEAN kit) and subcloned into the plasmid pDRIVE by using a TU cloning kit (Invitrogen). Escherichia coli JM109 cells were transformed with the ligated vector and insert, and the inserts from a number of positive clones were sequenced at MOBIX. Sequences of all cloned fragments were run through a BLAST search to ensure that the amplified gene was from a novel source. The primer sequences used for the amplification of different genes are as follows.
(i) σ70. The following primers were successful in amplifying 504-bp inserts from D. grandis, D. proteolyticus, M. silvanus, and M. ruber: forward, 5'-ACNTAYGCNACNTGGTGGAT-3'; reverse, 5'-GRNGCYTTRTTYTCDATYTG-3', where N represents A, G, C, or T; Y is C or T; R is A or G; and D is A, G, or T.
(ii) Threonyl tRNA synthetase. Fragments 432 bp in length were generated from D. grandis, M. silvanus, and M. ruber genomic DNA with the following primers: forward, 5'-TTCCGSCACWCSCTGGSCCACGTCMTG-3'; reverse, 5'-CCNCKCCARTANGCNCC-3', where S represents C or G, W is A or T, K is G or T, and M is A or C.
(iii) Signal recognition particle Ffh. The following primers were used to amplify a 264-bp fragment from D. grandis: forward, 5'-ATHYTNGGNATGGGNGA-3'; reverse, 5'-CKYTCYTTNACNGTCAT-3', where H represents A, C, or T.
(iv) SerRS. Fragments from M. silvanus and D. proteolyticus of 234 bp in length were successfully amplified by using forward primer 5'-CACSARTTYCGYAARGTNGARCAG-3' and reverse primer 5'-CGARCAGGARTGGGTYTCGCGRTC-3'.
(v) RNA polymerase β' subunit RpoC. RpoC gene fragments (645 bp) were amplified by PCR from M. silvanus, M. ruber, D. proteolyticus, and D. grandis by using the following primers: forward, 5'-GAYGGNGGNMGNTTYGC-3'; reverse, 5'-CATYTGRTCNCCRTCRAARTC-3'.
(vi) Ribosomal protein L1. A 510-bp fragment was generated from M. silvanus and D. grandis by using the following primers: forward, 5'-ATGCCTAAGCACGGCAAGCGTTACC-3'; reverse, 5'-CCGGTCTTGTCGTTGCGGAACTC-3'.
(vii) Excinuclease ABC subunit A UvrA. A 639-bp fragment was amplified from M. silvanus by using forward primer 5'-TGGCYTTYGACACCATCTACGCCGAGG-3' and reverse primer 5'-AGGCGAACTTCTCSGAGWACAGCTCCTC-3'.
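The ambiguity codes defined for the primers above determine how many distinct oligonucleotides each degenerate primer actually represents. The following is a minimal Python sketch of that calculation, not part of the original methods: the function names `degeneracy` and `expand` are illustrative assumptions, and the lookup table is the standard IUPAC nucleotide code, of which the text defines only the symbols it uses.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes. The paper defines N, Y, R, D,
# S, W, K, M, and H; B and V are included here only for completeness.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG",
    "N": "ACGT",
}


def degeneracy(primer):
    """Number of distinct sequences represented by a degenerate primer."""
    count = 1
    for base in primer.upper():
        count *= len(IUPAC[base])
    return count


def expand(primer):
    """List every concrete sequence encoded by a (short) degenerate primer."""
    pools = [IUPAC[base] for base in primer.upper()]
    return ["".join(combo) for combo in product(*pools)]


if __name__ == "__main__":
    # Forward and reverse sigma-70 primers as given in the text.
    for name, seq in [
        ("sigma70 forward", "ACNTAYGCNACNTGGTGGAT"),
        ("sigma70 reverse", "GRNGCYTTRTTYTCDATYTG"),
    ]:
        print(name, len(seq), "nt, degeneracy", degeneracy(seq))
```

`expand` grows multiplicatively with each ambiguous position, so it is only practical for short or lightly degenerate primers; `degeneracy` alone is enough to compare primer sets.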
Phylogenetic analysis. Phylogenetic analysis on protein sequences was carried out by procedures described in earlier work (6, 20). A multiple alignment of protein homologs from different groups of bacteria was created by using the ALIGN program. The data for the newly sequenced fragments were added to the alignment, and all sequences were trimmed to the length of the amplified fragments. Phylogenetic analyses were performed in both the presence and the absence of the signature region to determine its influence on the branching pattern. The aligned sequences were used to generate 100 bootstrapped data sets with the SEQBOOT program, and genetic distances were calculated by PROTDIST by using Kimura's method (23). Neighbor-joining trees from these distances were constructed by the NEIGHBOR program (40). A consensus tree for various bootstrapped sequences was obtained by using the CONSENSE program. All of these phylogenetic programs are part of the PHYLIP software package (version 3.5; J. Felsenstein, University of Washington, Seattle, Wash.).
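The two numerical steps named above, column resampling for the bootstrap and Kimura's distance correction, can be illustrated with a short Python sketch. This is not the PHYLIP code and makes several assumptions: the function names are hypothetical, gaps are written as `-` and gapped columns are skipped in pairwise comparisons, and the correction used is the approximation D = -ln(1 - p - 0.2p^2) that the PHYLIP documentation gives for PROTDIST's Kimura option, where p is the fraction of differing residues.

```python
import math
import random


def kimura_protein_distance(seq_a, seq_b):
    """Kimura-corrected distance between two aligned protein sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for a, b in zip(seq_a, seq_b):
        if a == "-" or b == "-":
            continue  # assumption: ignore columns gapped in either sequence
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    p = differing / compared
    inner = 1.0 - p - 0.2 * p * p
    if inner <= 0.0:
        return math.inf  # correction is undefined for very divergent pairs
    return -math.log(inner)


def bootstrap_alignment(alignment, rng):
    """Resample alignment columns with replacement (one bootstrap data set)."""
    names = list(alignment)
    n_cols = len(alignment[names[0]])
    picks = [rng.randrange(n_cols) for _ in range(n_cols)]
    return {name: "".join(alignment[name][i] for i in picks) for name in names}


def bootstrap_replicates(alignment, n_replicates=100, seed=0):
    """Generate n_replicates bootstrapped alignments (100 in the paper)."""
    rng = random.Random(seed)
    return [bootstrap_alignment(alignment, rng) for _ in range(n_replicates)]
```

Each replicate keeps every column intact across all sequences, so the same resampled positions are compared for every pair when the distance matrix for that replicate is built.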
Nucleotide sequence accession numbers. The sequence data for all of the gene fragments cloned and sequenced in this work have been deposited in the GenBank database under accession numbers AY450950, AY452779, AY453862, AY489057, and AY453858 for D. grandis; AY450951 and AY453857 for D. proteolyticus; AY450952, AY452780, AY455864, AY489058, AY489059, and AY452782 for M. silvanus; and AY452778, AY452781, and AY453861 for M. ruber.
In σ70, which plays a central role in the transcription process by conferring promoter specificity to RNA polymerase (5), a 2-aa insert is present in a conserved region in various available Deinococcus-Thermus homologs (viz., Deinococcus radiodurans, Thermus aquaticus, and Thermus thermophilus) but not in any other bacteria. However, variable inserts are present in this region in Mycoplasma species (data not shown), which are likely of independent origin. The specificity of this insert for the Deinococcus-Thermus phylum was tested by PCR amplifying and sequencing fragments of the σ70 gene from four other members belonging to this group for which no sequence information was available. Results of these studies, which are included in Fig. 1, show that all four species tested, which included two Deinococcus (D. grandis and D. proteolyticus) and two Meiothermus (M. ruber and M. silvanus) species, contained the identified signature. The sequence region which flanked the identified insert (Fig. 1, boxed region) was also found to be distinctive for Deinococcus-Thermus-Meiothermus species. Since sequence information for this signature is now available for representatives from all three genera within the Deinococcus-Thermus phylum, the shared presence of this insert in all of them strongly indicates that it is very likely a distinctive characteristic of the entire phylum.
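Testing a newly sequenced fragment for a signature of this kind amounts to checking whether it carries residues in the alignment columns where the insert sits. A minimal Python sketch follows. It assumes an alignment in which `-` denotes a gap (unlike the figures, where dashes denote identity to the top line), the function names are hypothetical, and the toy sequences and column positions are invented for illustration and are not taken from Fig. 1.

```python
def insert_residues(aligned_seq, start, end):
    """Residues (gaps removed) in alignment columns start..end-1."""
    return aligned_seq[start:end].replace("-", "")


def has_insert(aligned_seq, start, end, min_length):
    """True if the sequence carries at least min_length residues in the region."""
    return len(insert_residues(aligned_seq, start, end)) >= min_length


if __name__ == "__main__":
    # HYPOTHETICAL toy alignment: columns 3-4 play the role of a 2-aa insert.
    toy_alignment = {
        "phylum_member_1": "MKLAAGTR",
        "phylum_member_2": "MKLAKGTR",
        "other_bacterium": "MKL--GTR",
    }
    for name, seq in toy_alignment.items():
        print(name, has_insert(seq, 3, 5, min_length=2))
```

In practice the column range would be fixed from a reference alignment such as the one in Fig. 1, and the check is only meaningful when the flanking region is conserved enough for the alignment to be reliable.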
FIG. 1. Partial sequence alignment for σ70 proteins showing a 2-aa insert (boxed area) in a conserved region that is uniquely present in Deinococcus, Thermus, and Meiothermus homologs. Dashes in this and all other alignments indicate identity to the amino acid on the top line (E. coli protein). The position of this sequence region in the E. coli protein is indicated at the top. The accession numbers of different proteins are provided in the second column. Sequence information for only representative species from different bacterial groups is presented. The sequences marked with an asterisk were cloned and sequenced in the present work. Abbreviations for the species names are as follows: A., Agrobacterium; Aqu., Aquifex; Bac., Bacillus; Bact., Bacteroides; Bif., Bifidobacterium; Bor., Borrelia; Buch., Buchnera; C., Caulobacter; Camp., Campylobacter; Cfx., Chloroflexus; Chl., Chlamydia; Chlam., Chlamydophila; Clo., Clostridium; Cor., Corynebacterium; Cyt., Cytophaga; D., Deinococcus; Des., Desulfovibrio; E., Escherichia; Ent., Enterococcus; Fuso., Fusobacterium; Geo., Geobacter; H., Haemophilus; Hel., Helicobacter; Helio., Heliobacillus; L., Lactococcus; Lep., Leptospira; Lis., Listeria; M., Mycoplasma; Mei., Meiothermus; Myc., Mycobacterium; Nei., Neisseria; Nit., Nitrosomonas; Oce., Oceanobacillus; Pas., Pasteurella; Pse., Pseudomonas; Ral., Ralstonia; Rh., Rhodobacter; Rho., Rhodospirillum; R., Rickettsia; Sal., Salmonella; Sta., Staphylococcus; Str., Streptomyces; Strep., Streptococcus; Sy., Synechocystis; Syn., Synechococcus; T., Thermotoga; Thermo., Thermoanaerobacter; The., Thermus; Thermosyn., Thermosynechococcus; Tre., Treponema; Tri., Trichodesmium; Troph., Tropheryma; V., Vibrio; X., Xylella. GNS, green nonsulfur bacteria; Gram(+)ve, gram-positive.
σ70 sequences from different bacteria. For these purposes, 169 aa positions for which σ70 sequence information was available from different species were utilized. The sequence alignment data were used to generate 100 bootstrapped data sets, and a consensus neighbor-joining tree was obtained from these data. At the same time, a neighbor-joining distance tree showing branch lengths, shown in Fig. 2, was also constructed. The bootstrap scores for different nodes which were >50 are marked on this tree. As shown in Fig. 2, most bacterial groups are clearly distinguished from each other in the tree (as shown by their high bootstrap scores), but their branching orders or interrelationships are not resolved, which is a common problem with phylogenetic trees (13, 25). Importantly, in the present context, all of the Deinococcus-Thermus-Meiothermus species formed a well-defined group, branching together 100% of the time. Within this group, different Deinococcus-Thermus genera (viz., Deinococcus, Thermus, and Meiothermus) formed distinct clusters. Of these genera, Deinococcus was found to be the earliest branching lineage, whereas a closer relationship was seen between the Thermus and Meiothermus genera. A similar relationship among these groups is seen in the 16S rRNA trees (2, 39). It is noteworthy that the insert sequence in Deinococcus species consists of two alanine residues, whereas in Thermus and Meiothermus species, the insert sequence is comprised of one alanine and one lysine residue (i.e., AK), again indicating a closer relationship between these two genera. Thus, the inference from signature sequences is in accordance with results from phylogenetic analysis (2, 6, 39, 42). We have also performed phylogenetic analysis on these sequences after omitting the insert region. The tree obtained in this case was very similar to that in Fig. 2 (results not shown), indicating that the observed relationship is not dependent upon or affected by the presence of the insert.
FIG. 2. A neighbor-joining distance tree with branch lengths based on σ70 sequences. The tree is based on 169 aa positions for which sequence information was available for various species. The bootstrap scores (out of 100) of various nodes which were >50 are indicated. The arrow marks the suggested position where the identified insert in this gene was introduced.
FIG. 3. Excerpt from a sequence alignment of threonyl-tRNA synthetase showing a 7-aa insert (boxed areas) that is distinctive of the Deinococcus-Thermus-Meiothermus species. Cb., Chlorobium; Pro., Prochlorococcus. See the legend to Fig. 1 for an explanation of additional abbreviations used.
α, β, and β') are evolutionarily conserved in sequence, structure, and function in all species ranging from bacteria to humans (24, 37). In the β' subunit of RNA polymerase, which is encoded by the rpoC gene, we have identified a 1- and a 3-aa insert in conserved regions that are only present in D. radiodurans, T. aquaticus, and T. thermophilus but are not found in any other bacteria or species (Fig. 4). Further studies on this indel were carried out by cloning and sequencing fragments of the rpoC gene from three other Deinococcus-Thermus species (D. grandis, M. ruber, and M. silvanus). Results of these studies, which are included in Fig. 4, show that both of these inserts were present in all of these species, indicating that they are distinctive characteristics of the Deinococcus-Thermus phylum. Furthermore, as seen in the case of σ70 homologs, the sequence of the 3-aa insert in various Meiothermus and Thermus species (i.e., KDE) was identical and differed from that seen in the Deinococcus species, pointing to a closer relationship between the Meiothermus and Thermus species.
FIG. 4. Partial sequence alignment of RNA polymerase β' subunit (RpoC) showing 1- and 3-aa conserved inserts (boxed area) that are specific for the Deinococcus-Thermus-Meiothermus species. This sequence region is highly divergent in Thermotoga maritima (data not shown); hence, it is difficult to infer the presence or absence of the inserts in this species. See the legends to Fig. 1 and 3 for abbreviations used.
FIG. 5. Sequence alignment of ribosomal L1 protein showing a conserved 1-aa insert (boxed area) that is distinctive of the Deinococcus-Thermus-Meiothermus species. Burk., Burkholderia. See the legends to Fig. 1 and 3 for additional abbreviations used.
FIG. 6. Partial sequence alignment of UvrA protein showing a 2-aa insertion in different Deinococcus-Thermus-Meiothermus homologs (boxed area). The insert seen in B. burgdorferi could either have occurred independently or have been derived by means of LGT. See the legends to Fig. 1 and 3 for abbreviations.
FIG. 7. Partial sequence alignments of Ffh protein showing a 5-aa deletion (boxed area) that is a unique characteristic of the Deinococcus-Thermus-Meiothermus homologs. See the legends to Fig. 1 and 3 for abbreviations.
FIG. 8. Excerpt from SerRS sequence alignment showing a 2-aa insert (boxed area) that is present in various Deinococcus-Thermus-Meiothermus species, except D. proteolyticus. This insert was likely introduced in a common ancestor of this group after the branching of D. proteolyticus. U., Ureaplasma. See the legends to Fig. 1, 3, and 5 for additional abbreviations used.
In the present work, we have identified eight conserved indels in seven widely distributed proteins that are distinctive characteristics of the Deinococcus-Thermus phylum. Based on the work reported here and information available in the databases, information for six of these proteins containing seven signatures (viz., SerRS, ThrRS, σ70, RpoC, UvrA, and ribosomal L1 protein) is available from all three genera within the Deinococcus-Thermus phylum. The sequence information for Ffh/SR54 is currently available from only Deinococcus and Thermus genera, but based on the observation that Meiothermus forms a sister lineage with Thermus species (9, 33, 39), it is expected that this signature will also be found in Meiothermus organisms. Except for the absence of the SerRS insert in D. proteolyticus, the identified signatures are present in all Deinococcus-Thermus species examined but not in other bacteria. These signatures thus provide molecular markers for distinguishing the Deinococcus-Thermus phylum from all other bacteria and for identifying new species related to them based simply on the presence or absence of these signatures. The presence of these distinctive signatures also provides strong evidence for the monophyletic nature of the Deinococcus-Thermus phylum as indicated by 16S rRNA trees (38, 42, 44). The most likely explanation for these signatures is that they were introduced in a common ancestor of this lineage and then were passed on to all descendants. This inference is also supported by phylogenetic analysis based on a number of these proteins. The presence of the insert in SerRS in various Deinococcus-Thermus species, but not D. proteolyticus, might be accounted for by two different possibilities. First, it is possible that this insert was introduced in a common ancestor of the other Deinococcus-Thermus species after the branching of D. proteolyticus. Alternatively, this insert may have been introduced in a common ancestor of the entire phylum but then subsequently lost from D. proteolyticus. We favor the first of these possibilities, based on the observation that in phylogenetic trees derived from 16S rRNA sequences, a branch comprised of D. proteolyticus and D. radiophilus forms the deepest group within the Deinococcus-Thermus phylum (2, 39).
Lateral gene transfer (LGT) is indicated to have played an important role in the evolution of the Deinococcus-Thermus group. These organisms are thought to have received genes from a number of other phyla such as the Archaea, Eucarya, and cyanobacteria (11, 26, 36, 43). However, for the various genes studied in the present work, which contain identified signatures, there is no evidence of lateral gene exchange between the Deinococcus-Thermus group and other bacterial phyla, except possibly the UvrA gene in B. burgdorferi. If these genes were subjects of LGTs, one would expect a more random distribution of these signature sequences in which these indels would have been present in other groups of bacteria and at the same time several Deinococcus-Thermus species would be lacking them, which is clearly not the case here. However, in contrast to these genes, a number of genes studied in earlier work contained signature sequences that were commonly shared by cyanobacteria and the Deinococcus-Thermus species, which may be the result of LGTs (13, 18).
We have also previously described many main-line signatures (i.e., indels commonly shared by a number of different bacterial phyla), which provide useful information concerning the phylogenetic placement of the Deinococcus-Thermus group within the bacterial domain (13, 15, 17). The distribution patterns of these signatures in bacterial sequences indicate that the Deinococcus-Thermus phylum has evolved after the divergence of various gram-positive phyla (viz., Firmicutes, Actinobacteria, Clostridia, and relatives) but before the emergence of Aquifex, Chloroflexi, cyanobacteria, spirochetes, the Chlamydia-Cytophaga-Flavobacteria-Bacteroides-green sulfur bacteria group, and proteobacteria (15, 17). The branching of the Deinococcus-Thermus phylum in between the gram-positive bacteria and gram-negative bacteria also accounts for a hitherto puzzling characteristic of Deinococcus. Although all Deinococcus-Thermus species are surrounded by an outer membrane, which is a distinguishing property of the gram-negative bacteria, most species belonging to the genus Deinococcus (all except D. grandis) exhibit positive Gram staining and contain a thick sacculus characteristic of gram-positive bacteria (2, 30, 31, 41). These seemingly contradictory properties are readily explained by the suggested placement of the Deinococcus-Thermus phylum between the gram-positive bacteria (monoderm bacteria surrounded by a single membrane) and gram-negative bacteria (diderm bacteria bound by both inner and outer membranes), and they indicate that this group of species may represent evolutionary intermediates in the transition between these two structurally distinct groups of bacteria (13).
We are thankful to Peter Gogarten and Lorraine Olenzenski for providing us bacterial strains and Melanine Havers for assistance in the creation of some of the signature sequence files.