Previous Article | Next Article ![]()
Journal of Bacteriology, September 2004, p. 6316-6319, Vol. 186, No. 18
0021-9193/04/$08.00+0 DOI: 10.1128/JB.186.18.6316-6319.2004
Copyright © 2004, American Society for Microbiology. All Rights Reserved.
Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia, Canada
Received 8 April 2004/ Accepted 25 May 2004
|
|
|---|
|
|
|---|
Inteins and introns are rare and have not been found previously with ThyX. An intein is a protein intervening sequence that can self-excise and concomitantly splice together its flanking extein sequences (21). Many inteins also harbor an endonuclease domain that initiates intein homing, which confers genetic mobility on the intein (3, 5) and explains its sporadic phylogenetic distribution (20). Group II introns are another type of self-splicing mobile element, and most bacterial group II introns encode a reverse transcriptase-like (RTL) protein (23) that assists intron splicing and mobility, including retrotransposition (2, 11, 12). Group II introns are believed to be evolutionary ancestors of nuclear spliceosomal introns (4), but they are strongly excluded from conserved protein-coding genes in bacteria (6-8), although some bacteriophage genes encode both inteins and group I introns (9, 13, 14). Surprisingly, a bacterial ribonucleotide reductase (RIR)-encoding gene was found recently to encode multiple inteins and group II introns (18). To explore and understand this new phenomenon, we searched for similar inteins and introns in related genes.
An intein- and intron-encoding thyX gene was found during a BLAST search (1) of the GenBank database. This gene is from the oceanic N2-fixing cyanobacterium Trichodesmium erythraeum. As illustrated in Fig. 1, the three exon and extein coding sequences are 196, 80, and 444 bp long, respectively, and they together predicted a 240-amino-acid ThyX sequence that is very similar to known ThyX sequences. The intron was identified by its strong sequence similarity to known group II introns in this organism, which included the T.er.I4 intron in an RIR-encoding gene (18) and the Tr.e.I1 intron in an intergenic sequence (6). It is more than 80% identical to the other introns in the
680-nucleotide folded region (data not shown), although it lacks the RTL coding sequence present in domain IV of the other introns.
![]() View larger version (23K): [in a new window] |
FIG. 1. Illustration of the T. erythraeum thyX gene and predicted expression products. Black boxes represent the three exons and exteins. aa, amino acids; nt, nucleotides.
|
![]() View larger version (24K): [in a new window] |
FIG. 2. Intein sequence comparison. The T. erythraeum ThyX intein sequence is aligned with the sequence of the previously identified Synechocystis sp. strain PCC6803 DnaB intein, and putative intein sequence motifs (A through H) are underlined. Dashes represent gaps introduced to optimize the alignment.
|
![]() View larger version (51K): [in a new window] |
FIG. 3. Protein splicing of T. erythraeum ThyX intein. (Top) Schematic illustration of the fusion protein construct consisting of the maltose binding protein sequence (M), the intein sequence (gray box), and the thioredoxin sequence (T). (Bottom) Observation of protein splicing. Protein production and splicing were carried out with E. coli, and the resulting products were visualized by Western blotting with anti-thioredoxin antibody as previously described (22). Lanes: 1, protein splicing of the Synechocystis sp. strain PCC6803 DnaB mini-intein as a known standard; 2, protein splicing of the T. erythraeum ThyX intein. The letter P marks the position of the precursor protein, which matches closely the predicted sizes of 74 and 91 kDa in lanes 1 and 2, respectively. The letter S marks the position of the spliced protein, which matches closely the predicted sizes of 57 and 56 kDa in lanes 1 and 2, respectively.
|
New ThyX sequences having distinctive ThyX motifs were found through BLAST searches of the GenBank database. Although not having inteins or introns, they showed ThyX motifs that are similar to but distinct from the previously defined ThyX motif RHRX7S (Table 1 and Fig. 4). Most of the new ThyX sequences (T. erythraeum, Nostoc punctiforme, Nostoc sp. strain PCC7120, Synechococcus sp. strain WH8102, mycobacteriophage Bxz1, Streptomyces coelicolor, and Streptomyces avermitilis MA-4680) are readily identified because they are more than 20% identical and 30% similar to the functionally identified Helicobacter pylori J99 ThyX protein and to the structurally determined Thermotoga maritima ThyX protein. ThyX sequences from the two thermophilic sources (Thermosynechococcus elongatus and thermophilic bacteriophage RM378), however, were less easy to identify because of their striking differences from other ThyX sequences at the N and C termini (Fig. 4B). Nevertheless, they are approximately 18% identical and 34% similar to T. erythraeum ThyX over four-fifths of the T. erythraeum ThyX sequence, which is quite significant in light of the generally low levels of sequence conservation among ThyX proteins. For example, the ThyX sequences of two cyanobacterial species (T. erythraeum and Synechocystis sp. strain PCC6803) are only 17% identical and 33% similar. The shortened N-terminal sequence of the T. elongatus and thermophilic bacteriophage RM378 ThyX proteins, relative to that of other ThyX proteins, may be compensated for by the extended C-terminal sequence, although the extended C-terminal sequence does not show similarity to that of other ThyX sequences. T. elongatus ThyX is also the only recognizable thymidylate synthase (which is functionally essential) that could be predicted from the complete genome sequence of this organism.
|
View this table: [in a new window] |
TABLE 1. Distinctive ThyX motifs in ThyX sequences of various organisms
|
![]() View larger version (84K): [in a new window] |
FIG. 4. ThyX protein sequence comparisons. In panel A, sequences are grouped according to ThyX motifs including THRX8S (T. erythraeum [Ter], N. punctiforme [Npu], Nostoc sp. strain PCC7120 [Nsp], and Synechococcus sp. strain WH8102 [SWH]), RHRX8S (Nostoc sp. strain PCC7120 [Nsp], S. coelicolor [Sco], and S. avermitilis MA-4680 [Sav]), and RHRX7S (Synechocystis sp. strain PCC6803 [Ssp], H. pylori J99 [Hpy], and T. maritima [Tma]). Positions conserved within each group are highlighted in gray. In panel B, positions conserved between the T. erythraeum sequence and the T. elongatus or thermophilic bacteriophage RM378 sequences are highlighted in gray. The intein insertion site in the T. erythraeum sequence is indicated by underlining of the two flanking residues (CS). The ThyX motif is also underlined, with the conserved R/THR and S residues in bold.
|
We suggest a more general ThyX motif, R/THRX7-8S, in order to accommodate all four of the distinctive ThyX motifs (Table 1). While this report was under review, others reported the functional characterization of two residues of the ThyX motif RHRX7-8S located at the catalytic site (15). The absolutely conserved S residue (S84) was shown to function as a nucleophile, and the first R residue (R74) was shown to participate in flavin adenine dinucleotide and dUMP binding. But our findings show that the first R residue is not absolutely conserved. However, when a ThyX motif begins with T instead of an R, it either has an R immediately before a T (in T. erythraeum, N. punctiforme, Nostoc sp. strain PCC7120, and Synechococcus sp. strain WH8102) or has a second R internally (in T. erythraeum and thermophilic bacteriophage RM378), and this could suggest functional replacement of alternative R residues.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»