Previous Article | Next Article ![]()
Journal of Bacteriology, July 2002, p. 3917-3922, Vol. 184, No. 14
0021-9193/02/$04.00+0 DOI: 10.1128/JB.184.14.3917-3922.2002
Copyright © 2002, American Society for Microbiology. All Rights Reserved.
Hyang Choi, and Chankyu Park*
National Creative Research Initiative Center for Behavioral Genetics, Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Yusong-Ku, Taejon 305-701, Republic of Korea
Received 11 January 2002/ Accepted 22 April 2002
|
|
|---|
|
|
|---|
Despite a lack of overall primary sequence conservation, the group I introns contain blocks of conserved nucleotide sequence elements (P, Q, R, and S), forming an active center with a secondary structure consisting of characteristic stem-loops P1 through P10 (2, 22, 23). There are also extra sequences, often including open reading frames (ORFs), which are located in loops peripheral to the catalytic intron core. The group II introns contain a different secondary structure, consisting of six conserved pairings emanating from a central wheel (24). In both introns, sequence conservations are confined to small regions of nucleotide residues. The products of internal ORFs of these introns usually play a role in splicing (in which case they are called maturases) or mobility of the intron (also referred to as "homing") (17, 32).
The bacterial group I introns are found in several tRNA genes of cyanobacteria and purple proteobacteria (16, 28, 38). Some protein-coding genes, associated with DNA metabolism, of phages from Escherichia coli, Bacillus subtilis, Lactococcus lactis, Lactococcus delbrueckii, and Staphylococcus aureus contain group I introns (1, 3, 12, 18, 37). However, no group I intron has been reported so far from a protein-coding gene of a eubacterial genome. The group II introns of bacteria have been identified in cyanobacteria (Calothrix spp.), proteobacteria (Azotobacter vinelandii and E. coli [9]), and gram-positive bacteria (L. lactis and Clostridium difficile [27, 35]). Most of the eubacterial group II introns are found in genes on plasmids or associated with conjugal transfer (7).
The bacterial RecA protein is ubiquitous and highly conserved (15). It is involved in homologous recombination, in DNA repair, and in promoting proteolysis (25, 30). In recombination, RecA binds to DNA, promotes homologous pairing, and catalyzes strand exchange (31). When there is DNA damage, this process is used to fill the gap generated by the DNA damage during replication. RecA also serves as a regulator with an activation upon SOS signal to facilitate proteolysis of LexA,
repressor, and SulA involved in the control of cell division (21, 29, 34). In E. coli, the transcription of the recA gene is negatively regulated by the LexA protein, which binds to the upstream regions of the SOS genes including recA (20). Bacillus species also contain LexA, called DinR in B. subtilis, and the SOS repair system. However, the binding consensus sequence CGAACRNRYGTTYC (36), also known as the Cheo box for LexA, is different from that of E. coli (26).
Several recA genes of mycobacteria are interrupted by an intein that is removed from its precursor by a protein splicing mechanism to generate a functional RecA protein (5, 6). The inteins in six species, including Mycobacterium leprae and Mycobacterium chitae, were found in the same insertion sites (RecA-b) immediately downstream of glycine 205 in M. leprae, while the one (RecA-a) for Mycobacterium tuberculosis is after the lysine 251 (33). Inteins are also found in other eubacteria, archaea, and eukaryotes, frequently interrupting genes involved in replication and repair.
During the study of the recA genes from several gram-positive bacteria, we discovered that a putative recA gene of Bacillus anthracis is interrupted by an intervening sequence (IVS). Here we show that the IVS is a group I intron and that the recA gene codes for a functional RecA protein in E. coli. This is the first demonstration of the presence of a group I intron in a protein-coding gene of a bacterial genome.
|
|
|---|
recA::Cm allele was transferred from CP857 (A. Wolfe) to MG1655 by P1 transduction to generate a recA deletion. Cells were grown in Luria broth or Luria-Bertani (LB) agar plates at 37°C. When required, ampicillin (100 µg/ml) or chloramphenicol (30 µg/ml) was added to the medium. The nonpathogenic derivative of B. anthracis (
Stern lacking pXO1 and pXO2) was used for RNA preparation. The Bacillus strain was cultured in LBG broth (LB with 1% glucose) or LBG agar plates at 37°C. Plasmid construction. The chromosomal DNA of B. anthracis, obtained from Y. Chae, was used as a template for PCR with a pair of primers (RECA-A, 5'-TGT TGG CAA ATT GAA TTG AAA ATA GG-3', and RECA-B, 5'-CTC CAA CGA TAT AGA ACG CTT TC-3') to obtain the whole recA ORF, from 65 bp upstream of the start codon to 67 bp downstream of the stop codon. The PCR product was cloned into the SmaI site of pUC19 or pBluescript SK(+) to yield pUCrecA[int+] and pT7recA[int+], respectively. The intronless version of the recA gene was constructed by making a fusion of the two exons directly, using an overlapping extension technique of PCR (13). The recA[int-] gene was cloned into pUC19 or pBluescript SK(+) to generate pUCrecA[int-] and pT7recA[int-], respectively. The cloned region of pUCrecA[int-] or pT7recA[int-] is the same as that of pUCrecA[int+] or pT7recA[int+], except for the intron region. The sequences and orientations of the cloned recA genes were confirmed by DNA sequencing. The recA genes of pT7recA[int+] and pT7recA[int-] are under the control of the T7 promoter.
Inverse PCR and sequencing of recA from B. anthracis. Purified chromosomal DNA of B. anthracis was completely digested with Sau3AI, EcoRI, BamHI, or PstI/NsiI. Each digested DNA was ligated at low concentration to facilitate formation of circles. PCR amplification was performed with the ligated template and a pair of oppositely directed primers (RECA-1, 5'-GAT AAT AGT AAT TCA TCG ATG TTA ACA C-3', and RECA-2, 5'-ACA ATC GCA ATC TTT ATT AAC CAA ATT C-3') recognizing the internal regions of recA. The sizes of PCR products were about 0.85 (Sau3AI), 1.7 (EcoRI), 3.8 (BamHI), and 4 (PstI/NsiI) kb. After sequencing of the products with primers RECA-1, RECA-2, and RECA-3 (newly designed based on the initial sequencing result; 5'-GGC CTC TGA ACT TGA TAT CG-3'), the partial sequences were assembled to generate the complete sequence of the putative recA gene. The assembled sequence was confirmed by direct sequencing of the PCR product (about 1.5 kb) amplified with RECA-A and RECA-B from the B. anthracis chromosome. The locations and directions of the primers are shown in Fig. 1.
|
View larger version (8K): [in a new window] |
FIG. 1. Schematic diagram of B. anthracis recA gene. The gene includes a 327-bp IVS, located after residue 203. The thick arrows indicate locations of oligonucleotides used. The predicted transcriptional start site and the termination loop are also indicated.
|
Stern or DH10ß containing plasmids was obtained from cells grown in appropriate media to an optical density of 1.0 at 600 nm. For preparation of RNA from BL21(DE3) containing pBluescript SK(+), pT7recA[int+], or pT7recA[int-], cells were grown to an optical density of 0.3 at 600 nm and treated with isopropyl-ß-D-thiogalactoside (IPTG) at a final concentration of 0.5 mM. The IPTG-induced cells were grown further for 1 h. Cells were harvested from 3-ml cultures and resuspended in 400 µl of ice-cold diethyl pyrocarbonate-treated water. An equal volume of acid phenol (Amresco) was added to the cell suspension and heated to 60°C for 15 min with occasional vortexing. The mixture was then centrifuged and the upper phase was transferred to a clean tube. The total RNA was precipitated with ethanol (Merck). After the RNA pellet had been washed with 80% ethanol, RNA was resuspended in 20 µl of diethyl pyrocarbonate-treated water. When required, RNA was further purified with the RNeasy kit (Qiagen) following the manufacturer's instructions.
In vitro transcription. In vitro synthesis of RNA from a linearized (with XbaI) pT7recA[int+] plasmid was performed with T7 RNA polymerase (Takara). The reaction was carried out in a total volume of 100 µl with 1x reaction buffer, 5 mM dithiothreitol, a 0.5 mM concentration of each ribonucleoside triphosphate, 80 U of RNase inhibitor, and 100 U of T7 polymerase. The mixtures were incubated at 37°C for 2 h, and the RNA synthesized was extracted with acid-phenol and precipitated with ethanol.
Reverse transcription-PCR (RT-PCR). Synthesis of cDNA was performed with a Moloney murine leukemia virus reverse transcriptase (Stratagene). One to five micrograms of total RNA was used as a template for cDNA synthesis initiated with the oligonucleotide RECA-5 (5'-CAC GGA ATG GTG GTG CCA C-3'), specific for the region downstream of the IVS. A portion of the cDNA was used for a PCR with oligonucleotides RECA-4 (specific for the region upstream of IVS; 5'-ATC GCA GAA GCA CTT GTA CG-3') and RECA-5. The cycling profile was as follows: 1 cycle at 94°C for 1 min; 30 cycles at 94°C for 20 s, 54°C for 30 s, and 72°C for 30 s; 1 cycle at 72°C for 2 min.
Intron labeling.
Total cellular RNA (5 µg) was incubated with [
-32P]GTP (3,000 Ci/mmol, 1 µCi per µg of RNA) and 40 U of RNase inhibitor at 37°C for 0, 10, 30, and 60 min in a splicing buffer (25 mM Na-HEPES [pH 7.5] and 15 mM MgCl2) (11). The reaction was stopped by addition of EDTA (final concentration of 10 mM) and loading buffer (1 mg of xylene cyanol FF per ml, 1 mg of bromophenol blue per ml, and 10 mM EDTA in formamide), and the RNAs were then loaded onto a 5% acrylamide-8 M urea gel. After electrophoresis, a sheet of X-ray film (Kodak) was laid onto the gel and exposed for 12 h at -70°C.
Testing UV sensitivity. The E. coli strains containing pUC19, pUCrecA[int+], and pUCrecA[int-] plasmids were grown at 37°C in LB medium to a density of 2 x 108 to 3 x 108 cells per ml. Cells were resuspended in 0.1 M MgSO4, and aliquots were dispensed onto petri plates. After UV irradiation for various lengths of time, cells were diluted and plated onto LB agar plates. After 20 h of incubation at 37°C, colonies were counted to measure rates of survival.
Nucleotide sequence accession number. The nucleotide and deduced amino acid sequences of the B. anthracis recA gene have been submitted to GenBank under accession number AF229167.
|
|
|---|
In the promoter region of the recA-like coding sequence are putative -10 and -35 promoter sequences between 32 and 63 bp upstream of the predicted start codon. The putative promoter sequence (TTGGCA-18 bp-TATAAT) is very similar to the E. coli consensus (TTGACA-16 to 18 bp-TATAAT). The putative LexA binding site, located 72 to 83 bp upstream of the start codon is exactly the same as that of B. subtilis recA.
In vivo and in vitro splicings of the IVS. We examined whether the IVS of the putative recA gene can be excised from the primary transcript. Using a pair of primers, RECA-4 and RECA-5 (Fig. 1), RT-PCR was carried out for total RNA from B. anthracis, resulting in a product smaller than that from the genomic DNA (Fig. 2A). Essentially the same result was obtained for total RNA extracted from E. coli containing B. anthracis recA (DH10ß/pUCrecA[int+], data not shown). The size difference matches the size of the IVS, indicating that the IVS was excised, which was also confirmed by DNA sequencing. The spliced form of the mRNA contains an ORF encoding a protein of 343 amino acids with 85% identity to the RecA protein of B. subtilis.
![]() View larger version (37K): [in a new window] |
FIG. 2. In vivo and in vitro splicing of recA transcript. (A) In vivo splicing of recA transcript. Lane 1 shows a PCR product obtained from B. anthracis chromosomal DNA with the RECA-4 and RECA-5 primers. Results of RT-PCR from total RNA of B. anthracis using RECA-4 and RECA-5 are also shown, in the absence (lane 2) or presence (lane 3) of reverse transcriptase. Lane M, DNA size markers. (B) Labeling of intron RNA with [ -32P]GTP. Total RNAs were extracted from strains with various plasmids as described in Materials and Methods and incubated with [ -32P]GTP at 37°C in a splicing buffer. Lane 1, BL21/pBluescript SK(+) incubated for 60 min; lane 2 to 5, BL21/pT7recA [int+] incubated for 0 (lane 2), 10 (lane 3), 30 (lane 4), and 60 (lane 5) min; lane 6, BL21/pT7recA [int-] incubated for 60 min. After the reaction, samples were analyzed by 5% acrylamide-8 M urea gel electrophoresis and exposed to an X-ray film. The size of detected band was approximately the same as that of the intron (327 bp). A 32P-labeled DNA ladder was used as a marker (M).
|
Secondary structure of the recA intron. Prediction of secondary structure for the recA intron revealed the distinctive features of a group I intron (Fig. 3). Group I introns contain regions of base pairing (P1 to P10), as well as the conserved sequence elements (P, Q, R, and S), which are necessary for proper folding and excision (2). The B. anthracis recA intron is spliced after the exonic u residue in the P1 stem that forms a pair with G, and the intron ends with G as in most group I introns. In addition, the 3'-terminal guanine of the intron is followed by three residues (CCA) capable of forming a base pair (P10) with residues (UGG) in the 3' side of P1. Thus, the nucleotides on the 3' side of P1 may serve as an internal guide sequence, promoting an alignment between the 3' and 5' splice sites for ligation of the two exons (4). The predicted structure of this intron resembles that of the bnrdE-I1 group I intron, inserted in the ribonucleotide reductase gene of the SPß-related prophage from Bacillus sp. strain BSG40 (19). About 36% of 327 nucleotides of the recA intron in the catalytic core, stem-loops, and single-stranded regions are identical (Fig. 3). Structural differences lie in the extra pairings of P1a and P1b, besides the absence of an ORF in the recA intron after P6a.
![]() View larger version (21K): [in a new window] |
FIG. 3. Predicted secondary structure of the recA intron. The arrows indicate the boundaries between exons (lowercase) and intron (uppercase). The conserved base-paired regions (P1 to P9), extra pairings (P1a, P1b, P3.1, P3.2, P6a, and P7.2), and conserved sequence elements (R and S) are shown. The boxed residues, UGG near P1 and CCA near P9, can make a pair (P10) to promote an alignment between the 3' and 5' splice sites required for the ligation of exons. The G site (black box) is responsible for the binding of the guanosine cofactor. The shaded boxes represent residues identical with the bnrdE-I1 intron from the Bacillus strain BSG40 (18).
|
-32P]GTP. After separation by urea-acrylamide gel electrophoresis, a labeled RNA product whose density increased over incubation time was detected from the sample containing the intron (Fig. 2B). The size of that band correlated well with that of the predicted intron (328 nucleotides including GTP). The same result was obtained with the pUCrecA[int+] clone, but the band intensity was weak (data not shown) due to a relatively low copy number of the precursor RNA. From this result, we concluded that the IVS of the recABA gene is a self-splicing group I intron.
Complementation of a recA defect in E. coli by the RecA from B. anthracis.
In order to assess the level of RecABA expression in E. coli, we performed Western blotting with an anti-RecA antiserum raised against the E. coli RecA protein. As shown in Fig. 4A, the RecABA proteins synthesized in E. coli (MG1655
recA::Cm) from the plasmids pUCrecA[int+] and pUCrecA[int-] were identical in size and amount regardless of the presence of the intron. The size of RecABA produced in E. coli was similar to that of RecAEC.
![]() View larger version (19K): [in a new window] |
FIG. 4. Complementation of recA deletion in E. coli by expression of B. anthracis recA. (A) Bacillus RecA expressed in E. coli. Total proteins were analyzed by immunoblotting with an anti-RecAEC antiserum after sodium dodecyl sulfate-polyacrylamide (12%) gel electrophoresis and transfer to a nitrocellulose paper. Lane 1, MG1655; lane 2, MG1655 recA; lane 3, MG1655 recA/pUC19; lane 4, MG1655 recA /pUCrecA [int+]; lane 5, MG1655 recA/pUCrecA [int-]. The size of the B. anthracis RecA protein was about 37 kDa, which is similar to that of E. coli RecA. (B) Complementation of E. coli recA defect by cloned B. anthracis recA. Strains used were MG1655 (), MG1655 recA ( ), MG1655 recA/pUC19 ( ), MG1655 recA/pUCrecA [int+] ( ), and MG recA/pUCrecA [int-] ( ). The numbers of colonies were counted after UV irradiation and plating of the samples onto LB agar plates. The plates were incubated for 20 h at 37°C.
|
|
|
|---|
-32P]GTP (Fig. 2B). These experimental observations are consistent with the prediction of a group I-like secondary structure for the IVS. The high similarity of predicted secondary structure and sequence of the B. anthracis recA intron to those of the bnrdE-I1 intron from Bacillus sp. strain BSG40 may suggest a lateral transfer of these introns, most likely by transposition (7). The first identified microbial group I intron was the one in the thymidylate synthetase gene (td) of bacteriophage T4 (3). About 17 introns in protein-coding genes of bacteriophages have been reported (7), besides 74 introns in tRNAs and rRNA of bacteria. However, the group I introns have so far never been found in a protein-coding gene of a bacterial genome. Most group I introns found on the bacterial genomes are contained in a prophage, not in the genome. Although the group I intron of the thymidylate synthase gene was observed in Bacillus mojavensis s87-18 (8), it was not clearly stated whether it was from a prophage or the genome. One could imagine that the recA gene presented here might be from a prophage, and thus, another genomic copy of recA gene may exist. In a search for recA sequence from B. anthracis (Ames strain) genome sequences (www.tigr.org), we found a contig (6264) containing the recA gene with its intron. Analysis of the surrounding nucleotide sequences revealed the gene order pgsA-cinA-recA-ymdA, which is also preserved in the B. subtilis genome, except for the pbpX gene, which is inserted between recA and ymdA in B. subtilis. The orientations of these genes are identical, implying that the recA gene is a genomic copy in B. anthracis. Thus, the recA intron characterized here would be the first example of a group I intron in a protein-coding gene of a bacterial genome.
As previously indicated, there are certain characteristics shared by the group I introns from bacteriophages. Most of them are found in protein-coding genes associated with DNA metabolism and contain an ORF for an endonuclease involved in intron mobility. The group I introns found in some tRNAs or rRNAs do not usually encode a homing endonuclease, although there are a few exceptions. The recA intron from B. anthracis appears to be associated in some way with DNA metabolism similar to the phage introns but does not contain a homing endonuclease, as found in the group I introns of tRNAs or rRNAs. Since intron mobility depends on the presence of a homing endonuclease (17), the recA intron of B. anthracis would not be mobile by this mechanism.
The presence of an intron in the B. anthracis recA gene seems particularly interesting because some recA genes from mycobacteria are also interrupted by an intein, a protein intron that is removed after translation. There are two types of mycobacterial inteins in terms of their insertion sites. More than eight species of mycobacteria, including M. leprae, have insertions at an identical site (RecA-b) (33), which is located immediately downstream of glycine 205 in M. leprae, while the intein from M. tuberculosis is found in another location (RecA-a) further downstream. It was proposed that the inteins were acquired independently (6, 33). It is striking that the insertion site of the B. anthracis recA intron is nearly identical to the RecA-b site of mycobacteria, with only one amino acid shift. Their origins might be evolutionarily related. The insertion of an intron and intein at almost the same location with one codon difference has previously been reported for the nrdE gene of SPß-related prophage of Bacillus (19). Currently, no role for the RecA inteins has been assigned, and the presence of an intein does not affect RecA activity (10), pathogenicity, or the survival of the host organism. The presence of an intron in the B. anthracis recA gene does not itself affect the expression or activity of RecA. It is possible that the recA intron might play some role during evolution, providing an advantage in adaptation or survival, although its function in current species seems dispensable.
This work was supported by grants from the Creative Research Initiative Program and from BK21.
M. Ko and H. Choi contributed equally to this work.
Present address: SolGent Inc., Seo-gu Taejon 302-858, Republic of Korea. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»