Previous Article | Next Article ![]()
Journal of Bacteriology, September 2002, p. 4952-4961, Vol. 184, No. 18
0021-9193/02/$04.00+0 DOI: 10.1128/JB.184.18.4952-4961.2002
Copyright © 2002, American Society for Microbiology. All Rights Reserved.
R. H. Carter,2 and L. B. Rothman-Denes1*
Departments of Molecular Genetics and Cell Biology,1 Biochemistry and Molecular Biology, The University of Chicago, Chicago, Illinois 606372
Received 22 April 2002/ Accepted 14 June 2002
|
|
|---|
|
|
|---|
Transcription of coliphage N4 middle mRNAs requires the activities of three early proteins: p17, p7, and p4 (53, 70, 72). Two of these gene products, p7 (30 kDa) and p4 (40 kDa), are soluble, have been purified to homogeneity, and constitute a heterodimeric, rifampin-resistant RNAP, N4 RNAPII (71). However, this heterodimer does not transcribe promoter-containing, double-stranded DNAs (dsDNA) and utilizes single-stranded DNAs (ssDNA) with low efficiency and no specificity. At least one additional phage-coded protein, p17, is essential in vivo for N4 middle RNA synthesis but is not sufficient in vitro for utilization of N4 middle promoters (1, 70; R. H. Carter, unpublished data). Recent in vitro characterization of p17 indicates that p17 is a ssDNA binding protein that recruits N4 RNAPII to ssDNA templates (R. H. Carter and A. Demidenko, unpublished data).
In the present study, the genes encoding the p7 and p4 subunits of N4 RNAPII were sequenced and cloned. Both genes are transcribed from the same phage early promoter. The two subunits display sequence similarity to separate, nonoverlapping regions of single-subunit, T7-like RNAPs, suggesting that gene fusion or gene splitting events have occurred during the evolution of this class of polymerases. An N4 fusion polypeptide was generated that is active in vitro and complements phages carrying mutations in either of the two subunits when expressed in vivo.
|
|
|---|
- IN(rrnD-rrnE)1] and W3350supF were used in N4 wild-type and N4am mutant phage infections, respectively (53). Strain TG1 [
(lac-proAB) supE thi
(hsdM-mcrB)5/F' traD36 proA+B+ lacIq
lacZM15] was used for isolation of single-stranded and RF M13 DNA. HMS174 (F- hsdR rK12- mK12+ recA1) was used in the construction of expression vectors. Protein expression was carried out in either BL21(DE3) (62) or W3350(DE3)pLysS (11). Phage were grown as described previously (53). Preparation and manipulation of DNAs. N4 DNA was prepared according to the work of VanderLaan et al. (64). DNA amplification reaction mixtures contained 1 µg of template DNAs, 50 mM KCl, 10 mM Tris-HCl (pH 8.3), 1.5 mM MgCl2, 0.01% gelatin, 0.2 mM deoxynucleoside triphosphates, 1 µM primers, and 2.5 U of AmpliTaq polymerase (Perkin-Elmer Cetus) in a final volume of 100 µl. DNA sequences of PCR-generated products were confirmed by sequencing.
DNA sequencing and sequence analysis. DNA fragments generated by HpaI digestion of N4 genomic DNA and cloned into pBR322 (35) were transferred to M13mp18 or M13mp19. Viral ssDNA was isolated and sequenced by the chain termination technique using Sequenase T7 DNA polymerase (version 2.0) and reagents purchased from United States Biochemical Corp. (USB). DNA sequences were determined for both strands. Oligonucleotides used for sequencing and cloning were synthesized at the University of Chicago Howard Hughes Medical Institute Oligonucleotide Facility or purchased from Operon Technologies, Inc. (Alameda, Calif.).
DNA sequences were translated and compared to available sequence databases by using the BlastP program (2). Further regions of sequence similarity were identified by comparison of the p7 and p4 polypeptide sequences to an alignment of 26 T7-like single-subunit RNAPs generated in this laboratory by using the ClustalX program (22).
Construction of N4 RNAPII expression plasmids. All cloning and transformations followed standard techniques (54). A DNA fragment corresponding to nucleotides (nt) 4606 to 6674 of the N4 genome, which comprise ORF15 (p7 subunit) and ORF16 (p4 subunit), was generated by PCR amplification using the 5' oligonucleotide primer 5'-GGAATTCCATGGGGTCTACTATCGAACATCAG-3' and the 3' primer 5'-CGGGATCCCGTTAAGATAGAGCGTATTCGG-3'. The 5' primer introduces EcoRI and NcoI sites upstream of the p7 coding region and a single glycine codon (GGG) between the first two naturally occurring codons of the p7 gene. The 3' primer introduces a BamHI site downstream of the p4 coding region. The PCR product was digested with EcoRI and BamHI (New England Biolabs) and ligated into M13mp18. After confirmation of the DNA sequence, M13 RF DNA was digested with NcoI and BamHI to release the fragment containing ORF15 and ORF16, which was cloned into the pET11d expression vector (Novagen) under the control of a T7/lacO promoter.
Three different expression plasmids were constructed by using the above oligonucleotides and either wild-type or mutant N4 phage genomic DNAs as PCR templates. Plasmid pSH+ contains a fragment generated by using wild-type N4 phage DNA as a template, pSHam15 contains a fragment generated by using N4am15 DNA as a template, and pSHam23 contains a fragment generated by using N4am23 DNA as a template.
To create a p7/p4 fusion polypeptide, mutagenesis was performed on M13mp18 containing the wild-type N4 PCR fragment by using the T7-Gen in vitro mutagenesis kit (USB). The wild-type N4 PCR fragment includes the p7 and p4 coding sequences separated by a 47-bp intergenic region. This intergenic region and the termination codon for ORF15 were precisely deleted. The oligonucleotide used to create the deletion was complementary to nt 5398 to 5409 and 5460 to 5478 of the N4 genome and had the sequence 5'-GGGCAGTAAAAGTTTGCATATCAATTACTTC-3'. Removal of the intergenic region from M13mp18 was verified by restriction digestion. The desired fragment was isolated and cloned into pET11d to create plasmid pSHF.
Labeling of proteins after induction. E. coli W3350(DE3)/pLysS bearing the N4 RNAPII expression vectors described above was grown at 37°C to an optical density at 620 nm of 0.5 in minimal salts medium (40) supplemented with 4% Lenox L Broth (LB) (Gibco-BRL), 1% Casamino Acids (Difco), and 0.01 mg of thiamine/ml. Cells were collected by centrifugation and resuspended in fresh medium. After addition of 2 mM isopropyl-ß-D-thiogalactopyranoside (IPTG) to induce expression of T7 RNAP and subsequent transcription of the N4 genes, followed by incubation for 30 min at 37°C, 200 µg of rifampin/ml was added to inhibit host transcription. After incubation for 90 min at 37°C, 50-µl samples were labeled with 5 µl of Tran35S-label (>1,000 Ci/mmol; ICN Radiochemicals, Irvine, Calif.) for 10 min at 37°C. Cells were collected by centrifugation and processed as described previously (14). Samples were analyzed by electrophoresis on sodium dodecyl sulfate (SDS)-12% polyacrylamide gels followed by autoradiography.
Enzyme purification. Recombinant N4 RNAPII subunits were expressed from plasmid pSH+ in E. coli BL21(DE3) cells. Cells were grown in a Lab-Line Hi-Density fermentor at 37°C with agitation and aeration to an optical density at 600 nm of 0.6, at which time IPTG was added to 0.4 mM. Induction proceeded for 3 h. Cells were collected by centrifugation and resuspended in 50 mM NaCl-50 mM Tris-HCl (pH 7.9)-2 mM EDTA-1 mM dithiothreitol (DTT)-1 mM phenylmethylsulfonyl fluoride (Sigma). All the following procedures were carried out at 4°C. Cells were lysed by sonication after one cycle of freezing and thawing. After removal of cell debris by centrifugation, chromosomal DNA was precipitated with 0.6% polyethylenimine-0.6 M NaCl. Proteins precipitating between 30 and 65% ammonium sulfate were resuspended in buffer A-10 (10 mM Tris-HCl [pH 7.9], 10 mM MgCl2, 1 mM EDTA, 1 mM DTT, and 10% [vol/vol] glycerol) and applied to a heparin Sepharose CL-6B column (25 ml of resin/liter of cell culture; Amersham-Pharmacia) equilibrated in the same buffer. After a wash with 2 column volumes of 150 mM NaCl in A-10 buffer, bound protein was eluted with a 7-column-volume linear gradient of 150 to 450 mM NaCl in buffer A-10. Fractions containing RNAP activity were pooled and applied to a Biogel HTP hydroxyapatite column (8 ml of resin/liter of cell culture; Bio-Rad). Bound protein was eluted with a 10-column-volume linear gradient of 10 to 80 mM sodium phosphate buffer (pH 7.9)-10 mM MgCl2-1 mM EDTA-1 mM DTT-10% (vol/vol) glycerol. Transcriptionally active fractions were pooled and applied to a Q-Sepharose Fast Flow column (15 ml/liter of cell culture; Amersham-Pharmacia). Bound protein was eluted with a 9-column-volume linear gradient of 50 to 500 mM NaCl in buffer A-10. Fractions were pooled, dialyzed overnight against buffer A-10 containing 50% glycerol, and stored at -80°C.
The fusion polymerase, expressed from plasmid pSHF, was purified in the same manner as heterodimeric N4 RNAPII, with the exception that the pooled hydroxyapatite fractions were purified by using a similar gradient profile on a Mono-Q HR5/5 column (Amersham-Pharmacia) instead of the Q-Sepharose Fast Flow column.
In vitro N4 RNAPII assays. The in vitro transcription assay used to detect N4 RNAPII activity during purification was performed as described previously (1, 70). In the presence of rifampin, this assay allows for the specific identification of RNAPII during purification from lysates that also contain E. coli RNAP. The DNA/membrane complex isolated from N4am15/23-infected cells provides template-bound p17 required for efficient transcription (1).
In vitro runoff transcription reaction mixtures contained 10 mM MgCl2, 0.1 mM EDTA, 1 mM (each) ATP, CTP, and GTP, 0.1 mM UTP, [
-32P]UTP (1,000 Ci/mmol; Amersham-Pharmacia), 100 µg of bovine serum albumin/ml, 20 mM Tris-HCl (pH 8.0), 75 nM poly(dC)-tailed template, and either purified heterodimeric or fused N4 RNAPII. Reaction mixtures were preincubated at 37°C for 5 min before addition of the four ribonucleoside triphosphates. Reactions were allowed to proceed for 5 min at 37°C before quenching by addition of 1.5 volumes of Sequenase Stop Buffer (USB) and analysis on 8% polyacrylamide-8 M urea gels.
To construct poly(dC)-tailed templates, DNA including the HpaI-HaeIII Mc fragment of the N4 genome (35) was generated by PCR amplification. One of the amplification primers used contains a KpnI site that, after digestion of the PCR product with KpnI, provides a preferred substrate for tailing by terminal transferase. Addition of poly(dC) tails was performed in a 50-µl reaction mixture containing 2 mM CoCl2, 0.2 mM DTT, 0.1 M potassium cacodylate (pH 7.2), 15 pmol of KpnI-PCR product, 20 µM dCTP, and 30 U of terminal transferase (Gibco-BRL). Reaction mixtures were incubated at 37°C for 1 h and terminated by phenol extraction and ethanol precipitation. The precipitated template was resuspended in RNase-free 10 mM Tris-HCl (pH 8.0)-1 mM EDTA buffer for use in transcription reactions. The resulting dC-tailed template contains a 294-bp dsDNA segment.
Plating efficiency. Bacterial strains were grown in brain heart infusion broth (Becton Dickinson) to a density of 3 x 108 cells/ml. One hundred microliters of N4 phage stocks diluted in 15 mM Tris-HCl (pH 8.0)-10 mM MgCl2 was mixed with 0.1 ml of bacterial culture. The mixtures were incubated for 10 min at 37°C to allow for phage adsorption, plated on LB agar, and incubated at 37°C. In work with bacterial strains containing expression vectors, IPTG (0.4 mM) and ampicillin (0.2 mg/ml) were added to the top agar at the time of plating. Phage mutations were considered rescued when the wild-type PFU/total PFU ratio (efficiency of rescue) obtained after plating on a nonsuppressing host was 100- to 1,000-fold higher than the ratio obtained by using mutant phage stocks grown in a host containing a control plasmid (35).
Nucleotide sequence accession numbers. The nucleotide sequences presented in this paper can be found in GenBank under accession numbers AY074660 and AY074661.
|
|
|---|
![]() View larger version (13K): [in a new window] |
FIG. 1. Organization of the leftmost 8 kb of the bacteriophage N4 genome. (A) Mutations within RNAPII subunits. (B) HpaI restriction map of the region. (C) Positions of the potential 18 ORFs predicted by sequence analysis. The solid rectangle indicates the position and length of the genomic terminal repeat. (D) Positions of the early promoters Pe1, Pe2, and Pe3 are indicated by solid arrowheads. Transcription terminators are designated t1 to t7. (E) Polypeptides required for middle gene transcription.
|
|
View this table: [in a new window] |
TABLE 1. Rescue of amber mutant phage by cloned fragments of N4 DNA
|
The deduced amino acid sequences encoded by ORFs 15 and 16 were independently compared to the National Center for Biotechnology Information database by using the BlastP program (2). Both p7 and p4 sequences display similarity to separate, nonoverlapping regions of single-subunit, DNA-directed RNAPs. These RNAPs include phage-encoded enzymes, nuclear-encoded mitochondrial enzymes, and linear-plasmid-encoded enzymes. The p7 and p4 sequences were next joined to create a hypothetical "fusion polypeptide" of 673 aa. The fusion joins these sequences in the order in which ORFs 15 and 16 are found in the N4 genome. When the hypothetical fusion polypeptide sequence was subjected to BlastP analysis, significance scores obtained for matches to single-subunit RNAPs were much better than those obtained by using the individual subunits.
The N4 RNAPII sequence was compared to an alignment of 26 single-subunit RNAPs in order to identify additional regions of sequence similarity. An alignment of the p7/p4 fusion polypeptide sequence to that of T7 RNAP and the three highest-scoring (by BlastP) linear-plasmid-encoded enzymes (Fig. 2A) reveals 13 blocks of conserved sequence (Fig. 2B). These correspond approximately to blocks II to X defined for this family of enzymes by Li et al. (33). No match was detected to the most amino-terminal block I. The alignment includes four conserved motifs (DxxGR, A, B, and C) essential for polymerase activity (59). The N4 RNAPII sequence contains all invariant residues located within these motifs, which in T7 RNAP are R425, D537, R627, K631, Y639, H811, and D812 (Fig. 2A) (59).
![]() View larger version (69K): [in a new window] |
FIG. 2. N4 RNAPII is a member of the single-subunit family of RNAPs. (A) Alignment of N4 RNAPII sequence to conserved sequence domains of single-subunit RNAPs. Alignment to three linear-plasmid-encoded fungal RNAPs that are most closely related to N4 RNAPII (as determined by BlastP analysis) and to T7 RNAP is presented. Sequences are ordered relative to their degree of similarity, with the Podospora anserina enzyme being most similar to N4 RNAPII. Conserved sequence blocks correspond approximately to those previously defined by McAllister and Raskin (38) and Li et al. (33). Numbers in parentheses are numbers of residues occurring between sequence blocks. The p7/p4 junction is marked in red. Asterisks indicate positions at which the identical amino acid is found in all five sequences. Colons indicate positions at which amino acids possessing similar physicochemical properties are found in at least four of the sequences. The sequences of motifs essential for polymerase activity are boldfaced. Residues experimentally shown to be important for polymerase activity are in red (6, 25, 44). The corresponding secondary-structure elements, as determined from the T7 RNAP crystal structure (27), are shown above the alignment. Rectangles, alpha helices; arrows, beta strands; lines, coil regions. Structure elements are color coded as in the work of Jeruzalmi and Steitz (27). Vertical lines mark elements that continue beyond the sequence blocks shown. The RNAP sequences shown have the following accession numbers: T7, CAA24390; Agaricus bitorquis, P33539; Neurospora crassa, P33540; P. anserina, S26945; N4 RNAPII p7, AY074660; N4 RNAPII p4, AY074661. (B) Comparison of N4 RNAPII and T7 RNAP. Relative locations of conserved sequence blocks (see panel A) are schematized. Numbers indicate the known boundaries of structural domains in T7 RNAP (27) and inferred boundaries in N4 RNAPII. Numbering in N4 RNAPII reflects the positions of these blocks in the p7/p4 fusion polymerase sequence. Residue 270 corresponds to aa 1 of p4. Blocks are colored according to the designations of Jeruzalmi and Steitz (27): yellow, N-terminal domain; green, thumb; red, palm; orange, palm insertion domain; blue, fingers; pink, foot module. The locations and functional roles of polymerase motifs and catalytically important residues within them are indicated.
|
![]() View larger version (50K): [in a new window] |
FIG. 3. Expression of plasmid-borne N4 ORF15 and ORF16. (A) Recombinant RNAPII subunits were synthesized in vivo in the presence of [35S]methionine and analyzed by denaturing gel electrophoresis and autoradiography. Lanes 1 to 3, cell lysates from W3350(DE3)pLysS carrying pSH+; lanes 4 to 6, lysates from W3350(DE3)pLysS carrying pSHam15; lanes 7 to 9, lysates from W3350(DE3)pLysS cells carrying pSHam23. Arrows indicate the positions of p4 and p7. (B) Sequence of the ORF15 and ORF16 fusion junction. A diagram of the intergenic region removed by site-specific mutagenesis is shown at the top. (C) Coomassie-stained gel of recombinant protein expression. Lane 1, W3350(DE3)pLysS cells containing pET11d; lane 2, expression of p7 and p4 from pSH+; lane 3, expression of the p7/p4 fusion polymerase from pSHF. Sizes of protein standards are indicated at the left of the gel. Arrows indicate the positions of proteins of the expected sizes.
|
|
View this table: [in a new window] |
TABLE 2. Plating efficiencies of N4 phages on various E. coli strains
|
The N4 RNAPII fusion polymerase is active in vitro. N4 RNAPII activity was purified from cells bearing either pSH+ or pSHF after induction of recombinant protein expression. The enzyme purified from cells bearing pSH+ consists of two polypeptides of sizes similar to those of the enzyme purified from infected cells (71). The enzymes from both sources have the same specific activity. A single polypeptide migrating as a species of approximately 70 kDa is obtained after purification from cells carrying pSHF (Fig. 4A). This is lower than the calculated molecular mass of 77, 726 Da but is in agreement with the anomalous migration of p4.
![]() View larger version (63K): [in a new window] |
FIG. 4. Relative specific activities of the heterodimeric N4 RNAPII and the p7/p4 fusion polymerase. (A) SDS-PAGE of protein samples used in transcription. Lanes 1 and 4, molecular weight markers; lane 2, purified p7/p4 fusion polymerase; lane 3, purified N4 RNAPII heterodimer. (B) Transcripts produced in vitro by the N4 RNAPII heterodimer (lanes 1, 2, and 5 to 8) and the fusion polymerase (lanes 3 and 9 to 12). Reaction mixtures contained either no template (lane 1), 75 nM tailless template (lanes 2 and 3), or 75 nM dC-tailed template (lanes 4 to 12). Polymerase was omitted from the reaction in lane 4. N4 protein p17 was present in all reaction mixtures. Sizes of DNA markers (nucleotides) are indicated to the right of the gel. (C) Quantitation of in vitro transcription products synthesized from dC-tailed templates. The amounts of 294-nt transcript synthesized by the N4 RNAPII heterodimer (solid line) and the fusion polymerase (dotted line) are plotted as a function of enzyme concentration.
|
|
|
|---|
The T7-like RNAPs belong to a superfamily of single-subunit polymerases (28). The crystal structures of representative superfamily members have been determined; T7 RNAP, members of the PolI and Pol
DNA polymerase families, human immunodeficiency virus type 1 (HIV-1) reverse transcriptase, and poliovirus RNA-dependent RNAP all share a common polymerase domain structure (17, 60, 61). This structure resembles a cupped hand in which thumb and fingers subdomains rise on each side of a palm subdomain (31). Sequence alignment of both RNA- and DNA-dependent RNA and DNA polymerases belonging to this superfamily identified two sequence motifs (A and C) containing highly conserved carboxylate residues. Another motif (B) is present only in DNA-dependent polymerases (13). These motifs occupy similar positions in the various structures: motifs A and C, which contain invariant aspartates, lie in the palm. These aspartates coordinate catalytically essential Mg2+ ions (43, 44, 59, 66). Motif B, which contains an invariant lysine, lies in the fingers subdomain and makes up part of the enzyme's nucleotide binding pocket (27). An additional motif common to DNA-directed polymerases [(T/D)xxGR], which lies in the palm subdomain, has also been identified (39). Recent studies indicate that this motif plays a role in stabilizing the RNA-DNA hybrid during early stages of T7 RNAP transcription initiation (25).
The locations of these four motifs and other blocks of sequence similarity in N4 RNAPII and T7 RNAP are diagrammed in Fig. 2B. Blocks of sequence similarity are colored according to their subdomain location in the T7 RNAP crystal structure (27) and are divided between the two N4 RNAPII subunits. Interestingly, the p7/p4 junction (aa 269-270) lies close to the boundary between structural subdomains. The p7 subunit contains sequences corresponding to the amino-terminal domain, the thumb, and a fragment of the palm in which the DxxGR motif is located, while p4 contains the remainder of the palm and the fingers subdomains, and all catalytically important residues within motifs A, B, and C (Fig. 2).
As a result of the high degree of sequence similarity observed, we expect that the overall tertiary structure of N4 RNAPII, as well as many secondary structural elements, will be similar to that of T7 RNAP. On this assumption, we mapped the blocks shown in Fig. 2 on the crystal structure of T7 RNAP (PDB 1ARO) (data not shown). Based on this analysis, we can make several predictions. We predict that the N4 RNAPII palm domain will possess structural elements related to those found in other single-subunit polymerases. A high degree of structural conservation has been observed for the palm domains of T7 RNAP, E. coli Klenow fragment, and HIV-1 reverse transcriptase, and the locations of the motif A and C carboxylates within the palms of these enzymes are essentially identical (29). However, in contrast to these enzymes, the N4 RNAPII sequence contains a long insertion (74 aa relative to 5 aa for T7 RNAP [Fig. 2A]) N-terminal to the sequence block (and by inference, the ß hairpin [Fig. 2A]) containing motif C.
We also predict that the N4 RNAPII thumb and fingers subdomains will be shorter than the corresponding regions of T7 RNAP (Fig. 2B and Table 3). Deletions within the T7 RNAP thumb result in a loss of elongation complex stability and processivity (5). Notably, the p7/p4 heterodimer does not appear deficient in processive transcription when initiating on dC-tailed templates (data not shown). The fingers subdomain contains motif B. N4 RNAPII contains an exact match to the consensus sequence Rx3Kx7YG, as do the phage- and nuclear-encoded mitochondrial RNAPs. In contrast, the majority of linear-plasmid-encoded enzymes contain the sequence Rx3Kx7YN. The significance of this sequence change is unknown because functional data are not available for most of the plasmid-encoded enzymes. It is also noteworthy that a serine residue immediately follows motif B in the N4 RNAPII sequence; serine at this position is found only in the phage-encoded polymerases (T7, T3, SP6, and K11) (8, 38). Cross-linking studies performed using T7 RNAP suggested that the motif B lysine plays a role in nucleotide binding (34). The corresponding N4 RNAPII residue (K407) lies in the p4 subunit. Surprisingly, catalytic autolabeling of N4 RNAPII using a cross-linkable derivative of the initiating nucleotide and denatured N4 DNA as a template resulted in labeling of the p7 subunit (18). The exact site of cross-linking of the derivatized nucleotide has not been identified.
|
View this table: [in a new window] |
TABLE 3. Comparison of domains between the T7 RNAP and N4 RNAPII
|
The palm insertion module closes off the back of the active site; its exact function remains to be determined. A segment of similar length and sequence is conserved in all T7-like RNAPs, including N4 RNAPII (Fig. 2B, orange blocks) (27). Residues within the specificity loop have been implicated in sequence-specific promoter recognition and interactions with the RNA product (9, 48, 52, 63). N4 RNAPII does contain an insertion corresponding to the location of the specificity loop, though we cannot predict its exact length (in Table 3, we have indicated the distance between the two flanking blocks of fingers sequence). We do not know if this insertion plays an equivalent role in promoter recognition at this time. Inherent promoter recognition is not a universal property of the single-subunit RNAPs. Yeast mitochondrial RNAP is composed of two subunits: a catalytic core (Rpo41) homologous to T7 RNAP and a specificity factor (Mtf1) required for promoter recognition (15, 37, 65). Mtf1 shares sequence similarity with eubacterial sigma factors (26). The similarity extends to its behavior: Rpo41p and Mtf1p interact in solution prior to promoter binding, and Mtf1p is released from the complex following initiation (36). Evidence exists for corresponding specificity factors in higher eukaryotes (24, 45). N4 RNAPII, although it contains the polymerase catalytic core, is unable to utilize promoter-containing DNA templates (71). N4 middle RNA synthesis is dependent on at least one additional N4 gene product, p17 (1, 70). p17 has been purified to homogeneity and characterized (Carter, unpublished). To our surprise, we found that p17 is a ssDNA binding protein that specifically recruits N4 RNAPII to ssDNA (Carter and Demidenko, unpublished). These findings suggest that N4 RNAPII must use a novel mechanism for promoter recognition.
The finding that functional domains contained within one polypeptide in a given species are distributed over more than one polypeptide in another species is not unique to N4 RNAPII. A number of instances involving nucleotide polymerases have been reported in the literature. The Methanobacterium thermoautotrophicum PolB enzyme, which is a member of the single-subunit RNA/DNA polymerase superfamily, is encoded by two ORFs located
650 kbp apart and on opposite strands (58). The smaller PolB2 subunit contains a section of the palm and the thumb domain, while the larger PolB1 subunit contains the remainder of the palm and the fingers domain. Evidence suggests that both subunits are required for activity (30). Unlike the arrangement in N4 RNAPII, the DxxGR, A, B, and C motifs are all located in PolB1 (K. M. Kazmierczak, by inspection of GenBank accession numbers AAB85697 and AAB84714). We have searched available sequence databases for other examples of split T7-like RNAPs. A number of examples of fragmented, probably inactive enzymes in mitochondrial genomes of fungi (21, 51) and Beta vulgaris (7) have been described. Others were found by us in the Spizellomyces punctatus mitochondrial chromosome sequence. We did find one candidate pair of ORFs (ORF670 and ORF348, GenBank accession numbers NP_064021 and NP_064020) in the B. vulgaris mitochondrial genome sequence (32). These two ORFs are separated by 41 nt, in the B. vulgaris sequence. ORF670 contains motifs DxxGR and A, while ORF348 contains motifs B and C; all catalytically essential residues are present in the four motifs. Whether these ORFs encode a functional enzyme is unknown.
The conserved sequences present in the ß' subunit of E. coli RNAP are divided between two polypeptides (A and C) in the archaebacteria Sulfolobus acidocaldarius, Halobacterium halobium, and M. thermoautotrophicum (3, 47) and between polypeptides RpoC1 and RpoC2 in cyanobacteria and chloroplasts (4). In H. halobium and M. thermoautotrophicum, the ß-subunit sequences are also divided between two polypeptides (B" and B') (3, 47). In all cases, the naturally occurring breakpoints fall between blocks of conserved sequence.
Monomeric enzymes have also been split experimentally to examine how protein domains function when they are located on separate subunits. Active E. coli RNAP has been assembled from ß-subunit fragments (57). In Helicobacter pylori and Wolinella succinogenes, the ß and ß' subunits of RNAP are naturally fused (68, 69). Separation of the ß and ß' subunits in H. pylori yields viable bacteria that are able to colonize mice (49).
In addition, examples exist of covalent linkage of genes to yield a fusion polypeptide. The second (hisD) and third (hisC) genes of the histidine operon of Salmonella enterica serovar Typhimurium have been fused into a single polypeptide (50, 67). In vitro characterization of the fused polypeptide indicated that, while the activity of the imidazolylacetol phosphate:L-glutamate aminotransferase (HisC) was normal, histidinol dehydrogenase (HisD) activity was more heat labile and had lower affinity for one of its substrates, histidinol, than the wild-type enzyme (50). The p7/p4 fused polypeptide exhibits 20% of the activity of the heterodimer in in vitro transcription elongation assays (Fig. 4), although it fully complements mutations in both polypeptides in vivo (Table 2). We have compared the sequence of the p7/p4 polypeptide to that of T7 RNAP at the N4 fusion junction. The distance between the two blocks of conserved sequence flanking the junction is longer (12 versus 9 aa) in the p7/p4 polypeptide than in T7 RNAP (Fig. 2A and Table 3). This suggests that, if the structure of the N4 RNAPII catalytic domain is similar to that of T7 RNAP, the spatial location of secondary structural elements flanking the junction should not be grossly perturbed in the fusion polymerase relative to the heterodimeric enzyme. However, we cannot rule out the possibility that the observed reduction in activity is caused by potential differences in the length or orientation of such structural elements. Additionally, the flexibility and motion of the surrounding domains, such as large protein rearrangements in the vicinity of the p7/p4 breakpoint during the transition from the initiation complex to a stable elongation complex, may be affected. We do not know if the fused N4 RNAPII polypeptide can tolerate breakpoints in other positions. Whether p7 and p4 evolved from a single polypeptide or whether, conversely, they represent the ancestral state of this class of polymerases is unknown.
|
|
|---|
We thank Roger Hendrix, Graham Hatfull, and the members of the Pittsburgh Bacteriophage Institute for confirmation of and corrections to the sequence of the N4 genome spanning positions 960 to 7997, and we thank Bill McAllister for many discussions and for providing helpful comments on the manuscript. S.H.W. performed some of the work described in this report while at the laboratory of F. Stahl (University of Oregon, Eugene). We thank him for his hospitality.
This work was supported by NIH grant AI 12575 to L.B.R.-D. K.M.K. and R.H.C. were partially supported by NIH grants T32 GM 07197 and T32 GM 07183, respectively.
Present address: Infectious Diseases Research, Lilly Research Laboratories, Indianapolis, IN 46285. ![]()
|
|
|---|
factors. J. Biol. Chem. 266:22671-22677.
H. J. Biol. Chem. 247:28751-28761.
29 DNA polymerase active site. J. Biol. Chem. 269:30030-30038.
H: functional analysis and comparative genomics. J. Bacteriol. 179:7135-7155.
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»