Previous Article | Next Article ![]()
Journal of Bacteriology, January 2005, p. 488-497, Vol. 187, No. 2
0021-9193/05/$08.00+0 doi:10.1128/JB.187.2.488-497.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Department of Microbiology and Immunology and Program in Bioinformatics and Proteomics/Genomics, Medical College of Ohio, Toledo, Ohio
Received 8 June 2004/ Accepted 8 October 2004
|
|
|---|
|
|
|---|
A subset of type II RM systems includes, in addition to genes for the MTase and REase, a third gene for a conserved regulator called the C (controller) protein. C proteins were originally discovered in the PvuII (58, 59) and BamHI (22, 57) RM systems and were then noted in several other systems as well (1, 11, 53, 59). The C protein gene has no fixed orientation relative to the gene for MTase but to date has consistently been found upstream of and in the same orientation as the gene for REase. In several cases, the C protein has been shown to be a positive regulator required for expression of REase. In the PvuII RM system, the C protein (C.PvuII) has been shown to activate transcription of its own gene, with the downstream gene for REase having no independent promoter (60). The apparent regulatory logic is that when these genes enter a new host cell, only the gene for MTase is expressed immediately. In contrast the gene for the C protein, subject to autogenous activation, is initially expressed at a very low level but its transcription is predicted to ramp up dramatically over time along with that of the downstream gene for REase. Thus, REase expression is delayed relative to that of MTase. This interpretation is consistent with observations that pre-expressing C protein in a cell makes it impossible to transform that cell with the intact RM system, presumably because of premature expression of the REase (44, 60).
On the basis of sequence similarity and mutation analysis, the C proteins were predicted to be dimeric helix-turn-helix (HTH) proteins (58, 59). In fact, a newly determined high-resolution structure for C.AhdI, the first for a member of the C.PvuII family, reveals an all-
-helical homodimeric protein containing a classical HTH domain that can be assigned to the Xre family of transcriptional regulators (33; J. McGeehan and G. Kneale, personal communication).
Comparison of C protein-encoding genes revealed the presence of conserved upstream sequences, called C boxes, having a consensus (53) that has since been redefined (3, 60) as shown in Fig. 1. The C boxes have been shown in several cases, including PvuII (60), EcoRV (64), EcoO109I (27), and BamHI (J. Brooks, personal communication), to be bound by the cognate C protein. For PvuII, it has been shown that mutation of the C box can abolish this binding (60). As indicated in Fig. 1A (bottom line) and B, the C box sequences appear to reflect recognition of a symmetrical core sequence (labeled 1A/1B and 2A/2B), as would be expected for homodimeric HTH proteins. The nature of the C boxes, including the strong conservation of asymmetric sequences between the box elements, has been noted previously (60).
![]() View larger version (80K): [in a new window] |
FIG. 1. Control region for the PvuII RM system. (A) Comparison of regions upstream of known and some hypothetical C protein-encoding genes. These genes were identified by using the Entrez BLink feature with both C.PvuII and C.SmaI as seeds and screening for proteins with predicted recognition helices similar to that of C.PvuII (HRTYI) (59). A question mark in the protein name indicates an open reading frame (ORF) that has not yet been formally named. Sequences upstream of the C open reading frames are shown; in all cases, the initiation codon is to the right (underlined where shown; others are farther downstream). The sequences are alphabetical by C open reading frame name, except for the top three: PvuII, the focus of this study; SptAI, which is very closely related to PvuII (43); and SmaI, for which the start of transcription has been determined (20). The shaded nucleotides indicate actual or predicted C boxes; the proposed symmetrical core (3, 60) is shown at the bottom. For C.PvuII (top line), the underlined nucleotides in box 2B indicate the previously identified transcript starts (60) while the shaded A of the C.PvuII initiation codon (far right) was identified in this study as a transcript start. (B) Sequence logo derived from the sequences in part A. The height of each stack reflects the extent of conservation at that position, while the height of symbols within the stack indicates the relative frequency of each nucleotide (56). The logo was generated by the server at http://weblogo.berkeley.edu. Gray bars under the numbers indicate the C boxes, the black bar delineates a region shown in this study to be dispensable for C-dependent transcription, while the white bar shows the regions found in this study to be required (at least in part) for C-dependent transcription. Vertical arrows indicate the transcript starts found previously (60) (solid) or in this study (dotted). (C) Genetic map of the PvuII RM system. Numbering is relative to the initiation codon of pvuIIC. The hatched area represents the previously identified promoter area for the pvuIICR genes, and the small gray rectangles above indicate the positions of the C boxes (the sequences shaded in panel A). The white rectangle between pvuIIM and pvuIIC corresponds to the white rectangle on the right of panel B; the two pvuIIC transcript starts (rightward bent arrows) indicate the one previously identified (solid, 27) and the one identified in this study (dotted, +1). Two promoters for the pvuIIM gene are also shown (leftward bent arrows) (60). Salm., Salmonella.
|
This study began as an attempt to determine whether or not the C box-promoter region is both necessary and sufficient for C-dependent activation of the genes for the PvuII C and REase proteins (pvuIICR). We find that the previously identified promoter is a weak, C-independent promoter that may initiate production of C.PvuII when the PvuII-encoding genes enter a new cell. We show that a portion of the pvuIIC gene between the C boxes and the initiation codon is responsible for C.PvuII-dependent expression of pvuIICR and demonstrate that this region includes the 10 hexamer for a C-dependent promoter that yields a leaderless mRNA. We also show that, surprisingly, the distance between the C boxes and the 10 hexamer of the C-activated promoter can be varied ±4 bp without reducing the level of activated transcription.
|
|
|---|
Promoter clones and their derivatives. Details of the construction of various plasmids used in this study, together with a table of the oligonucleotides used (Table S1), are available as supplemental material to this article.
Assays for reporter gene activity.
ß-Galactosidase activity was measured at several times during exponential growth. The assay used, which was described by Miller (37) and modified by Platko et al. (49), monitors hydrolysis of o-nitrophenyl-ß-D-galactopyranoside. The units of ß-galactosidase activity were calculated by dividing
A420 by the reaction time and by the volume of permeabilized cells used. The specific activity was obtained from the slope of a plot of ß-galactosidase activity versus the optical density (OD) of the culture at 420 nm.
Protein concentration. The Micro BCA kit from Pierce (Rockford, Ill.) was used to determine protein concentrations in accordance with the manufacturer's instructions. A standard curve based on bovine serum albumin and ranging from 0.5 to 200 µg/ml was generated.
Chloramphenicol acetyltransferase (CAT) assay. The Quan-T-CAT kit from Amersham Biosciences (unfortunately no longer available) was used to measure CAT reporter activity via transfer of acetyl groups from 3H-labeled acetyl coenzyme A. The substrate in this assay is missing the hydroxyl group of carbon 7 so that only monoacetylated CM can be produced, simplifying the kinetics, and the CM is biotinylated to allow separation from unreacted acetyl coenzyme A with streptavidin beads. A 250-µl sample was removed and pelleted for 1 min in a microcentrifuge when the culture OD at 600 nm reached 0.3, 0.4, and 0.5. The cells were resuspended in 180 µl of 100 mM Tris (pH = 7.8) and subjected to six freeze-thaw cycles. The extracts were incubated for 10 min at 65°C to decrease endogenous deacetylase activity (16). The cell extracts were centrifugally cleared, with 40 µl of supernatant taken for CAT assay, and portions of the remainder were used to determine the protein concentration. The CAT reaction mixture was incubated at 37°C for 1 min before streptavidin beads were added. 3H-labeled acetylated CM was measured in a scintillation counter after resuspending the beads in 1 ml of ScintiSafe 30% scintillant (Fisher Scientific, Chicago, Ill.). CAT activity was calculated as counts per minute per microgram of protein per minute.
Generation and analysis of library randomized for region downstream of C boxes. Oligonucleotide DK31 (Table S1 in the supplemental material) spans the region between the C boxes and the pvuIIC initiation codon and includes nine consecutive randomized positions (IDT, Coralville, Iowa). Oligonucleotide DK32 was used to generate the complementary strand. The two oligonucleotides were hybridized, and the shorter primer was extended with Klenow polymerase. The DNA was sequentially digested with HindIII and SalI, followed by ligation into pDK156 (which contains the pvuIICR promoter from 93 to 24, relative to the initiation codon of pvuIIC, in pKK232-8) and electroporation into Electromax EC100 E. coli cells (Epicentre, Madison, Wis.). Cells were plated onto LB agar containing CB. Colonies were pooled, and plasmid DNA was recovered with the midi plasmid kit (QIAGEN). The library DNA was electroporated into an E. coli strain containing pDK74 (a pACYC184-based plasmid that specifies C.PvuII), and Cat+ colonies were selected on plates containing CB and 25 µg of CM per ml. Plasmid DNA was isolated from these colonies, and the 9-nucleotide (nt) randomized region was sequenced with primer DK5.
Nucleotide sequence analysis and transcript mapping. All of the plasmid inserts generated in this study were sequence confirmed. DNA sequencing was done with the Big Dye termination system (Applied Biosystems, Foster City, Calif.), a variant of the dideoxy method (55). Reaction mixtures were resolved on a Prism 310 DNA analyzer (Applied Biosystems).
For total-cell RNA, 100-ml cultures were grown in LB medium to an A600 of 0.5. Ten milliliters of 5% phenol in ethanol was added to the culture to stabilize the RNA (31), and cells were pelleted. RNA was extracted by three alternative methods: (i) guanidine isothiocyanate (15); (ii) Trizol, a monophasic solution of phenol and guanidine isothiocyanate (Invitrogen; in accordance with manufacturer's protocol); and (iii) RNeasy (QIAGEN; in accordance with the manufacturer's protocol). RNase-free reagents and materials were used. To remove phenol, which affects reverse transcriptases, Phase Lock gel (heavy, 2 ml, from Eppendorf) was used to separate the organic and aqueous phases.
Nuclease S1 mapping was performed as described at http://www.fhcrc.org/labs/hahn/methods/mol_bio_meth/s1_oligo_probe.html. Annealing temperatures of 45 to 65°C were tested; the final annealing temperature for both the 77- and 51-nt probes (DK37 and DK38; Table S1 in the supplemental material) was 55°C. Total RNA was isolated from E. coli strains TOP10 and JM107MA2, each carrying the pPvuRM3.4 or pPvuRM3.4Esp19 plasmid (59), and Proteus vulgaris carrying the pPvu1 plasmid (5). Assays included 25 or 50 µg of total RNA. Samples were resolved on Tris-borate-EDTA-urea 10 to 20% acrylamide gradient gels (Invitrogen). A 10-bp ladder of size standards (Invitrogen) was loaded onto each gel.
Primer extensions were performed as recommended by the manufacturer of each reverse transcriptase used (Moloney murine leukemia virus from Amersham Biosciences and SuperScript II and Avian Thermoscript from Invitrogen). RNA preparations were those described above under S1 mapping. One picomole of 32P-end-labeled oligonucleotide (RV1, DK35, or DK36; Table S1 in the supplemental material) was mixed with 5 µg of total RNA at 70°C. Moloney murine leukemia virus and Thermoscript reaction mixtures were then placed on ice for 5 min, while Superscript II reaction mixtures were cooled to 42°C for 2 min. Reactions were allowed to proceed for 50 min and terminated via heat inactivation, and protein was precipitated with yeast tRNA and isopropanol and resuspended. Samples were resolved on a 6% polyacrylamide-Tris-borate-EDTA-urea sequencing gel along with a 10-bp DNA marker ladder.
|
|
|---|
Deletion, from the 5' side, of nt 93 to 60 (relative to the initiation codon of pvuIIC) replaced the putative 35 hexamer for the previously reported promoter (TCAACA, ending 6 nt upstream of the sequence shown in Fig. 1A) with the vector-derived sequence GTCGAC while leaving the 18-bp spacer region unchanged. This deletion substantially reduced the level of C-independent transcription, although that level was very low to begin with (Table 1). However, this upstream deletion had relatively little effect on stimulation of transcription by active C.PvuII (a roughly twofold reduction).
|
View this table: [in a new window] |
TABLE 1. Effect of deleting the region upstream of the C boxes
|
|
View this table: [in a new window] |
TABLE 2. Transcriptional effect of sequences downstream of the C boxes
|
|
View larger version (24K): [in a new window] |
FIG. 2. Effect of replacing DNA downstream of the C boxes on pvuIICR promoter activity. Promoter clones, all having the 5' end at 93 relative to the pvuIIC initiation codon, were placed upstream of the promoterless cat gene in pKK232-8. Only the sequence 3' of C box 2A is shown. To readily replace portions of the DNA, the nucleotides immediately following C box 2B were altered to generate restriction sites (underlined). Lowercase letters indicate vector DNA. These plasmids were used to transform E. coli already containing a compatible plasmid expressing either WT C.PvuII (black bars) or a defective variant (white bars), and CAT activity was determined in triplicate as described in Materials and Methods. Error bars indicate the standard deviation. For comparison, the sequence of the SptAI RM system is also shown (top line); the genes of this system are extremely closely related to those of PvuII, and their transcription is activated by C.PvuII (43), but even in this case conservation of the region between the C boxes and pvuIIC initiation codon is limited. Also for comparison, the bottom lines show the sequence context into which the randomized sequences were placed in other experiments (Library) and the location of the putative (Put.) 10 hexamer identified as a result.
|
![]() View larger version (45K): [in a new window] |
FIG. 3. Selection of functional sequences from a randomized library. (A) Two synthetic oligonucleotides were acquired. The 28-mer includes nine consecutive positions (N) that were synthesized in the presence of an equal mixture of all four phosphorothioate nucleotides. The second strand was synthesized by using the 12-mer shown and extending with Klenow polymerase. The shaded areas indicate restriction sites used for subsequent cloning. In the resulting plasmids, the pvuIICR promoter and C boxes are upstream of the sequence shown while downstream is a promoterless cat gene. (B) The plasmid library was used to transform an E. coli strain that already carried a compatible plasmid producing C.PvuII, and the transformants were plated onto agar containing CM to select for functional sequences. Twenty of these were sequenced as described in Materials and Methods; the 9-nt randomized region is shown with the native PvuII sequence at the bottom. Shading indicates a match to the native sequence, with number of matching positions indicated to the right; underlining reveals shifted occurrences of a TAT motif. In all 20 clones, the 9-nt region was preceded by ACC and followed by CCC. (C) Sequencing trace of the pooled randomized plasmid library before selection (portion underlined in panel A). The lower strand was sequenced, so the image has been reversed to match other portions of the figure and the complementary bases indicated above the trace. In the unselected randomized region, two positions showed enough bias for the sequencing software to call them as shown. (D) Logo analysis (56) of sequences shown in panel B. The logo was generated at weblogo.berkeley.edu and is aligned with the corresponding positions of panel C. (E) Logo analysis of 350 E. coli 10 promoter hexamers generated by the Computational Genomics Research Group, Department of Plant and Microbial Biology, University of California at Berkeley, and available at weblogo.berkeley.edu.
|
![]() View larger version (71K): [in a new window] |
FIG. 4. Determination of pvuIIC mRNA 5' terminus via S1 nuclease digestion. (A) Sequence complementary to the 77-nt probe used in S1 mapping analyses. Gray boxes indicate the C box elements and the ATG initiation codon of pvuIIC. Transcript starts identified previously (60) and in this study are shown in black boxes. The distances from the labeled 5' end of the probe (corresponding to the 3' end of the sequence shown) are 30 nt to the ATG start and 56 or 57 nt to the AGTC start. In addition, a 51-nt probe was used, the 5' end of which is complementary to the position 4 nt upstream of the ATG start codon (GTCT) and that extends 7 nt upstream of the entire sequence shown. (B) S1-digested products were resolved on acrylamide gels as described in Materials and Methods; autoradiograms are shown. The two marker lanes (M) show a 10-nt ladder, with the bands shown ranging from 20 to 100 nt. Unless noted otherwise, all reaction mixtures included the 77-nt probe. Lanes: 1, RNA from the strain used as background, E. coli TOP10; 2 and 3, 50 and 25 µg of RNA from TOP10 carrying pPvuRM3.4 (WT PvuII RM system); 4, same as lane 3 but with no nuclease S1 added; 5, same as lane 4 but with both 77- and 51-nt probes present; 6 and 7, 50 and 25 µg of RNA from TOP10 carrying pPvuRM3.4; 8 and 9, 50 and 25 µg of RNA from P. vulgaris, the native host for the PvuII RM system; 10 and 11, same as lanes 6 and 7 but with the 51-nt probe; 12 and 13, same as lanes 8 and 9 but with the 51-nt probe.
|
In contrast, we consistently saw a shorter 30-nt S1 digestion product, suggesting transcript initiation at the first nucleotide of the translation initiation codon (Fig. 4B). This would result in the production of a leaderless mRNA, as has been seen in several other cases (26, 39, 46), and would explain the lack of an obvious Shine-Dalgarno sequence upstream of the ATG.
Surprisingly, the primer extension analyses never showed the ATG initiation point even with the same RNA preparations that clearly showed this start in S1 mapping experiments. Two of the three RNA preparation methods also yielded no primer extension products for the C box start previously reported by Vijesurier et al. (60). However, RNAs prepared via use of guanidine isothiocyanate, which is what Vijesurier et al. had used, did reveal faint extension products including the previously reported C box starts (not shown). The reason for this disparity is unclear although, as described above, the results in Table 2 also indicate weak but significant promoter activity within the C boxes.
To verify that the pvuIICR mRNA detected by S1 mapping is leaderless, we confirmed that the assigned initiation codon is actually used for translation of pvuIIC. This codon had been implicated on the basis of being near the start of the open reading frame, and in vitro transcription and translation of this gene had yielded a product of the expected size (58). To further support this choice, we altered the initiation codon without changing the first (transcript initiation) position, ATG
AGA, and confirmed the remainder of the pvuIIC sequence. This change reduced restriction of bacteriophage
vir 106-fold (not shown), even when the entire downstream pvuIIR gene was replaced to rule out its inadvertent alteration during the site-directed mutagenesis. This loss of restriction is probably not due to transcriptional effects, since an ATG
CGT substitution at that position reduced C-dependent transcription of a cat reporter gene by only about a third (pDK151 in Fig. 2).
Effects of altered spacing between the C boxes and the DSR. Our data suggest that the region downstream of the C boxes functions by contributing the 10 hexamer to a pvuIICR promoter that initiates transcription at the pvuIIC initiation codon. In this case, the 35 hexamer would overlap C box element 2A (TAGATT on the basis of 17-bp spacing from the 10 hexamer; Fig. 1A). We assessed the effects of changing the spacing between the 10 hexamer and the C box region. As shown in Fig. 5, adding or removing 4 bp in the spacer region had minimal effects on C-dependent transcription. This is inconsistent with the behavior of standard promoters, in which even 1-bp changes from the ideal 17-bp 10 to 35 spacing profoundly reduce promoter activity (4, 25, 62). These 4-bp changes shift the DNA double helix face on which the 10 hexamer is exposed by ±140° relative to the C boxes (assuming B-form DNA). In contrast, inserting 10 or 20 bp, which would change the relative phasing by just 7 or 14°, respectively, eliminated C-dependent transcription. Thus, the functional relationship between the C boxes and the 10 hexamer depends on linear distance but not, apparently, on phasing.
![]() View larger version (39K): [in a new window] |
FIG. 5. Effect of altered spacing between the C boxes and putative 10 hexamer on pvuIICR transcription. (A) Promoter variants tested. Each promoter began at 93 relative to the pvuIIC initiation codon; only sequences 3' of C box 2B are shown. All include nt 23 to +6 of the native PvuII DNA and were inserted into pKK232-8 upstream of the promoterless cat gene. The nucleotides from 24 to 19 were replaced to generate a restriction site (underlined), and the spacing between the putative 10 hexamer for the ATG transcription start and the C boxes/promoter was varied by cleaving the restriction site and digesting the single-stranded tails, filling in the single-stranded regions, or inserting oligonucleotide duplexes of various lengths prior to ligation. The putative 10 hexamer, CATTAT, is 14 to 9 relative to the pvuIIC initiation codon. WT spacing of 10 nt between C box 2B and the putative 10 hexamer is indicated by ±0. (B) The sequences of all variants were confirmed, and they were used to transform E. coli already containing a compatible plasmid expressing either WT C.PvuII (closed circles) or a defective C variant (open circles); CAT activity was determined in triplicate as described in Materials and Methods. The bars indicate standard errors.
|
CGCCAG for PpvuIIM82 and GATAGA
GAGAGC for PpvuIIM120). In one orientation upstream of cat, these mutations abolished detectible transcription from the pvuIIM promoters, while in the opposite orientation they had no effect on C-dependent transcription of pvuIIC (not shown). Interestingly, the 10 hexamer region of the C-activated pvuIICR promoter stimulated transcription from at least one of the two opposing pvuIIM promoters (Table 3). The upstream pvuIIM promoter M120 is truncated at position 24 in our reporter plasmids, but the complete M82 promoter (60) is present in the longer pvuIICR promoter clones (56 to +123 relative to the M82 initiation point). The region encompassing the pvuIICR 10 hexamer is +33 to +48 relative to the PpvuIIM82 transcript start. The data in Table 3 reveal a large positive effect of the DSR on pvuIIM transcription but, as expected, no effect at all on C.PvuII. The basis for this effect has not yet been determined.
|
View this table: [in a new window] |
TABLE 3. Effect of the DSR on transcription from pvuIIM promoter M82
|
|
|
|---|
In contrast, S1 nuclease mapping consistently indicated a different starting point (Fig. 4), at the first nucleotide of the initiation codon. This starting point was not seen in primer extension analyses, even with the same RNA preparations that gave clear nuclease S1 signals. This type of result could be explained by such phenomena as stable RNA secondary structures or RNA splicing, but there is no obvious evidence that either applies in this case. It is more difficult to rule out mispriming on background mRNA in the primer extensions, although we used two species (E. coli and P. vulgaris) and multiple primers. Irrespective of the primer extensions, our results are most consistent with C-dependent production of a leaderless pvuIICR mRNA. The results in Table 2 suggest that the previously identified initiation point (within the C boxes) represents weak C-independent transcription, such as would be required early after gene transfer into a new cell to initiate the autoactivation circuit.
Implications for mechanism of transcription activation by C proteins.
The 23-bp distance between the C boxes and the ATG transcript initiation point would place bound C.PvuII at a rare but by no means unique position for transcriptional activators (17, 32, 48, 52, 63). In addition, a fortuitous region of very high similarity between C.PvuII and the cI repressor-activator of bacteriophage
(Fig. 6) suggests a specific activation mechanism. The match covers a region known (for
cI) (24) or predicted (for C.PvuII) (59) to include the activation patch on the upstream helix of the HTH motif. The key contact for
cI activation is between Glu34 and Arg588 in region 4 of
70 (RpoD) (8, 29, 30, 45). Glu34 is in the middle of the 11-amino-acid segment conserved between
cI and C.PvuII, and this Glu and flanking amino acids are highly conserved among C proteins (Fig. 6).
cI activates transcription via effects on the isomerization step, although there is evidence that the same protein-protein interactions can result in activation via DNA binding or isomerization, depending on the precise spatial context of the interaction with RNA polymerase (18).
![]() View larger version (53K): [in a new window] |
FIG. 6. Model for interaction between C proteins and region 4 of RpoD. The lower half shows sequences of the predicted HTH motifs of selected C proteins. The sequence above C.PvuII is that of cI protein from bacteriophage lambda, which has been structurally characterized and has a known activation mechanism (24). The shading indicates the nearly complete conservation of the upstream helix between cI and C.PvuII (one Val Leu substitution out of 11 consecutive amino acids), including the key activation residue (Glu shown in white on black). In the case of cI, this Glu activates by contacting a conserved Arg in the HTH motif of region 4.2 at the carboxyl end of RpoD ( 70; upper half). This Arg, in turn, is coordinated with a conserved Glu (both shown in white on black), and the Glu contacts position 33 in the promoter 35 hexamer (see text for details). In the RpoD sequences, gray shading indicates amino acids that differ from those in the E. coli ortholog. The RpoD sequences shown were chosen to match the native hosts of the C protein sequences in the lower half. Sequences are from the GenBank database, except for RpoD of P. mirabilis, which was kindly made available from the partially complete genome sequence (http://www.sanger.ac.uk/Projects/P_mirabilis/).
|
-helical dimeric protein that contains a classical HTH domain (33; McGeehan and Kneale, personal communication). Homology modeling of the structure of C.AhdI with the
cI-DNA-RNA polymerase ternary complex is entirely consistent with direct contact between C.PvuII and the
70 subunit (McGeehan and Kneale, personal communication), although this remains to be proven. A surprising feature of the pvuIICR promoter we report here is its relative insensitivity to changes in spacing (Fig. 5). The native sequence has a repetitive quality that might mask changes in spacing. Specifically, the WT sequence has a 10 hexamer (CATTAT) with 17-bp spacing from the putative 35 hexamer; the 4 variant might function with a TATCCC 10 hexamer having 16-bp spacing, and the +4 variant might function with a CATCAT 10 hexamer having 18-bp spacing. However, there may simply be no 35 hexamer. While the 10 hexamer lacks the TGN sequence characteristic of 35-independent, extended 10 sequences (7), in some promoters the activator functionally replaces the 35 hexamer (28). In this case the varied spacing between the C boxes and the promoter may reflect a structural feature of the C box-C.PvuII complex. Specifically, the center-to-center distance between box 1A/1B and box 2A/2B is 1.4 helical turns of the DNA (15 bp; Fig. 1A); this would likely position a pair of bound C.PvuII homodimers on opposite faces of the DNA, similar to what has been found for DtxR (13), so that a half turn of the DNA in either direction would still leave an activation surface in place to contact RNA polymerase.
Implications for host range of C-dependent RM systems.
Transcription activators often have a narrower host range than repressors, as the former have to make spatially constrained productive contacts with RNA polymerase (although these contacts can involve a very small number of amino acids). For example, a transcription activator from Bacillus could function in E. coli only when part of the E. coli RNA polymerase
subunit was replaced with the corresponding Bacillus sequence (35). However, the C proteins appear to have a broad host range (23).
We suggest three features that may contribute to the apparent host range breadth of C-dependent RM systems. First, as illustrated in Fig. 6, if the analogy to
cI is valid, then the activating contact involves a very highly conserved segment among major vegetative sigma factors. Second, the apparent independence of PpvuIICR from a 35 hexamer, suggested by the results in Fig. 5, might contribute to host range breadth. The replacement of conserved sequences having fixed spacing requirements (both the 35 hexamer and the spacer region) (61) with a more flexibly spaced site for binding an activator (47) could bypass some species-specific promoter requirements. Third, while the 3' end of 16S rRNA is highly conserved, there is evidence that the Shine-Dalgarno element in mRNAs plays roles beyond simply serving as a hybridization tag for ribosome docking (19). If this is correct, then a leaderless mRNA such as that for pvuIICR could bypass some species-specific regulatory signals; indeed, leaderless mRNAs are present and translated in all kingdoms of life (39). Given the potentially lethal consequences of misregulating RM systems, the host range of the regulatory system may be a limiting factor for the host range of the entire RM system.
This research was supported by a grant from the U.S. National Science Foundation (MCB-9904523) with additional support from the Research Challenge Program of the Ohio Board of Regents via the Medical College of Ohio.
Supplemental material for this article may be found at http://jb.asm.org/. ![]()
|
|
|---|
70 region 4 involved in transcription activation by lambda cI. J. Mol. Biol. 324:17-34.[CrossRef][Medline]
70 subunit of Escherichia coli RNA polymerase. J. Bacteriol. 184:3957-3964.
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»