Previous Article | Next Article ![]()
Journal of Bacteriology, July 2007, p. 4718-4728, Vol. 189, No. 13
0021-9193/07/$08.00+0 doi:10.1128/JB.00281-07
Copyright © 2007, American Society for Microbiology. All Rights Reserved.

Aleksandra M. Mironczuk,
Wiep Klaas Smits,
Leendert W. Hamoen,
and
Oscar P. Kuipers*
Department of Genetics, University of Groningen, NL-9751 NN Haren, The Netherlands
Received 21 February 2007/ Accepted 17 April 2007
|
|
|---|
|
|
|---|
ComK activates transcription by binding to specific sequences, so-called K-boxes, in the upstream region of ComK-activated genes (Fig. 1). Each K-box consists of two AT-boxes with the consensus sequence AAAA-(N)5-TTTT. These boxes are separated by a spacer of a discrete number of helical turns, which positions both AT-boxes on the same side of the DNA helix. Based on the length of the spacer (two, three or four helical turns calculated between the first A's of both AT-boxes), K-boxes are divided into three different classes, i.e., those belonging to class I, class II, or class III promoters, respectively. Functional ComK is thought to act as a tetramer composed of two dimers, each binding to an AT-box. Tetramerization of the two dimers is accompanied by bending of the DNA (9). Previous research has demonstrated that the major role of ComK in transcription activation at the comG promoter is in stabilizing RNA polymerase binding, probably by facilitating contacts between the polymerase and the upstream DNA through bending of the DNA in the promoter region (29).
![]() View larger version (39K): [in a new window] |
FIG. 1. Overview of the ComK-binding site (K-box) in B. subtilis. (A) Schematic representation of a K-box. A K-box consists of two AT-boxes (shaded gray) with the consensus sequence AAAA-(N)5-TTTT. In this study, the AT-box closest to the 35 position is indicated as AT-box 1, while the box further upstream is called AT-box 2. In each AT-box, the positions of the four A's and T's are indicated with subscript numbers. The three distinguished classes of K-boxes (C I, C II, and C III) are indicated with the lengths of their respective spacers. The average AT content of the spacer region is over 60%. (B) Helical representation of a K-box. In this figure, a class I K-box with a spacer of two helical turns is depicted in a helical projection. The dark vertical bars represent the AT base pairs in the AT boxes. The gray circles represent the (A)4 stretches, and the black circles represent the (T)4 stretches in both AT-boxes.
|
As is depicted in Fig. 1, different elements can be distinguished in a K-box, like the spacer between the AT-boxes and the consensus sequence of these boxes. In principle, these characteristics could serve as additional critical elements to determine whether a K-box is regulated and, when regulated, to what level ComK stimulates transcription at this particular box. In the present study, a search for critical determinants in a K-box was performed by investigating the effects of alterations in the lengths and GC contents of the spacers between the AT-boxes. Furthermore, the effects of point mutations in the consensus sequence of the AT-boxes were determined using the K-box of the comG operon as a model, as well as an idealized K-box. This K-box was chosen because the level of transcription of the comG operon is among the highest ComK-activated transcription levels and requires only ComK, minimizing the chance of interference of other regulatory proteins (9, 29). The effects on regulation by ComK were investigated by monitoring transcription activation in vivo using ß-galactosidase assays and by determining the DNA-binding affinity of ComK for wild-type and mutant K-boxes in vitro by electrophoretic mobility shift assay (EMSAs). The results show the particular importance of the second thymine-adenine base pair in an AT-box for the regulation of the K-box by ComK, suggesting that this position is crucial for ComK-DNA interactions.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Bacterial strains and plasmids used in this study
|
PCR amplifications and plasmid constructions. PCRs were performed as described previously (15). For amplification of PCR products, Pwo or Expand polymerase (Roche) was used. Unless specified otherwise, chromosomal DNA of B. subtilis 168 was used as a template. Plasmids and primers used in this study are listed in Table 1 and Table 2, respectively.
|
View this table: [in a new window] |
TABLE 2. Primers used for the construction of different K-box mutants
|
Plasmids pG-b1+2P-SpGC-middle and pG-b1+2P-SpGC-end were constructed by amplification of the comG promoter region and K-box with primers comG-b2P-SpGC-middle and comG-b2P-SpGC-end, respectively, and combined with primer comG-end-XbaI. The PCR products were digested with EcoRI and XbaI and ligated into EcoRI/XbaI-digested pILORI4. In both plasmids, the GC content of the spacer is increased to 60%.
To investigate the effect of point mutations in the AT-box sequences, comG-lacZ fusion constructs were generated. The wild-type comG promoter fragment containing the common K-box was amplified with primers comG-AT2-EcoRI and comG-end-XbaI. To obtain an idealized K-box in the comG promoter, the guanine at position 2 in the A stretch of AT-box 2 was mutated into an adenine by amplifying the comG promoter fragment with primers comG-perfect and comG-end-XbaI. In both cases, the PCR products were digested with EcoRI and XbaI and ligated into EcoRI/XbaI-digested pILORI4, resulting in plasmids pG-wt and pG-perfect, respectively. Using the same strategy, point mutations were introduced in the AT-boxes. Plasmids were constructed using the primer combinations listed in Table 2. Single mutations were made in AT-box 1, while AT-box 2 remained perfect. The resulting plasmids were named pG-b2P-b1AxG and pG-b2P-b1TxG, with x referring to the position in the A or T stretch of AT-box 1 which is mutated into a G. Some single mutations were combined with the A2-to-G (A2-G) mutation in AT-box 2 that is present in the wild-type K-box of the comG promoter. In these cases, plasmids containing the single mutations were used as templates for PCR amplification with primers comG-AT2 and comG-end-XbaI, yielding a comG promoter fragment containing two mutations in the K-box. The created plasmids are named pG-b2A2G-b1AxG and pG-b2A2G-b1TxG, again with the x referring to the mutated position. Plasmid pG-b1T2G-b2P with only a T2-G mutation in AT-box 1 was constructed by amplification of the K-box region with primers comG-b1T2G and comG-end-XbaI. The same primers were used to construct plasmid pG-b1T2G-b2T2G, combining the T2-G mutation in AT-box 1 with a T2-G mutation in AT-box 2, using plasmid pG-b1T2G as the template. For all plasmids, cloning was performed in L. lactis NZ9000. Plasmids were checked by sequencing and transformed into B. subtilis 168.
To exclude the possibility that differences in observed transcription levels are due to variations in the numbers of copies of the reporter plasmids, an alternative strategy was developed, using chromosomal integrations for a representative series of mutants. For this purpose, all K-boxes with single-base-pair mutations, the perfect K-box control, and the T2 mutation series were cloned into plasmid pWK-Sp (W. K. Smits et al., unpublished), a cloning vector that allows integrations into the amyE locus of B. subtilis 168. To construct these pWKS plasmid series, the (mutant or perfect) K-box-comG promoter lacZ fragments were obtained by amplification with primers pG_F (5'-CGATGCATGCCATGGTA) and pG_R (5'-GCCACCTTCGTTTTCAGACTTTGC), using the original low-copy-number plasmids with the same mutations as the template (Table 1). The PCR products were digested with EcoRI and HindIII and ligated into EcoRI/HindIII-digested pWK-Sp. The resulting plasmids were transformed into E. coli MC1061 and, after isolation and sequencing, into B. subtilis 168. The obtained clones were verified for the absence of amylase activity on starch plates, indicating integration of the plasmids in the chromosomal amyE locus. All constructs were checked by DNA sequencing.
Transcription activation assays. Transcription activation by ComK on wild-type and mutant K-boxes upstream of the comG promoter was tested in B. subtilis using fusions with lacZ as a reporter. Cultures were grown in minimal medium to stimulate competence development, and samples were taken from the transition point until 3 to 4 h into the stationary growth phase, at 1-hour intervals. Samples were analyzed for ß-galactosidase activity as described by Israelsen et al. (16) and for protein expression levels by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (19) and Western blotting (31). ComK was detected with a ComK-specific first antibody (34) and an anti-rabbit horseradish peroxidase-conjugated secondary antibody (Amersham) and visualized by chemiluminescence detection using the ECL Western blotting analysis system (Amersham).
EMSAs.
EMSAs were performed essentially as described previously (33). For this purpose, ComK was purified according to the method of Hamoen et al. (9). Double-stranded DNA probes were amplified with PCR, using the same primer combination as was used for the construction of the plasmids (Table 2). Probes for determining the effect of spacer length were amplified with primers oligo-start-EcoRI and comG-end-XbaI. Probes were end labeled with T4 polynucleotide kinase using [
-32P]ATP. Proteins and DNA probes were premixed on ice in 20 µl binding buffer [20 mM Tris-HCl (pH 8.0), 5 mM MgCl2, 100 mM KCl, 0.5 mM dithiothreitol, 0.05 mg/ml poly(dI-dC), 0.05 mg/ml bovine serum albumin, and 8.7% glycerol]. Samples were incubated for 15 min at 37°C to allow protein-DNA complexes to form. Fifteen microliters of each sample was loaded on a nondenaturing 6% polyacrylamide gel. Gels were run in TBE buffer (89 mM Tris, 89 mM boric acid, and 2 mM EDTA) at 100 V, dried, and autoradiographed using phosphor screens. Protein shifts were quantified using the Quantity One software package.
|
|
|---|
![]() View larger version (58K): [in a new window] |
FIG. 2. Effect of the length of the spacer on the regulation of a K-box by ComK. (A) ß-Galactosidase (Betagal) assays were performed to investigate transcription activation by ComK on perfect K-boxes with different spacer lengths, representing a class I, II, or III promoter. The sequences of the spacers used to create a class I, II, or III K-box are depicted underneath the two diagrams. The sample for the depicted ß-galactosidase assay was taken after 4 h in the stationary growth phase. ß-Galactosidase activities are represented as percentages of the activity for the class II K-box. (B and C) EMSAs were performed to determine DNA binding by ComK to the three classes of K-boxes. Binding was tested for different ComK concentrations. (B) ComK shifts were quantified and are represented as percentages of the shift seen for the class II K-box for every ComK concentration. Light-gray bars, class I K-box; medium-gray bars, class II K-box; black bars, class III K-box. (C) Concentrations are schematically indicated by the wedge. ComK concentrations increased in twofold increments from 0.035 to 1.12 µM. Upper autoradiogram, class I K-box; middle autoradiogram, class II K-box; lower autoradiogram, class III K-box.
|
The spacer GC content does not affect regulation by ComK. B. subtilis is an AT-rich bacterium, with an average AT base pair content in the genome sequence of about 60%; also, the DNA of the spacer region in a K-box is relatively AT rich. Binding of ComK to a K-box induces a bend in the DNA, which is required for transcription activation, as was shown at the comG promoter (29). In general, AT base pairs are regarded as more flexible in bending than the somewhat more rigid GCs (17, 24), suggesting that the AT content of the spacer region might be an important determinant for regulation by ComK. To investigate this possibility, the AT content was reduced from 60% to 40%, using the comG promoter K-box as a model. In order to focus only on the influence of the spacer sequence, the AT-boxes were changed into perfect consensus sequences. Furthermore, it could be imagined that the position of the GC base pairs along the spacer affects bending abilities and thereby regulation by ComK. To take this possibility into account, two constructs were tested: (i) a construct in which the ends of the spacer were enriched in GC content (the sequence of the spacer was CCGCGTATAACGTCTCGC, with mutations in bold) and (ii) a construct in which the AT-GC substitutions were positioned in the middle of the spacer (the sequence of the spacer was CAGCACCTGGCATCTCAC). Surprisingly, these changes in the spacer sequence did not significantly affect transcription activation by ComK compared with that of a K-box with a wild-type spacer (CAGCATATAACATCTCAC). ß-Galactosidase assays showed only a slight increase (110% of the control level) in transcription when the spacer middle was enriched in GC base pairs and a very slight decrease (95% of the control level) when the GCs were introduced at the spacer ends. In addition, EMSAs demonstrated comparable levels of binding of ComK to control K-boxes and spacers in mutant K-boxes (results not shown).
Position T2 in AT-box 1 is critical for activation by ComK. As described above, the characteristics of the spacer region do not determine whether a K-box is used for regulating the transcription of the downstream gene but rather to what extent the ComK-dependent promoter is activated. In a search for possibly other features of a K-box that do determine whether and to what level transcription is regulated from a K-box, the influence of mutations in the consensus sequence of the AT-boxes was investigated. To determine the effect of a single-base-pair mutation on ComK regulation, a clean test system was developed using the class II promoter of the comG operon as a model. In previous research, this promoter was used to investigate the role of ComK in transcription activation, since it is among the promoters most highly activated by ComK and transcription activation is dependent only on ComK (29). The wild-type comG promoter contains an additional AT-box at two helical turns upstream of the start site of the common K-box. Since the goal of this research is to focus on determinants of general importance for ComK regulation, the extra upstream AT-box was omitted and a comG promoter containing only the common K-box was fused with the lacZ gene in plasmid pG-wt. The K-box of comG has one base pair deviation from the consensus sequence, i.e., an A-G mutation at position 2 in AT-box 2. This mutation was repaired in plasmid pG-perfect, creating a perfect, idealized K-box. As shown in Fig. 3A and B, ComK activates transcription at both promoters to comparable levels. Furthermore, ComK showed similar levels of DNA binding on both K-boxes (Fig. 3C).
![]() View larger version (13K): [in a new window] |
FIG. 3. Comparison of the wild-type comG promoter K-box and a perfect K-box. (A) ß-Galactosidase assays were performed to determine the levels of transcription activation by ComK at the wild-type K-box of the comG promoter and at an idealized, perfect K-box. Samples were taken from transition point until 4 h into the stationary growth phase. ß-Galactosidase activities (open symbols) are shown in Miller units. Levels of the growth of the cultures were measured by determining the optical density at 600 nm (OD600) (filled symbols). Triangles, wild-type K-box; squares, perfect K-box. (B) Average ß-galactosidase activities over four experiments were determined and are represented as percentages, with the value for the wild-type K-box set to 100%. The error bars show the standard deviations. The values for the last two time points are depicted, indicating little difference between both K-boxes. (C) EMSAs were performed to investigate DNA binding by ComK at the wild-type and perfect K-boxes. Binding was determined for different ComK concentrations. ComK shifts were quantified and are represented as percentages of the shift seen for the wild-type comG promoter K-box for 0.07 and 0.14 µM ComK. Black bars, wild-type comG promoter K-box; gray bars, perfect K-box.
|
![]() View larger version (47K): [in a new window] |
FIG. 4. Effect of single mutations on transcription activation and binding by ComK. (A) ß-Galactosidase assays were performed to determine the effects of single-base-pair mutations on transcription activation by ComK in vivo, using different K-box-comG promoter-lacZ fusion constructs located on low-copy-number plasmids in B. subtilis. For all mutants, ß-galactosidase (Betagal) activities are represented as percentages of the level of activity determined for the perfect K-box in the same experiment. The figure depicts the activities measured after 4 h of stationary growth. Error bars representing the standard deviation are included for all single-base-pair mutations. (A and B) P indicates the perfect K-box. Numbers 1 to 4 under "b1A" or "b1T" indicate the position that is replaced by a G in the A or T stretch of AT-box 1 (b1; the right AT-box, in which the A's and T's are named Ax or Tx) of the perfect K-box, used as the control in this experiment. (B) Details are the same as described for panel A, but in this case, the K-box-comG promoter-lacZ fusion constructs used were integrated into the amyE locus on the chromosome of B. subtilis. The A2-G mutant is missing, because the required mutation was not obtained. (C and D) To determine the effect of single-base-pair mutations on DNA binding by ComK, EMSAs were performed. ComK shifts for the mutant K-boxes were quantified and are represented as percentages of the shifts for the perfect K-box at different ComK concentrations. The standard deviation determined for the average over multiple experiments are depicted for each mutant. Quantifications are shown for the binding reaction with 0.07 µM ComK. For each gel in panel D, results for a blank () and three samples with ComK concentrations increasing in twofold increments from 0.018 to 0.07 µM are shown (indicated by the wedges). Perfect, the perfect K-box; Ax-G, A-G substitution at position x in the A-stretch of AT-box 1 (right AT-box); Tx-G, T-G substitution at position x in the T stretch of AT-box 1.
|
In addition to affecting transcription activation, single mutations in a K-box also affect DNA binding by ComK (Fig. 4C and D). The binding effects for each position usually correlate rather well with the effect seen on transcription, although binding at the A3-G mutation K-box is more comparable to transcription determined with the integrated reporter than that determined with the plasmid-located reporter. Except for T1, the T-stretch mutations show a clear reduction in binding by ComK. Like the effect of the mutation on transcription, a T-G substitution on position T2 shows the largest effect on ComK binding, indicating that this base pair position is most important for ComK-DNA interactions.
Transcription of and binding to constructs with double mutations in a K-box. For further investigation of the importance of the consensus sequence of the AT-boxes, constructs with double mutations in a K-box were created. For this purpose, single mutations in AT-box 1 producing either a wild-type, an intermediate, or a low level of transcription activation were combined with the natural mutation found in the wild-type comG promoter K-box, i.e., A2-G in AT-box 2. Double mutants were tested for transcription activation and ComK binding and compared with the wild-type comG promoter in pG-wt. Mutants with mutations at A1 and T3 were the least affected, showing a relatively mild decrease in transcription to 60 to 80% of the wild-type level, while the observed transcription levels for mutants with mutations at positions T2 and T4 in AT-box 1, combined with the A2-G mutation in box 2, were severely reduced to at most 15% of the wild-type level (Fig. 5A). Similar effects could be seen on ComK binding, although these effects seem to be somewhat stronger than on transcription (Fig. 5B). This is especially clear for the mutant with the combination of A2-G in AT-box 2 and T3-G in AT-box 1, in which DNA binding affinity is only near 20% of the wild-type level, although transcription activation is still over 60% compared with that of the wild type. Apparently, a decrease in binding affinity does not necessarily result in a similar reduction in transcription activation.
![]() View larger version (24K): [in a new window] |
FIG. 5. Effect of double mutations on transcription activation and binding by ComK. (A) Five single AT-box mutants were combined with an A-G substitution at position 2 in AT-box 2, the mutation seen in the wild-type comG promoter. ß-Galactosidase (betagal) assays were performed to determine the effects of those mutations on transcription activation by ComK in vivo. The activities determined 4 h into the stationary growth phase are shown. ß-Galactosidase activities and standard deviations are determined as described for Fig. 4, in this case relative to the activity in the wild-type-comG promoter K-box. (B) To determine the effect of the mutations on DNA binding by ComK, EMSAs were performed for different concentrations of ComK. Quantifications were performed as described for Fig. 4, and values are relative to that determined for the wild-type comG promoter K-box, used as a control (standard deviations are included for each mutant). Results are shown for the binding reaction with 0.07 µM ComK. Wt indicates the wild-type comG promoter K-box (the sequence of which is depicted above the graph in panel A; this K-box was used as the control in this experiment). With Ax or Tx, x indicates the position that is replaced by a G in the A or T stretch, respectively, of AT-box 1 (right AT-box), which is in addition to the A2-G substitution in AT-box 2 (left AT-box).
|
![]() View larger version (18K): [in a new window] |
FIG. 6. Importance of T2 positions in both AT-boxes for regulation by ComK. (A) ß-Galactosidase (betagal) assays were performed to determine the effect on transcription activation by ComK of a single T-G substitution at position 2 in AT-box 1, in AT-box 2, and in both AT-boxes, using plasmid-borne reporters. ß-Galactosidase activities are represented as percentages of the activity in a perfect K-box. The results are depicted for 4 h after entry into the stationary growth phase. Error bars were determined as described for Fig. 4 and 5. (B). EMSAs were performed to determine the effects of the mutations on DNA binding by ComK. Shifts were quantified and are represented as percentages of the shift determined for the perfect K-box. The averages and standard deviations were calculated and are depicted for the binding reaction with 0.07 µM ComK. Black bars, perfect K-box; light-gray bars, AT-box 1 with a T2-G substitution and a perfect AT-box 2; hatched bars, AT-box 2 with a T2-G substitution and a perfect AT-box 1; dark-gray bars, both AT-boxes with T2-G substitutions.
|
In silico analyses of ComK-regulated K-boxes. In 2002, three transcriptome studies of the ComK regulon were reported, providing an overview of natural transcription activation levels at K-boxes with various deviations from the consensus sequence (3, 11, 22). In that study, the ComK-regulated genes found in all three studies were grouped on the basis of their level of activation by ComK (strong to weak). Some ComK-dependent genes are organized in operons, meaning that the transcription of more genes depends on ComK binding to one K-box upstream of the entire operon. In this study, only unique K-boxes were included by taking into account only the most highly activated (n-fold) gene from an operon and assigning the transcription activation level for this gene to the K-box of the entire operon. The resulting list contained 88 ComK-regulated genes and was used as a reference provided by B. subtilis itself to determine the occurrence of mutations at each AT-box position. However, it should be mentioned that the possibility that some of the genes are regulated by additional factors (other than ComK) cannot be excluded. The ranking of the transcription activation levels, therefore, does not necessarily depend only on the occurrence of mutations in the K-box of a particular promoter but can also reflect influences of additional regulation. A weight matrix based on the sequences of all of these 88 ComK-regulated K-boxes shows a wide variety of mutations throughout the complete K-box (Fig. 7A), indicating that there are no positions where a mutation is never tolerated. This correlates with the previous knowledge that individual K-boxes with up to three deviations from the consensus sequence can still be good targets for regulation by ComK. Eleven out of these 88 ComK-activated genes were shown to contain a T-N substitution at position 2 in AT-box 1. A weight matrix based on these K-boxes clearly demonstrated that a concomitant mutation at the other T2 position in AT-box 2 is never seen among ComK-regulated boxes (Fig. 7B).
![]() View larger version (30K): [in a new window] |
FIG. 7. Weight matrices demonstrating natural variation in ComK-regulated K-boxes. (A) Weight matrix based on the sequences of 88 ComK-regulated K-boxes. On the x axis, the two AT-boxes, composing a K-box, are shown. In order to align the AT-boxes for the three classes of K-boxes, the spacer was in all cases set to the same length. For each position in the AT-boxes, which by consensus contain four A's and four T's, the percentage of occurrence of each of the four possible nucleotides among the 88 ComK-regulated K-boxes was calculated. This percentage is shown on the y axis. (B) Weight matrix based on the sequence of 11 out of the 88 ComK-regulated K-boxes, which contain a T-N substitution at position 2 in AT-box 1.
|
|
|
|---|
Furthermore, K-boxes with up to three deviations from the consensus sequence have been shown to be activated by ComK, resulting in gene transcription. In the present study, an attempt was made to elucidate the role of the different elements of a B. subtilis K-box in regulation by ComK, which may explain why at least some of the unregulated putative K-boxes (92% of the total number of K-boxes) are not used by ComK to activate gene transcription.
It was demonstrated that the difference in the lengths of the spacers between the AT-boxes, as seen in the three classes of K-boxes, influences the levels of transcription activation and DNA binding by ComK (Fig. 2). In correlation with results of DNA arrays (11), it was shown that ComK activates transcription at a class III K-box to a lower level than at class I and II K-boxes, likely because binding to class III K-boxes requires higher ComK concentrations to achieve shifts similar to those obtained for the other two classes (Fig. 2B). A natural example of a class III K-box is found at the comK promoter. Of this K-box, it is known that efficient regulation by ComK requires the binding of DegU to prime ComK binding and transcription activation at the onset of competence development, when ComK concentrations in the cell are still low (10). It might be that additional regulators are also involved in other class III K-boxes to stimulate transcription activation by ComK.
Previous research demonstrated that binding of ComK results in DNA bending of the promoter region, as was shown at the comG and comF promoters, where bending of 60 to 70° was determined (9). DNA bending is known to be easier for the more-flexible AT-rich sequences than for GC combination sequences (17, 24), but surprisingly, increasing the GC content of the spacer from <40 to 60% did not significantly influence regulation by ComK, nor did the positioning of the GC base pairs throughout the spacer region. Apparently, the sequence of the spacer is not an important determinant for the level of regulation by ComK.
In contrast to the spacer, the consensus sequence of the AT-box repeats is more critical for determining whether or not a K-box has a high chance of being ComK regulated, as was determined by the introduction of an A-G or T-G mutation in one AT-box in a further idealized K-box background. In the setup of mutational scanning of a K-box, it should be noted that replacement of a T by a G might have a larger effect than replacement of an A by a G, since in the latter situation the base pair remains a purine, while mutation from a T to a G also implies a change from a pyrimidine to a purine. Indeed, our study reveals larger effects on T stretches than on A stretches (Fig. 4), so it should be noted that the latter effects might be an underestimate of the importance of the positions in the A stretch.
Using single-base-pair mutational scanning of a K-box, it was shown that, although large natural variations are allowed, a single replacement of the second thymine residue in AT-box 1 by guanine decreased transcription activation about threefold compared to that in a perfect K-box (Fig. 4). The reduction in transcription activation could be explained entirely by a decrease in DNA binding, which was reduced to a similar level, namely, 30% of the control K-box level. This is, however, not the case for all positions in the consensus sequence of a K-box, as for example in the case of a T3 or A3 mutation in AT-box 2 (Fig. 4). In both cases, binding is more affected than transcription activation. It might be that the level of ComK binding is reduced, but that it is still enough to drive transcription activation to the same level as in the control K-box or, for the A3-G mutant, to an even higher level. However, for most positions in the AT-box, binding and transcription activation by ComK are affected to similar extents.
Since a single mutation at the T2 position of a K-box was shown to most strongly reduce the activity of this box from regulation by ComK, additional experiments were performed to investigate the importance of the T2 positions. Transcription activation and DNA-binding assays demonstrated that introduction of a second T2 mutation in the K-box reduced transcription and DNA binding to below 5% of control levels, suggesting that a combination of mutations at both T2s is not allowed in ComK-regulated boxes (Fig. 6). This result was corroborated by in silico analyses demonstrating the absence of double T2 mutations among K-boxes upstream of ComK-regulated genes (Fig. 7). In contrast, for all other individual positions in an AT-box, mutations at the same positions in both boxes could be found among ComK-activated K-boxes. In addition to the base pair at the T2 position in AT-box 1, the base pairs at positions A2 and A4 in AT-box 2 are conserved when there is a T2 mutation in AT-box 2 (Fig. 7B). Whether this is of biological importance or just due to the relatively low number of boxes with a T2 mutation occurring in vivo among ComK-activated K-boxes cannot be concluded from this study.
A possible way of maintaining the activity of a K-box with a T2 mutation could be the introduction of compensating mutations in other base pairs in the AT-boxes. Often, protein-binding sites display direct or inverted symmetry in the recognition site, as was demonstrated for the E. coli cyclic AMP receptor protein, the arginine repressor ArgR, and the lac repressor LacI (4, 7, 23, 25). In the case of the K-box, the two AT-boxes form direct repeats separated by a spacer, and within each AT-box, an inverted repeat is formed by the A and T stretches. Potential candidates for compensating mutations would therefore be the very important T2 in the other AT-box or the A3 position in the same AT-box as the T2 mutation. However, in silico analyses did not clearly point out conserved mutations in AT-box base pairs, which could be introduced to maintain the activity of a K-box with a T2 mutation, as can be seen in Fig. 7B.
The most interesting question with respect to the importance of the T2 positions for regulation by ComK is what could be the reason for its dominating role. We demonstrated that a replacement of T2 by a guanine lowers the efficiency of ComK regulation by reducing ComK binding to the DNA. The extent of shift is not changed, so alteration of the T2 position does not affect, for example, the oligomerization of ComK on the DNA. Although the mechanism behind the reduced DNA binding by ComK at a K-box with a T2 mutation is unclear, it is likely that the T2 position is the residue that is in closest contact with or has the highest affinity for ComK in protein-DNA interactions. It is tempting to speculate that the positioning of the K-box along the DNA helix does not allow a guanine at the second position in the thymine stretches. It might be that a G/C base pair affects the minor groove of the DNA helix, thereby decreasing the binding efficiency of ComK at the K-box. This hypothesis is supported by research on the Arg-boxes bound by the arginine repressor protein in E. coli. Wang et al. (36) demonstrated that this repressor contacts the DNA at four major and two minor grooves of a long consensus sequence (Arg-box). The minor groove in the center of the Arg-boxes faces the repressor protein and was shown to contain almost exclusively A/T base pairs, suggesting that G/C combinations are not allowed in this region. They suggest that these base pairs would introduce an inhibitory group into the narrow minor groove, thereby inhibiting binding of the Arg repressor. Furthermore, they remark that there is growing evidence that the methyl groups of thymine containing base pairs are often essential for protein-DNA interactions. Although the present study describes a first, general overview of the important base pairs in a K-box, the three-dimensional structure of the protein-DNA complex should preferably be elucidated to unravel the exact nature of ComK-DNA interactions at the site of a K-box.
Published ahead of print on 27 April 2007. ![]()
Present address: Department of Pharmaceutical Biosciences, School of Pharmacy, University of Oslo, P.O. Box 1068, Blindern, 0316 Oslo, Norway. ![]()
Present address: Institute for Cell and Molecular Biosciences, The Medical School, University of Newcastle, Framlington Place, Newcastle NE2 4HH, United Kingdom. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»