Previous Article | Next Article ![]()
Journal of Bacteriology, September 2005, p. 6166-6174, Vol. 187, No. 17
0021-9193/05/$08.00+0 doi:10.1128/JB.187.17.6166-6174.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Department of Bioengineering, University of California San Diego, San Diego, California,1 Department of Biochemistry, University of Wisconsin Madison, Madison, Wisconsin,2 Genome Center of Wisconsin, Madison, Wisconsin,3 Department of Bacteriology, University of Wisconsin Madison, Madison, Wisconsin4
Received 20 April 2005/ Accepted 14 June 2005
|
|
|---|
70-specific promoters occurring upstream of an expressed open reading frame and 74% (273/368) within 1,000 bp. Conversely, many known promoters were not detected by chIP-chip, leading to an estimated 26% negative-detection rate. Most of the detected binding sites could be associated with expressed transcription units, but 299 binding sites occurred near inactive transcription units. This map of RNA polymerase binding sites represents a foundation for studies of transcription factors in E. coli and an important evaluation of the chIP-chip technique. |
|
|---|
, one ß, one ß', and one
. Core RNAP becomes transcriptionally active holoenzyme with the addition of a
factor (Fig. 1A). E. coli possesses seven interchangeable
factors, each with specificity for different promoters. Sigma factors function as global regulators of gene expression and mediate the transcriptional response to conditions under which a large number of genes need to be turned on or off, such as stationary phase or heat shock. Except for
54, sigma factors do not bind to DNA except as part of the holoenzyme complex (9).
![]() View larger version (30K): [in a new window] |
FIG. 1. RNAP transcription cycle and the effect of rifampin. (A) Mechanisms of transcription initiation and rifampin binding. RNAP (blue) binds DNA to form a closed complex with affinities that differ among promoters by at least a factor of 100. The closed complex isomerizes to an open complex at rates that differ among promoters by at least a factor of 100. The open complex is stable at some promoters; at others it collapses back to the closed complex unless it binds initiating nucleoside triphosphate substrates. The competition between abortive initiation and promoter escape (initiation) is highly variable among promoters. Rifampin can bind to all species of RNAP except the elongation complex and traps RNAP in open complexes by forcing the release of 2- to 3-nt abortive transcripts. (B) Effect of rifampin on growth and RNAP location. E. coli cells were cultured, and the absorbance was determined at given time intervals. Rifampin (150 µg/ml) was added at 200 min to one culture ( ), while the other was allowed to continue growing as a control ( ). After 20 min, the culture with rifampin added stopped growing, while the control continued in exponential growth. The inset shows the results of quantitative PCR using primers designed to amplify in the promoter, 16S, or 23S subunit genes from all seven rRNA operons. Cells were treated with rifampin for 0, 20, or 40 min followed by immunoprecipitation using antibody for RNAP ß' core subunit. The results of PCR performed on genomic DNA before immunoprecipitation are shown on the right end of the gel image, labeled "input." The quantified relative intensity of each 16S and 23S band is given below the gel image.
|
A promoter is a DNA sequence to which RNAP binds and initiates the transcription of RNA. Knowledge of promoter locations is the first step in the elucidation of the transcriptional regulatory network. It allows the identification of which genes are cotranscribed and, from the DNA sequence, identification of regulatory motifs associated with different regulators. It also provides a basis for the interpretation of the binding site locations of various transcription factors. Promoters have been identified on an individual basis by primer extension and DNA footprinting studies, by which 961 promoters have been identified (www.cifn.unam.mx/Computational_Genomics/regulondb/) (36). In addition, DNA binding motifs have been used to predict the locations of 4,641 promoters throughout the genome (21, 40). New experimental evidence of promoter locations may validate or refute these predictions and will assist in understanding the function of uncharacterized genes and operons.
A powerful method combining chromatin immunoprecipitation and microarray analysis called chIP-chip (or genome-wide location analysis) has been developed for the global identification of transcription factor binding sites (10, 23, 34). In E. coli it has been used to identify the locations of MelR binding sites (16), and in yeast it has been used to identify the targets of over 200 transcription factors (18). Furthermore, chIP-chip was used to identify the targets of yeast RNA polymerase III (Pol III) and to investigate the roles of the accessory factors TFIIIB and TFIIIC (19, 31, 35).
These previous studies used microarrays spotted with PCR products of intergenic DNA and sometimes coding sequence as well. The resolution of the method depends on the size of the spotted PCR products in addition to the size of DNA fragments used in chromatin immunoprecipitation. Oligonucleotide microarrays have also been used (13) and hold the promise of greater resolution and the statistical confidence endowed by multiple signals from each region of the genome. However, the distribution of oligonucleotide probes is highly skewed in E. coli Affymetrix arrays and many regions of the genome are devoid of probes. Therefore, in this study we used a custom-designed oligonucleotide array from Nimblegen containing an evenly spaced "tiled" set of probes. Custom Nimblegen arrays have been used previously for chIP-chip of human transcription factors (25), but to our knowledge, this is the first publication describing the use of whole-genome, uniformly tiled arrays.
The accuracy and resolution of chIP-chip have been evaluated in a number of previous studies. For yeast RNA Pol III, essentially all known targets were correctly identified (19, 31, 35). A comparison of known transcription factor interactions to a large set of chIP-chip data revealed a false-negative rate of 80% (41). In the present study, we used rifampin to trap RNAP at promoters and compared promoter locations determined by chIP-chip to those determined by established methods. The existence of hundreds of known promoter locations in E. coli provides a novel opportunity for the evaluation of this technique.
|
|
|---|
For chIP-chip, three biological replicates were performed (three individual cultures). For each culture, two cell pellets of 50 ml culture were processedone experimental sample as described below and a second control sample identical to the first but with no antibody added to the immunoprecipitation reaction. The chIP DNA of the first biological replicate was analyzed by real-time PCR using primers in the promoter regions of lpdA and fepB, which were measured to be present at levels 35- and 7-fold higher than background, respectively. The experimental and control PCR-amplified samples were labeled with Cy3 and Cy5 by Nimblegen, swapping the cyanine dyes in one biological replicate. The microarrays consisted of 371,034 oligonucleotides from the E. coli genome (release m56 accession no. NC_000913) plus controls. The oligonucleotides were 50 nt long and spaced every 25 nt on both top and bottom strands.
The method used for chromatin immunoprecipitation was adapted from Lin and Grossman (28). E. coli MG1655, obtained from the E. coli Genetic Stock Center (CGSC no. 7740), was inoculated from frozen stocks into M9 minimal medium-0.2% glucose and grown overnight at 37°C. It was then diluted 1:50 into 200 ml M9-0.2% glucose in a 500 ml flask, and the mouth of the flask was sealed with foil and stirred for 5 to 6 h to an optical density (OD) at 600 nm of
0.4. Rifampin dissolved in methanol was added to a final concentration of 150 µg/ml and stirred for 20 min. Cultures were monitored by OD to verify the inhibitory effects of rifampin. A slight increase in OD was observed but with a greatly reduced growth rate. Samples (50 ml) were transferred to tubes containing formaldehyde and sodium phosphate (pH 7.6) such that the final concentrations were 1% and 10 mM, respectively. They were incubated at room temperature for 10 min with gentle agitation, and then glycine was added to 100 mM. They were gently agitated at 4°C for 30 min. Cells were centrifuged at 3,500 x g and washed twice in chilled phosphate-buffered saline. The cells were transferred to 1.5 ml tubes, centrifuged, and stored at 80°C.
Cells were resuspended in 0.5 ml of a solution containing 10 mM Tris (pH 8), 50 mM NaCl, 10 mM EDTA, and 20% sucrose, and then 0.2 µl Ready-lyse lysozyme (Epicenter) was added. They were incubated for 30 min at 37°C, and then 0.5 ml of a solution containing 200 mM Tris (pH 8), 600 mM NaCl, 4% Triton X-100, 1 mM phenylmethylsulfonyl fluoride, and 2 µg/ml RNase A was added. They were incubated at 37°C for 10 min and then chilled on ice. The lysates were sonicated four times for 20 s each time using a Sonics and Materials VC50 with a 3 mm microtip. Cell debris was removed by centrifugation, and a sample was removed and extracted with phenol and phenol:chloroform for DNA size analysis. DNAs ranged from 100 to 1,200 bp, with the greatest intensity at
600 bp. Pan Mouse immunoglobulin G magnetic beads (Dynal catalog no. 110.22) (40 µl) were resuspended and washed per the manufacturer's instructions and then added to the lysates and rotated at 4°C for 3 h. The beads were removed using a magnet and discarded. A total of 2 µl of antibody for RNA polymerase ß' (Neoclone catalog no. W0001) was added, and the samples were rotated overnight at 4°C. Magnetic beads (50 µl) were resuspended and washed as before and then added to the samples and rotated 1 h at 4°C. The beads were recovered and then washed once in 250 mM LiCl-100 mM Tris (pH 8)-2% Triton X-100, twice in 100 mM Tris (pH 8)-600 mM NaCl-2% Triton X-100, twice in 100 mM Tris (pH 8)-300 mM NaCl-2% Triton X-100, and twice in TE (10 mM Tris-HCl, 1 mM EDTA). DNA and protein were eluted from the beads by resuspending in 50 mM Tris (pH 8)-10 mM EDTA-1% sodium dodecyl sulfate and then incubated at 65°C for 20 min. Beads were removed, cross-links were reversed by incubating at 65°C for 4 h, and then DNA was purified using Qiaquick (QIAGEN).
ChIP DNA was amplified based on the Random DNA Amplification method of June 2001 from http://www.microarrays.org/protocols.html. The following primers were designed specifically for E. coli: PF43 (TGGAAATCCGAGTGAGTNNNNNNNNN) and PF44 (TGGAAATCCGAGTGAGT). Round B amplification using exTaq (Takara) was performed for 35 to 40 cycles. Round C amplification was not performed. PCR products were purified with Qiaquick and then ethanol precipitated. Pellets were resuspended in water, and a sample was diluted in 1 mM Tris (pH 8) to measure the DNA concentration. Experimental samples (4 µg) and control samples (3 µg) were hybridized to microarrays by Nimblegen.
Data analysis. For each array, the Cy5 signal values were multiplied by a scaling factor in Microsoft Access so that the total signals for Cy3 and Cy5 were equal. The log2 of the ratio of experimental signal to control signal was then calculated. The microarray was designed from both the top- and bottom-strand sequences, which means that for every probe, its reverse complement was also represented on the array. To distinguish them in the resulting data, 12 nt were added to the nominal genome position of probes from the bottom strand. Genome-scale data was visualized using Nimblegen's SignalMap software, and most other analysis was performed with Matlab version 6.5 (see the program code in the supplemental material). For each probe, the mean log2 ratio from all three biological replicates was calculated and then smoothed using a moving average with a window size of 500 bp. Peaks were identified using a reiterative process in which the probe with the highest log2 ratio was located upon each iteration. To prevent the same peak from being reidentified each time, the probe with the highest log2 ratio and all data within 1,000 bp were set to zero. This process was reiterated 1,000 times, and the resulting peaks were manually edited using SignalMap for visualization. An additional 139 peaks with a log2 ratio greater than 0.1 were then identified by the same method and manually edited.
|
|
|---|
Binding sites for RNA polymerase in the genome of E. coli strain MG1655 were identified using chIP-chip. Briefly, cells were grown to mid-log phase in minimal glucose medium and then treated for 20 min with rifampin. DNA and protein were cross-linked with formaldehyde, and cells were lysed. DNA was sheared to
600 bp with sonication, and RNAP was immunoprecipitated using an antibody specific for the ß' subunit. DNA bound by RNAP and from a no-antibody mock immunoprecipitation control was amplified using random primers and then labeled with Cy3 and Cy5 fluorescent dyes and hybridized to a custom microarray consisting of 371,034 E. coli oligonucleotides spaced 25 bp apart across the whole genome.
Reproducibility. This experiment was repeated three times. The reproducibility of the log2 ratios from each array is shown in Fig. 2. Higher correlation was seen when comparing just the experimental channels (coefficients of 0.71, 0.63, and 0.84) or just the controls (0.66, 0.61, and 0.87). This level of correlation is lower than typically observed in microarray analysis of gene expression, but important differences should be considered. In this experiment, the whole genome was represented with oligonucleotide probes, and yet only a fraction of the genome should be present in the DNA that was hybridized to it. Thus, the majority of oligonucleotide probes on the array are not hybridized, leading to an increased amount of noise. The log2 ratios of chIP-chip experiments tend to have less dynamic range and greater noise than those from expression analysis (R. Green, Nimblegen, personal communication).
![]() View larger version (31K): [in a new window] |
FIG. 2. Reproducibility of RNA polymerase chIP-chip data. Log2 ratios of immunoprecipitated versus no-antibody control DNAs from three hybridizations are plotted against each other. A cluster of outliers in biological replicate 1 consists almost entirely of probes from rRNA gene operons. corr. coef., correlation coefficient.
|
![]() View larger version (24K): [in a new window] |
FIG. 3. A sample of RNA polymerase chIP-chip data. In both panels, genome position is indicated at the top and ORFs are shown at the bottom, with those above the line being in the forward orientation and those on bottom being the reverse. (A) rRNA-encoding genes showed a different pattern of distribution in biological replicate 1. The smoothed log2 ratios from the three individual biological replicates are shown, along with the peaks identified in this region. (B) A sample from another region shows the combined log2 ratios from all biological replicates before and after smoothing with a moving average over 500 bp.
|
Binding site locations. From the combined smooth data, 1,139 peaks were identified, representing binding sites for RNAP. They may also represent promoters, if RNA is transcribed from them. Binding sites less than a few hundred base pairs apart or on opposite strands are expected to appear as a single peak. Binding and initiation by RNAP is strongly dependent on other transcription factors and the conditions tested. Presumably, the sites we detected are a subset of the total binding sites in E. coli. For instance, peaks were not observed at the sites of either the araBAD or lacZYA promoters, probably because those binding sites are occluded by repressors in glucose medium.
There are
2,800 RNAP molecules in E. coli in minimal medium (8) and 2,428 predicted transcription units (TUs) in the genome, not including
40 TUs containing recently annotated small regulatory RNAs (TU predictions are from www.biostat.wisc.edu/gene-regulation/) (7). Of these TUs, 1,552 contain an open reading frame (ORF) that appears to be expressed under these conditions (log2 signal > 8.0 in gene expression data generated with Affymetrix arrays) (data from reference 14). Of these, 1,141 have 5' ends that are more than 500 bp apart, providing an estimate of the number of binding sites that we might expect in this experiment. This number agrees remarkably well with our results. Of the 1,139 sites we detected, 958 occurred within 1,000 bp of the start site of a TU. For 659 of these, the TU contained an ORF with a log2 signal > 8.0 in gene expression data from the same growth conditions. Interestingly, this indicates 299 binding sites that are not associated with active transcription. In these cases, RNAP may bind to the chromosome but not activate transcription until a repressor or activator is triggered in response to environmental conditions, as is generally the case for transcription from
54-specific promoters. It should be noted, though, that after treatment with rifampin, all active promoters may become saturated by RNAP, forcing binding to sites that are not normally bound. Additionally, some promoters may bind RNAP weakly and yet still be active by having high rates of open complex formation or promoter clearance.
Of the 1,139 binding sites, 501 occurred in intergenic regions and 638 occurred in ORFs. Those in ORFs can be subdivided into three groups on the basis of the distance from the site to the nearest 5' end of an ORF (median distance = 158 bp). The first group consists of binding sites that are probably located upstream of the ORF but mistakenly identified inside the ORF due to poor resolution of the chIP-chip technique. In this group are 372 sites for which the nearest 5' end of an ORF is that of the ORF in which they occur and is <500 bp away. The second group consists of sites that are probably promoters for a downstream ORF. In this group are 191 sites for which the nearest 5' end of an ORF is an ORF that is different from the one in which it occurs and is <500 bp away. The third group consists of sites that occur >500 bp away from the 5' end of any ORF (see Table S3 in the supplemental material). Of these 75 sites, many have small log2 ratios and occur in insertion elements or in genes of unknown function. The others occur either in the middle of an ORF (sites in sapA, rfbX, evgS, emrA, nlpD, pyrG, xylG, and fimD) or near the intergenic region between two convergent genes (sites in pinQ, intQ, and otsA). These sites may play a regulatory role, possibly as RNAP pause sites or promoters of antisense transcripts, or they may be cryptic promoter-like elements ordinarily masked by transcribing RNAP.
Correlation to other factors.
There was very little correlation between the chIP-chip log2 ratio and the similarity of known promoters to a
70 consensus binding matrix (Fig. 4A). This result is surprising given that (i) the relative efficiency of chIP, often measured as "occupancy," has been observed to correlate with in vitro binding affinity (39) and (ii) RNAP has high affinity for promoters containing sequences close to consensus (26). A possible explanation for this lack of correlation may be the excess of RNAP molecules over the number of potential binding sites in the presence of rifampin. In excess and "trapped" at promoters, RNAP may bind to all sites equally. We also found very little correlation between the log2 ratio of chIP-chip peaks and the expression of the nearest downstream ORF (Fig. 4B), which is not surprising considering that the log2 ratio correlates poorly with the consensus matrix. This observation differs from studies of yeast Pol III in which the log2 ratio was directly correlated with transcription (35). The discrepancy may be due to biological differences between eukaryotic and prokaryotic RNAP, such as the ratio of enzyme molecules to potential binding sites or to disparate mechanisms of gene regulation. More likely, it is due to experimental differences; in the study performed by Roberts et al. (35) rifampin was not used to trap RNAP at the promoters and occupancy was measured in a transcribed region.
![]() View larger version (28K): [in a new window] |
FIG. 4. Poor correlation between the magnitude of the RNAP chIP-chip log2 ratio, gene expression, and promoter similarity to the consensus matrix. (A) The RNAP binding site log2 ratio was plotted against the similarity of the promoter to consensus. Of 681 known 70 promoters from RegulonDB, 497 were located within 1,000 bp of an RNAP binding site in chIP-chip data. For 413 of these, a score was available measuring the similarity of the promoter 10 and 35 boxes to consensus, plotted on the vertical axis. The score was calculated by RegulonDB as described previously (21) by establishing a consensus from 584 known promoters and then scoring each promoter by the use of this consensus matrix. (B) For each of the 1,139 RNAP binding sites (peaks) detected by chIP-chip, the log2 ratio was plotted against gene expression of the nearest downstream ORF. Gene expression data from reference 14 were generated from three biological replicates with Affymetrix arrays using the same growth conditions as for chIP-chip (M9 minimal medium plus glucose at mid-log phase).
|
1,900 to 3,400 kb with periodicity of
660 kb, which is significant relative to the periodicity found in randomized data. The regions of high and low average log2 ratios correlated with regions of high and low average gene expression levels (Fig. 5C). Periodicity of this sort has been observed for many different genomic parameters across the evolutionary spectrum, though the meaning, whether related to the physical organization of the genome or some other factor, is unknown (1, 24) (T. E. Allen, N. D. Price, A. R. Joyce, B. Ø. Palsson, unpublished data). It is unusual that only a fraction of the genome shows periodicity for RNAP binding sites. It is also interesting that a correlation between log2 ratio and gene expression was only observed when the log2 ratios were smoothed over 330 kb. On a gene-by-gene basis there may have been too much noise to see this trend.
![]() View larger version (40K): [in a new window] |
FIG. 5. Density and periodicity of RNA polymerase binding sites in glucose minimal medium as detected by chIP-chip. (A) The log2 ratios of 1,139 binding sites were averaged in each 1-kb section of the genome. Those sections lacking a binding site were assigned a value of zero. These data were examined for periodicity by wavelet analysis, which smoothes the data over a range of window sizes to reveal the wavelengths at which the data fluctuates most consistently. The period (wavelength) was plotted against genome position, with colors indicating the magnitude of average log2 ratio on a scale from white to yellow to red to black. The data were randomized and reanalyzed 1,000 times, allowing the estimation of significance of the original plot. Areas of the wavelet plot where the false-discovery rate was <10% are highlighted, while other areas are faded (5). (B) Profile of 660-kb periodicity. The average log2 ratios per kilobase were smoothed with a moving average window size of 330 kb. The vertical axis indicates the number of standard deviations by which the data differed from the mean (mean = 0.177; standard deviation = 0.027). Regions marked in red and green indicate positive and negative phases of periodicity at wavelength of 660 kb from panel A above. (C) The same as panel B above, but instead of average log2 ratios from chIP-chip, the log2 signal from gene expression measurements in glucose minimal medium was smoothed and plotted (14) (mean = 8.38, standard deviation = 0.32). D) The same as B above, except the number of detected RNAP binding sites per kb was smoothed and plotted (mean = 0.246; standard deviation = 0.028).
|
70, but high percentages of
24-,
28-, and
32-specific promoters were also detected. These sigma factors are associated with the transcription of genes for flagella biosynthesis and stress response. Gene expression measurements indicate that the genes downstream from
28-specific promoters are probably not expressed under these conditions (average log2 signal = 7.13), while those downstream of
24- and
32-specific promoters probably are (average log2 signals of 9.62 and 10.71, respectively). The expression of stress response genes does not necessarily indicate the presence of stress; heat-shock proteins are normally expressed at 37°C (20), and
32 is required for growth above 20°C (42). The log2 ratio of chIP-chip peaks near
24- and
32-specific promoters showed poor correlation to gene expression (r = 0.17). Few
38- and
54- specific promoters were detected by chIP-chip. These sigma factors are associated with the transcriptional response to stationary phase and nitrogen assimilation, respectively. |
View this table: [in a new window] |
TABLE 1. Proximity of known promoters to RNAP binding sites detected in this study
|
70 is predominant, followed in abundance by
28 and
54, while the others were undetectable (22). Under the conditions used here for chIP-chip, the mRNAs for all of the sigma factors appeared to be expressed, though
19 and
28 were less abundant (Table 1). In the presence of rifampin, the alternate sigma factors may have an increased ability to form complexes with core RNAP due to an increased level of free core RNAP and the depletion of
70 through binding with RNAP at promoters. The
38- and
54-specific promoters may be less often detected because of the activity of other transcription factors precluding the binding of RNAP to
38- and
54-specific promoters under these conditions. In any case, the detection of binding sites for the alternative sigma factors indicates that either those sigma factors are present at some level under these conditions or there is degeneracy in the binding specificity of
70-holoenzyme in vivo.
The distribution of distances between known
70-specific promoters and chIP-chip RNAP binding sites is shown in Fig. 6. We found that 49% of known
70-promoters correlated to a chIP-chip peak within 200 bp, and 73% correlated within 1,000 bp. Of 681 known
70-promoters, 532 occur upstream of an apparently expressed ORF (the nearest downstream ORF had a log2 signal > 8.0). Of these, 368 were separated by at least 500 bp, and out of 368 sites, 273 could be linked to a chIP-chip peak within 1,000 bp. This leaves 95 known expressed promoters that we were unable to detect, resulting in a 26% (95/368) false-negative rate. This compared favorably to chIP-chip experiments in yeast, for which a false-negative rate of 80% has been estimated (41). It falls short, though, of the nearly perfect detection of RNA Pol III binding sites (19, 31, 35). The false-positive rate is impossible to estimate, because binding does not always result in transcription and because of the lack of any other viable methods for measuring protein binding in vivo.
![]() View larger version (29K): [in a new window] |
FIG. 6. Histogram of the distance from 681 known 70 promoters to the nearest RNA polymerase binding site (peak) observed in chIP-chip data.
|
70 and ß' with Affymetrix arrays showed that the
70 sites were a subset of the sites detected using antibody for ß', as expected (see Table S4 and Fig. S2 in the supplemental material). Of 13,499 probes with
70 log2 ratios > 1, 97% had similar ß' log2 ratios (difference < 1). On the other hand, of 15,032 probes with ß' log2 ratios > 1, only 74% had similar
70 log2 ratios. The log2 ratios from the ß' and
70 Affymetrix chIP-chip experiments correlated with a coefficient of 0.72. The log2 ratios from the Affymetrix chips were compared to those from the Nimblegen chips. For each Affymetrix probe, the nearest Nimblegen probe in the same genomic orientation was identified, correcting for the different versions of the E. coli genome sequence that were used in chip design. The correlation coefficients were 0.60 for comparisons of Affymetrix ß' to Nimblegen ß' and 0.64 for comparisons of Affymetrix
70 to Nimblegen ß'. Affymetrix arrays were not used in any further experiments because of the incomplete coverage of the E. coli genome provided by Affymetrix expression arrays (see Fig. S2 in the supplemental material).
Another likely explanation for failure to detect RNAP at known
70 or other sigma factor promoters is the high variability of the rates of open complex formation and promoter escape and of the affinity of promoters for initial binding of RNAP holoenzyme (33). These same parameters may explain why it takes so long to clear RNAP from the rRNA operons. At some E. coli promoters, open complexes are so unstable that they initiate immediately upon binding nucleoside triphosphate without the slow promoter escape and high probability of abortive initiation observed at many promoters. The rrn P1 promoter is a classic example of such a promoter. It exhibits no abortive initiation and has the added features of highly efficient initial RNAP binding and the ability to sequester up to 80% of transcribing RNAP into elongation complexes on the seven rrn operons in E. coli (15, 32). RNAP released at a terminator may rebind an rRNA promoter and reform an rrn elongation complex more rapidly than rifampin can bind to it (
70 can associate with RNAP during termination) (3). Thus, RNAP may become trapped at rRNA promoters only after multiple rounds of transcription and when the rifampin concentration inside the cell accumulates to higher concentration, accounting for the slow clearance of RNAP from the rrn operon (Fig. 3A). Other promoters, which form unstable open complexes but lack the avid initial RNAP binding and rapid promoter escape properties of rrn promoters, may never accumulate significant RNAP, because RNAP will equilibrate away to promoters able to form stable open complexes. Thus, certain promoters that form unstable open complexes may escape detection. Consideration of the kinetic properties of promoters will be an important issue in the development of promoter mapping by the rifampin-trapping chIP-chip method.
It should also be noted that many of the sites we identified are likely to be specific to the minimal glucose conditions used here. Expansion of this experiment to other culture conditions will likely reduce the number of undetected known promoters and reveal differences in RNAP binding resulting from the activity of the regulatory network. Gene expression data also show changing patterns under different conditions but are often difficult to interpret. By using chIP-chip, direct regulatory interactions can be distinguished from indirect regulatory interactions (27, 34), but care should be practiced given the false-negative rate of 26% that we observed.
Closing remarks. Transcriptional regulation can be seen as a combinatorially based molecular "computation." In this analogy, the binding and interaction of various transcription factors with RNAP are the input variables and transcription level is the output. The identification of transcription factors involved in each "computation" is a fundamental step in understanding observations of gene expression. As each new factor is identified by chIP-chip or some other technique, its role will be defined by its interaction and colocation with RNAP. The work presented here lays the foundation for future work in elucidating the regulatory network of E. coli. The location data can also be used for validation of computational predictions of promoters, and the log2 values may provide insight into some characteristic of RNAP binding or activity. Whether the observed false-negative detection of known promoters reveals an inherent limitation of the technique or is merely a peculiarity of the experimental conditions is yet to be seen. A comparison of experiments with or without rifampin added, under different growth conditions, and with antibodies for various sigma factors will shed light on this question.
|
|
|---|
This work was initiated at the University of Wisconsin Madison with the support of University of Wisconsin I&EDR funds, Hatch McIntyre Stennis (U.S. Department of Agriculture), and University of Wisconsin trust funds (to A.Z.A.) and a grant from the National Institutes of Health (GM38660 to R.L.) and completed at the University of California San Diego with support from the National Institutes of Health (GM62791 to B.Ø.P.). C.D.H. was supported in part by NLM 5T15LM007359.
B.Ø.P. serves on the Scientific Advisory Board of Genomatica Inc., a spin-off company from the University of California San Diego.
Supplemental material for this article may be found at http://jb.asm.org/. ![]()
|
|
|---|
70), promoters, and the kinetics of the steps of transcription initiation, 792-821. In F. C. Neidhardt, R. Curtiss III, J. L. Ingraham, E. C. C. Lin, K. B. Low, B. Magasanik, W. S. Reznikoff, M. Riley, M. Schaechter, H. E. Umbarger (ed.), Escherichia coli and Salmonella: cellular and molecular biology, 2nd ed. ASM Press, Washington, D.C.
32. J. Bacteriol. 170:3640-3649.
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»