Previous Article | Next Article ![]()
Journal of Bacteriology, August 2003, p. 4539-4547, Vol. 185, No. 15
0021-9193/03/$08.00+0 DOI: 10.1128/JB.185.15.4539-4547.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
Wouter T. Hendriksen,
He Yang, Jeffrey A. Beamish, Carlos J. Paredes, and Eleftherios T. Papoutsakis*
Department of Chemical Engineering, Northwestern University, Evanston, Illinois 60208
Received 16 December 2002/ Accepted 14 April 2003
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
The objective of this study was to use DNA array-based large-scale transcriptional analysis in order to study two important mutants of C. acetobutylicum, SKO1 and M5, relative to the parent strain (wild type [WT]). SKO1 is the result of the chromosomal inactivation of the spo0A gene (10), which results in an asporogenous, filamentous, and solventogenesis-deficient phenotype. M5 is the result of the megaplasmid pSOL1 loss (3, 4) and is also asporogenous and nonsolventogenic. pSOL1 contains all essential genes for butanol and acetone formation. Although these two mutant strains have apparently similar phenotypes, they are genetically very different. Our aims include being able to relate gene-expression patterns to specific phenotypes and to discover gene expression differences between the two mutants but also to establish similarities to and differences from B. subtilis. An important aim is to assign functions to groups of or individual C. acetobutylicum genes and use this information to formulate specific hypotheses for further testing. DNA array-based analysis has been extensively used in human, yeast, and Escherichia coli systems but less so in B. subtilis (for example, see references 6 and 12) and not at all in clostridia or other anaerobes. We thus had first to develop and validate this high-throughput tool for the transcriptional analysis of C. acetobutylicum.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Analytical methods. Cell growth was determined by measuring the absorbance at 600 nm (A600) with a Thermo Spectronic (Rochester, N.Y.) BioMate3 spectrophotometer (26). Culture supernatants from the bioreactor samples were analyzed for acetate, butyrate, acetone, butanol, ethanol, acetoin, and glucose levels with a Waters (Milford, Mass.) high-pressure liquid chromatography system (2, 26).
Growth conditions and maintenance. C. acetobutylicum strains were grown in an anaerobic chamber (Forma Scientific, Marietta, Ohio) at 37°C. Liquid cultures were grown in clostridium growth medium (CGM), and colonies were obtained from agar-solidified 2x YTG (28). WT liquid cultures were inoculated with a single colony, at least 4 days old, which had been heat shocked at 70°C for 7 min. Liquid cultures of the asporogenic M5 and SKO1 strains were inoculated with single colonies not older than 1 day, without heat shocking. The absence or presence of the pSOL1 megaplasmid in strains M5 and SKO1, respectively, was verified by monitoring amylase activity on 2x YTGMA plates (10, 20). Frozen stocks were prepared from cells at an A600 of 0.8 to 1.0 and were stored in CGM plus 20% glycerol at -85°C. SKO1 cultures were supplemented with 100 µg of erythromycin per ml unless otherwise noted.
Fermentation experiments.
WT strain ATCC 824 and strain M5 were grown as static flask cultures in 400 ml of CGM at 37°C. The static flasks were inoculated with 8 ml (1/50) of preculture at an A600 of 0.6. Bioreactor fermentations with pH controlled at
5.0 were carried out as previously described (26). The reactor medium (CGM) was supplemented with 75 µg of clarithromycin (Abbott Labs, Abbott Park, Ill.) per ml and 0.15% antifoam.
RNA sampling, isolation, and purification. Cell pellets from 5 to 15 ml of culture were collected by centrifugation at 4°C and 5,000 x g for 10 min. Pellets were resuspended in 200 µl of SET buffer (25% sucrose, 50 mM Tris [pH 8], 50 mM EDTA [pH 8]) with 20 mg of lysozyme per ml, and the samples were incubated at 37°C for 5 min (10). Cold TRIzol reagent (1 ml; Invitrogen, Carlsbad, Calif.) was added, and the samples were vortexed for 30 s. The TRIzol samples were immediately frozen at -85°C, and the RNA was purified within 1 month to avoid degradation. For isolation and purification, the TRIzol samples were thawed at room temperature and diluted fivefold in ice-cold TRIzol up to 1 ml. Chloroform (200 µl) was added to 1 ml of the diluted TRIzol-treated samples, vortexed, and allowed to stand for 2 min at room temperature. The samples were centrifuged at 12,000 x g for 15 min at 4°C, and the aqueous phase was transferred to a fresh tube. Isopropanol (0.5 ml) was added, the tubes were inverted several times, and the samples were allowed to stand for 10 min and then centrifuged at 12,000 x g for 10 min at 4°C. The resulting pellet was washed with 75% RNase-free ethanol and spun at 8,000 x g for 4 min at 4°C. After drying for 10 min, the RNA was finally resuspended in RNase-free water and quantitated with a UV spectrophotometer (A260 and A280). Each sample was run on a 1.2% agarose gel to check for lack of RNA degradation. Samples were stored at -85°C.
Northern analysis. RNA samples (20 µg) were used for Northern analysis as described previously (10, 16), with the following modifications. RNA was transferred from the formaldehyde gel to a Nytran membrane (Schleicher and Schuell, Keene, N.H.) with a Bio-Rad vacuum blotter by using the supplied protocol. Probes for spo0A and phosphotransbutyrylase-butyrate kinase (ptb-buk) were prepared from PCR fragments produced with the following primers: spo0A, 5'-GCCTGACCTTGTTGTTCTCG-3' and 5'-CGTGACCATGCAACTTCAATA-3'; ptb-buk, 5'-TGCAGATGCTATTCTTGTTGG-3' and 5'-TCATTTTTGTTTCATGGCTGTC-3'. Probes for detection of the thiolase (thl) and aldehyde/alcohol dehydrogenase-acetoacetyl-coenzyme A (CoA):acetate-butyrate:CoA transferase (aad-ctfA-ctfB) mRNA transcripts were prepared as previously described (10). Double-stranded DNA probes were purified by using a GFX DNA purification column (Amersham Biosciences, Piscataway, N.J.).
cDNA microarrays. cDNA microarrays with spots representing 1,019 open reading frames (ORFs), approximately one-fourth of the C. acetobutylicum genome, were printed by using the TIGR protocol (11). Genes in this generation of arrays include, among others, all 178 pSOL1 ORFs, 123 DNA replication and repair genes (90% of the total of such genes as identified by genome annotation [18]), 97 cell division- and sporulation-related genes (92%), 85 carbohydrate and/or primary metabolism genes (31%), 67 energy production genes (52%), 63 outer membrane and cell envelope genes (36%), 48 lipid metabolism genes (80%), and 42 motility and chemotaxis genes (39%). A complete list can be found at http://www.chem-eng.northwestern.edu/Faculty/papou.html. PCR primers (MWG Biotech, High Point, N.C.) were designed (Integrated Genomics, Chicago, Ill.) to amplify gene fragments with an average size of approximately 470 bp, such that nonspecific hybridization on the DNA arrays is minimized. PCRs (volume, 60 µl) with approximately 4 µg of chromosomal template DNA were performed with AmpliTaq DNA polymerase (Applied Biosystems, Foster City, Calif.) according to the manufacturer's suggested protocol. The resulting PCR products were run on an agarose gel to verify that fragments were of the proper size and that only a single DNA product was produced. Reactions with multiple bands were repeated under more stringent conditions. Additionally, 48 products were randomly chosen and sequenced on a 377 ABI sequencer (Applied Biosystems). The PCR products were purified with GFX columns, eluted with 60 µl of water, dried in a vacuum centrifuge, redissolved in a 35% dimethyl sulfoxide solution, and spotted at least in triplicate on Corning (New York, N.Y.) CMT-GAPS or TeleChem (Sunnyvale, Calif.) ArrayIt SuperAmine glass microarray slides with a BioRobotics (Woburn, Mass.) MicroGrid II DNA arrayer (125-µm spot size with 200-µm spacing). Many genes involved in solventogenesis and sporulation are represented by as many as 12 spots. In addition to the 1,019 ORFs, 22 control genes (3 from Clostridium pasteurianum, 9 from Saccharomyces cerevisiae, and 10 from Arabidopsis thaliana [SpotReport array validation system; Stratagene, La Jolla, Calif.]) with no known homologies to the C. acetobutylicum genome sequence were spotted as negative controls (see http://www.chem-eng.northwestern.edu/faculty/papou.html for a complete list). After spotting, the slides were UV cross-linked (Stratagene cross-linker) and baked in an oven at 80°C for 2 to 4 h.
cDNA labeling and hybridization. Labeled cDNA was synthesized by random hexamer-primed reverse transcription reactions in the presence of Cy3-dUTP or Cy5-dUTP by using Moloney murine leukemia virus (Promega, Madison, Wis.) or SuperScript II (Invitrogen) reverse transcriptase. Twelve micrograms of RNA was mixed with 2.4 µg of random hexamer primers (Roche, Indianapolis, Ind.), heated to 70°C for 10 min, and cooled on ice for 1 min. Unlabeled deoxynucleoside triphosphates (0.60 mM dATP, 0.15 mM dTTP, and 0.40 mM dGTP and dCTP), either Cy3- or Cy5-labeled dUTP (Amersham), 400 U of reverse transcriptase, 5x reverse transcription buffer, and 0.50 µl of SUPERaseIn (Ambion, Austin, Tex.) were added to a final volume of 25 µl. The samples were incubated at 42°C for 2 h. The reaction was stopped by the addition of 20 mM EDTA, and the RNA was degraded by the addition of NaOH (30 mM final concentration) followed by incubation at 70°C for 10 min. The mixture was cooled on ice and neutralized by adding HCl (30 mM final concentration). The labeled probe was purified with a GFX purification kit, and the DNA was eluted with 50 µl of Tris-EDTA (pH 8). The purified probe was dried to completion in a rotary SpeedVac and stored at -20°C until use.
For hybridizations, the spotted arrays were incubated in prehybridization buffer (5x SSC [1x SSC is 0.15 M NaCl plus 0.015 M sodium citrate], 0.1% sodium dodecyl sulfate, 1% bovine albumin) at 42°C for 45 min. The slides were then washed by dipping five times in Millipore water and twice in isopropanol and were then allowed to air dry. Oppositely labeled dried probes were resuspended in 5 µl of Tris-EDTA (pH 8) and mixed. One microliter of sonicated salmon sperm DNA (10 mg/ml; Stratagene) was added, and the mixture was denatured at 95°C for 3 min. An equal volume of 2x hybridization buffer (10x SSC, 50% formamide, 0.2% sodium dodecyl sulfate) was added, and the sample was loaded onto the array under a LifterSlip (Erie Scientific, Portsmouth, N.H.). The slides were hybridized 18 h at 42°C in Corning hybridization chambers with 100 µl of 10x SSC to maintain humidity. After hybridization, the slides were washed with TeleChem ArrayIt DNA microarray wash buffers A, B, and C for 5 min in each buffer. The slides were dried by centrifugation for 5 min at 500 x g. The hybridized arrays were analyzed with a GSI Lumonics scanner and ScanArray software (Perkin-Elmer Life Sciences, Boston, Mass.). Spot intensities were quantitated with QuantArray Microarray analysis software (Perkin-Elmer Life Sciences).
Microarray data analysis.
The data were normalized and genes which showed significant differences in expression levels were determined by a novel normalization and gene selection method (30). All array data were subjected to a filtering criterion based on the spot intensity (signal) for a given channel (which was background subtracted and corrected for nonspecific binding) and the standard deviation of the local background (noise). The criterion is described by the following:
![]() |
Average linkage hierarchical clustering was performed with Cluster (5). Self-organizing-map (SOM) analysis (24) was performed with GeneCluster 2.0 (Whitehead Institute for Biomedical Research). Gene clusters were visualized in TreeView (5).
-factor and 0A box binding site and operon prediction.
The full set of C. acetobutylicum ATCC 824 intergenic regions was scanned for 0A boxes and sigma factor binding sites by using a dot plot-like strategy (9). Two mismatches were allowed for sigma factor binding sites, and only one was allowed for 0A boxes. Changes in the coding strand or the presence of a sigma factor consensus sequence was used to predict the start of transcriptional units. The following motifs (23) were utilized:
A, TTGACA(16-18)TATAAT;
D, TAAA(14-16)GCCGATAT;
E, ATA(16-18)CATACANT;
F, GYWTA(15)GGNRANANTW;
G, GNATR(15)CATNNTA,
H, RNAGGAWWW(11-12)RNNGAATWW;
L, TGGCA(5)CTTGCAT; spo0A, TGNCGAA; and reversed spo0A, TTCGNCA (the numbers in parentheses indicate spacing between nucleotide sequences; R = A or G; W = A or T; Y = C or T; N = A, C, G or T).
| RESULTS |
|---|
|
|
|---|
Validation of the DNA array analysis protocol. We used two independent strategies for validation of the DNA array analysis protocol. The first was based on the genes of the megaplasmid pSOL1, which is absent in strain M5 (4). In a comparative analysis of WT and M5, any gene which resides on the pSOL1 megaplasmid should fall into one of two categories: significantly upregulated in the WT or nondifferentially expressed. An Eisen plot (5) of the 56 pSOL1 genes which are differentially expressed at the 95% confidence level for at least two time points is shown in Fig. 1. The predominance of green in the Eisen plot, indicating higher expression in the WT, demonstrates the validity of the DNA arrays and associated protocols. pSOL1 genes with no expression in the WT at a particular time point will have expression ratios that are randomly distributed about zero. It is therefore not entirely unexpected to see a modicum of light red (higher M5 expression) on the Eisen plot. In 2,244 classifications (132 pSOL1 genes [not including hypothetical proteins] analyzed on 17 arrays), 2,231 were proper identifications (99.4%). Analysis of pSOL1 hypothetical proteins revealed significant regions of homology with chromosomal genes, despite the effort to minimize such homologies during designing of the cDNA probes for array spotting. Of the 13 misidentified genes, only one was present on more than one slide. Nine of the misidentifications had expression ratios less than 1.9, while only one was greater than 2.5.
|
|
D-mediated transcription and regulation (6). The flagellin of C. acetobutylicum is known to be posttranslationally glycosylated (15), and we accordingly noted upregulation of two glycosyltransferase genes (CAC2186 and CAC2522). CAC3647, annotated as the transitional state gene regulator abrB, is included in cluster S2 and in B. subtilis in category II. PCR primers were designed to amplify a 242-bp fragment from this gene. However, there are two other genes in the C. acetobutylicum genome annotated as abrB, each having a high degree of homology with the CAC3647 PCR product as evaluated by BLASTN (CAC0310, 173 of 202 bp; CAC1941, 155 of 199 bp), and it is impossible to discriminate among transcripts from these three genes. A notable gene of cluster S2 is ftsZ (CAC1693), which is part of a predicted bicistronic operon (CAC1692 and CAC1693). The cell division GTPase FtsZ and its partner FtsA play an essential role in cell division and sporulation in B. subtilis and all eubacteria (7, 13). Of the genes in cluster S2, 72% belong to transcriptional units (operons) with promoter regions predicted to have a 0A box, while 44% have a predicted
F binding site. This is statistically significant compared to the entire genome (52 and 31%, respectively), further suggesting that these genes are regulated by Spo0A.
|
|
Transcriptional analysis of strain M5. Fermentations of WT and M5 cultures in static flasks were used for DNA array analysis. The strains grew at nearly the same rate (Fig. 3, panel IIA), allowing RNA samples (A to I) used for DNA array analysis to be closely paired. The WT culture (Fig. 3, panel IIB) produced typical amounts of solvents (69 mM butanol, 38 mM acetone, and 10 mM ethanol) and acids (44 mM butyrate and 48 mM acetate) for static culture. Strain M5 produced no acetone or butanol (Fig. 3, panel IIC), less ethanol (7 mM) and acetate (25 mM), but significantly more butyrate (79 mM). Samples from nine time points (Fig. 3, panel IIA) were analyzed on 17 DNA arrays. All duplicate arrays were hybridized with reverse-labeled samples (e.g., WT-Cy3/M5-Cy5 and M5-Cy3/WT-Cy5) except for point D, for which only one labeled sample was successfully analyzed. Of the 1,019 ORFs represented on the arrays, 253 genes were determined to be differentially expressed (same criteria as SKO1 analysis) and were clustered by SOM analysis (Fig. 5).
|
F also falls within this cluster. CAP0009 is a two-component response regulator possibly involved in autolysis. There are also two transcriptional regulators with unknown functions: CAP0087 (highly expressed in the WT) and CAP0178.
Chromosomal genes which fall into one of these two clusters (D2 and D3), and to a great extent clusters D6 and D7, are likely to have higher expression in the WT as a result of pSOL1 loss. These genes may therefore play a role in generating the asporogenous phenotype of M5. As previously mentioned, sigF (cluster D2) has persistently higher expression in the WT.
F is the first compartment-specific sigma factor activated by spo0A in B. subtilis. spo0A (CAC2071) is in cluster D7 and is generally expressed to a greater extent in the WT, with the exception of the very first time points. Cluster D7 (Fig. 5) also contains a gene for a two-component signal transduction histidine kinase (CAC3319) with high homology to kinA of B. subtilis, which as discussed above may be involved in initiation of sporulation. Cluster D2 (Fig. 5) contains genes for two additional two-component histidine kinases: cheA (CAC2220), involved in chemotaxis, and CAC1701, involved in general phosphate regulation. Cluster D6 contains cheC, an inhibitor of chemotaxis protein methylation through CheD binding (19).
An important family of genes that show differential expression is that related to stress response. Nine stress-related genes are included in cluster D7 (Fig. 5), the majority having higher expression in M5 during early exponential and late stationary phases, with substantially higher expression in the WT during late exponential and transition phases. It has been reported that the onset of solvent production (not seen in M5) during late exponential and transition phases is preceded or accompanied by induction of the stress response genes (1, 21). Stress genes in cluster D7 include the major stress response genes from the groE operon, groES and groEL (CAC2704 and CAC2703, respectively), and the dnaK operon, hrcA, dnaK, and dnaJ (CAC 1280, CAC1282, and CAC1283). Both operons are negatively regulated by HrcA (1). Also included are the molecular chaperones of the small- and large-heat-shock-protein families, hsp18 (CAC3714) and hsp90 (CAC3315), respectively. Cluster D7 also includes a member of the class III hsp100 heat shock family, clpC (CAC3189), and an associated transcriptional regulator of class III heat shock proteins, ctsR (CAC3192). Additional members of the clp family which are differentially expressed include clpA (CAC1824, cluster D0), clpP (CAC2640, cluster D6), a clpP family serine protease gene (CAC1893), and clpX (CAC2639, cluster D1). Two additional stress-related genes from the lon protease family, lonA (CAC0456, cluster D6) and CAC2637 (cluster D3), show slightly lower expression in M5 during the late stationary and transition phases. Members of both the clp and lon families have been indirectly implicated in both positive and negative regulation of several sporulation-specific sigma factors, including sigF and sigH (14, 22). clpP has been shown to indirectly affect intracellular Spo0A levels by regulating the activity of enzymes responsible for its phosphorylation (17).
Four genes from the fts family, involved in cell division and chromosome segregation through the establishment of cellular asymmetry during sporulation, are differentially regulated. ftsA and ftsZ (CAC1692 and CAC1693, cluster D1) form a predicted operon whose expression appears to be elevated during the late transition phase in M5 but shifts to higher expression in the WT during the stationary phase (differential expression of ftsA is at the 90% confidence level). ftsX (CAC0498, cluster D2) has persistently lower expression levels in M5, while ftsK (CAC3709, cluster D7) expression is lower in M5 through the transition phase of growth and higher in M5 during the late transition phase. The significantly different expression patterns of fts family genes suggest altered cell division and chromosome segregation programs.
Several genes involved in control of DNA replication and topology are also dramatically altered in M5. Genes with DNA gyrase (CAC0006 and CAC0007) and DNA helicase (CAC2262) functions are expressed at a lower level in M5 during the exponential and transition phases with increased expression into the stationary phase (clusters D5 and D6). Transcription of most E. coli operons is sensitive to DNA supercoiling (8). In C. acetobutylicum, DNA topology has been shown to affect the rate of transcription of several acid and solvent formation genes and has been implicated as a possible signal for the onset of solventogenesis (27, 29). In fact, clusters D5 and D6 also contain the acid formation genes pta (CAC1742), ack (CAC1743), buk (CAC3075), and ptb (CAC3076), along with 12 other genes involved in carbon metabolism. Genes involved in glycolysis (except for hexokinase [CAC2613]) are downregulated during exponential growth and subsequently upregulated during stationary phase. Genes responsible for conversion of acetoacetyl-CoA to butyryl-CoA also follow this pattern. The chromosomal thiolase gene (CAC2873), however, is upregulated throughout. Expression of the acetate formation genes remains lower in M5; however, expression of butyrate formation genes is greater in M5 during the stationary phase of growth. Production of butyrate was observed to continue through the entire course of the M5 culture, while acetate production was lower at each stage of the fermentation (Fig. 3, panel II). As expected, all genes involved in solvent formation, including those located on the chromosome, were expressed at a higher level in the WT.
Clusters D5 and D6 have numerous motility and chemotaxis genes, including fliFGHL (flaA operon), fliP, fliY, flgD, and flgK. Cluster D4 has two additional motility genes, a flagellin gene (CAC1555) and fliS (CAC2206). All these genes exhibit lower expression in M5 during early exponential growth with increasing expression through the stationary phase. This is the opposite of the expression pattern of spo0A (Fig. 5) and is in direct agreement with observations for SKO1 and B. subtilis (6).
| DISCUSSION |
|---|
|
|
|---|
This is the first study in which DNA arrays were used for large-scale transcriptional analysis in C. acetobutylicum. A number of both expected and unexpected patterns of gene expression were captured. Many genes known to be directly or indirectly influenced by Spo0A in B. subtilis are similarly regulated in SKO1, including increased expression of the chemotaxis and motility genes, sigF, and spoIIAB. An enrichment of genes with predicted 0A boxes and
F binding sites in clusters containing genes known to be controlled by Spo0A was also noted. While the majority of gene expression patterns in SKO1 are similar to known patterns in B. subtilis, several differences exist. For example, classification of spoVAD as positively controlled by Spo0A is the opposite of that seen in B. subtilis. Upregulation of the glycosyltransferases was unique to C. acetobutylicum. Finally, several genes related to electron transport were discovered to be differentially regulated (different groups of genes in opposite directions).
Several similarities were observed between SKO1 and M5, which have very similar phenotypes (asporogenous and deficient in solventogenesis). As expected, both strains had significantly lower expression of the solvent formation genes. Both strains also had decreased expression of spo0A, sigF, and CAC3319 (kinA). However, significant differences were also noted. For example, in contrast to SKO1, the chemotaxis genes cheA and cheC have significantly lower expression in M5 than in the WT. The fts genes also show significantly different expression patterns. Several unique classes of genes were noted as being differentially regulated in M5 but not in SKO1. The gyrases and DNA helicases were differentially expressed, and their expression pattern appears to be closely linked to expression of genes involved in acid formation and glycolysis; this lends support to their role in regulating C. acetobutylicum metabolism. Finally, expression of the stress response genes in M5 is greatly altered, including regulation of clpP and lonA with known roles in regulation of Spo0A and sporulation specific sigma factors, respectively.
| ACKNOWLEDGMENTS |
|---|
We acknowledge use of the Keck Biophysics Facility at Northwestern University, and we thank Nadereh Jafari from the Microarray Core Facility Center for Genetic Medicine at Northwestern University, Abbott Laboratories for donation of clarithromycin, Marija Tesic for S. cerevisiae DNA, and J. W. Peters and B. Lemon for WT C. pasteurianum.
| FOOTNOTES |
|---|
Present address: DSM, 6401 JH Heerlen, The Netherlands. ![]()
Present address: Erasmus University, 3000 DR Rotterdam, The Netherlands. ![]()
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Appl. Environ. Microbiol. | Infect. Immun. | Eukaryot. Cell |
|---|---|---|
| Mol. Cell. Biol. | J. Virol. | Microbiol. Mol. Biol. Rev. |
| ALL ASM JOURNALS |