| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
,
Department of Microbiology, Institute of Biomedical Sciences, São Paulo University, São Paulo, Brazil
Received 31 August 2007/ Accepted 5 December 2007
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
The SOS response is extremely important for prokaryotic cells, and therefore, it is widespread among the Bacteria. However, two interesting features have been demonstrated to be highly variable among different bacterial species: the sequence recognized by LexA as the SOS operator and the actual set of genes subject to LexA control (2, 11, 18). The structure of the regulon can vary depending on the organism, but still, a canonical set of genes has been proposed to be LexA regulated in all Proteobacteria studied to date. It is comprised of lexA, recA, ssb, uvrA, and ruvCAB (19).
Several different strategies have been used to identify the genes directly regulated by RecA-LexA. Microarray technology has extensively been used for E. coli (13, 31, 48) to compare the level of induction in noninduced and induced states (UV-mediated DNA damage, lexA Ind– versus lexA Def strains, and mitomycin C-mediated DNA damage). Another successful strategy has been the in silico identification of LexA-regulated genes based on algorithms devoted to the identification of the SOS operator and/or the search of the genome sequence for promoters containing the conserved motif. This strategy helped to expand the array of E. coli genes known to be SOS regulated (21, 33). The use of this type of approach has prompted the in silico characterization of the SOS regulatory circuit in several bacterial species (16, 18, 19) and is constantly expanding our knowledge about the SOS response in models other than E. coli. Nevertheless, most of the in silico analyses of SOS regulons done to date lack extensive biological validation, making the assumptions of gene content less robust and missing important information about differential levels of SOS induction for different genes.
The characterization of the SOS response in other bacteria led to the conclusion that the E. coli SOS genes cannot be assumed to be universally LexA regulated in all bacterial species. In the deltaproteobacteria Bdellovibrio bacteriovorus, for example, some genes considered to be part of the core of the proteobacterial SOS genes (recA, uvrA, ruvCAB, and ssb) are not repressed by the LexA protein (9). Likewise, dissection of the SOS regulon in other species may show different genes involved in the DNA damage response which are not present in the E. coli genome. Recently, we have demonstrated that a three-gene operon encoding a hypothetical protein (ImuA), a protein similar to members of the Y family of DNA polymerases (ImuB), and a second copy of dnaE (DnaE2) is responsible for the DNA damage-inducible mutagenesis in Caulobacter crescentus, a function performed by the UmuDC proteins in E. coli. (25). This SOS-regulated operon is widespread in Bacteria (1) and was recently shown to be part of the SOS-mediated response to the antibiotic ciprofloxacin in Pseudomonas aeruginosa (11), demonstrating the relevance of expanding the knowledge about the SOS regulatory circuit in other models.
In the present work, we report the characterization of the SOS regulon of Caulobacter crescentus. We have performed a computational screening for the LexA binding motif in the whole genome of this organism and found that the SOS operator is in good agreement with the direct repeat GTTCN7GTTC, the same repeat suggested previously to be universal among alphaproteobacteria (19). We have confirmed the functionality of this operator with site-directed mutagenesis experiments of the LexA-regulated imuA promoter and by mapping its position relative to the actual transcriptional start site of two SOS genes. The expression of several genes identified in silico as potentially SOS regulated, by means of the presence of the operator in their putative regulatory regions, was investigated. The assays were performed by comparing the levels of expression of the selected genes in a C. crescentus lexA strain and its parental counterpart. We have been able to identify some unreported genes that are part of the SOS regulon, as well as confirm the inducibility of previously described LexA-regulated genes.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
In order to promote complementation of the phenotypes of the lexA strain, a full-length gene, including the promoter region, was amplified with the lexA5ext2 and lexAre oligonucleotides and cloned in the low-copy-number pMR20 vector, using the restriction sites introduced in the primers. The resulting fragment contains 1,481 bp and includes the whole lexA gene plus 597 bp before the transcription initiation codon and 146 bp after the gene stop codon.
Growth determination experiments. Overnight cultures of the C. crescentus strains were diluted to an initial optical density at 600 nm (OD600) of 0.1 in PYE medium and incubated with constant shaking at 30°C. At each indicated time interval, two aliquots of the cultures were removed. The first one was utilized for the optical density determination, and the other one was subjected to serial dilutions and plated on solid PYE medium for CFU counting, which was carried out 2 days after the plating.
In silico analysis. Ab initio searches for overrepresented DNA motifs were done with the Gibbs Motif Sampler program (60) in regions from –250 bp to +50 bp of the predicted start codons of the current set of known LexA-regulated genes in C. crescentus. A motif width of 19 bp was used as a parameter in these searches to account for the previously reported 15-bp alphaproteobacterial LexA binding motif (19) and to allow for additional conserved positions in its vicinity. A position weight matrix derived from the most abundant motif found in each search was used to score candidate LexA binding sites. This matrix is given by mbi = –ln (fbi/pb), where fbi is the frequency of the base b in position i of the motif and pb is the frequency of base b in the whole genome (55). Genome sequence and gene annotations were obtained from the TIGR database (http://www.tigr.org/cmr/).
Real-time analysis of gene expression. The relative expression levels of the genes belonging to the SOS regulon in C. crescentus were determined by quantitative reverse transcription (RT)-PCR experiments comparing the level of expression in the wild-type strain with that of the strain containing the lexA gene disruption. RNA from exponentially growing cells was extracted with Trizol reagent (Invitrogen) and treated with DNase I to eliminate contaminant DNA. An aliquot of 2 µg of RNA pretreated with DNase I was used as a template for total cDNA synthesis in 20-µl reaction mixtures with random hexamers by use of the Superscript First-Strand synthesis system for RT-PCR (Invitrogen). For quantitative PCR, an amount of cDNA corresponding to 25 ng of input RNA was used in each reaction. Reactions were performed with the Sybr green PCR master mix (Applied Biosystems) and analyzed in the ABI 7500 real-time system. Relative expression levels were calculated as described previously (47), using the rho gene as an endogenous control.
Site-directed mutagenesis of the imuA promoter and β-galactosidase assays. PCR-mediated mutagenesis of the imuA promoter was accomplished using primers with the desired mutation. The introduced mutations led to two base substitutions in PimuA. Two complementary mutagenic primers (see Table S1 in the supplemental material) were used with external primers to amplify fragments of PimuA containing the desired mutation. Both fragments were used in a further reaction with only the external primers to generate full-length fragments containing the mutation. All the PCRs were carried out with the high-fidelity Pfx enzyme (Invitrogen), using the cloned PimuA (pP3213) (25) as a template. After the full-length mutant promoter fragment was produced, it was subcloned in the pGEM-T Easy vector (Promega) and sequenced to ensure that the correct mutation was introduced and that no additional mutations were generated during amplification. The fragment was then subcloned in the pRKLacZ290 plasmid to create a transcriptional fusion with lacZ. Measurements of promoter activity with lacZ transcriptional fusions were performed as described previously (25), both before and 60 min after UV irradiation in exponentially growing cells in PYE medium.
UV irradiation. All the cell irradiations were carried out in rich medium (PYE). A germicidal lamp that preferentially emits UVC (254 nm, dose rate of 3.26 J/m2/s) was used. The UV dose was monitored by a VLX 3W radiometer with a CX-254 monochromatic sensor (Vilber Lourmat, Marne-la-Vallée, France). A UVC dose of 45 J/m2 was used in the β-galactosidase assays described above. This dose was shown previously to yield about 6,000 β-galactosidase units in a lacZ gene fusion with the imuA gene promoter in a wild-type background (25).
Determination of transcriptional start sites by 5' RACE experiments. RNA was extracted from exponentially growing lexA cells with the Trizol reagent. 5' rapid amplification of cDNA ends (RACE) experiments using the 5' RACE system (Invitrogen) were conducted as follows: three primers (GSP1 to GSP3) were designed for each gene. GSP1 was used for gene-specific cDNA synthesis. A poly(T) tail was added to the 5' end of the cDNA with terminal deoxynucleotidyl transferase (Tdt). The tailing reactions were used as templates for PCR with the primers GSP2 and 3' RACE. A nested PCR was performed to increase specificity, using primers GSP3 and AUAP, and the amplification products were cloned in the pGEM-T Easy vector (Promega). Nine clones were sequenced for the CC_2272 5' RACE reactions, and 10 for the imuA 5' RACE reactions. The transcriptional start site was then identified as the base adjacent to the poly(T) tail represented more among the sequenced clones, which meant 70% of the clones for imuA and 66.7% of the clones for CC_2272.
| RESULTS |
|---|
|
|
|---|
|
After a number of iterations of the procedure described above, we obtained the set of genes regulated by LexA shown in Table 2. The promoter regions of the 32 genes that have a twofold expression change in the lexA strain and that are in the first position of the respective putative operon were used in the construction of the final LexA binding site model, whose logo (14) is shown in Fig. 2. See Fig. S1 in the supplemental material for the genomic regions comprising the highest scoring putative LexA sites, obtained in a whole-genome scan of the final box model.
|
|
Gene expression analysis of the SOS regulon. To ascertain which of the genes identified in silico as potentially belonging to the SOS regulon were actually regulated by LexA in vivo, we conducted quantitative RT-PCR assays during each of the iteration steps. The assays were performed by measuring the relative expression levels of the selected genes in the lexA null strain versus the wild-type strain. We used as the endogenous control the rho gene, which encodes the transcription terminator Rho and is not induced in a LexA-dependent fashion, as determined by β-galactosidase assays (data not shown).
We performed the quantitative RT-PCR assays for 44 genes (Table 2). Among these, there were four putative operons, for which we tested all the genes possibly cotranscribed (CC_1926-CC_1927, CC_2332-CC_2333, CC_2879-CC_2880-CC_2881, and CC_3238-CC_3237-CC_3236, the ruvCAB operon). On the other hand, when an SOS operator was found in the noncoding region between two divergently transcribed genes, we performed the assay for both of them in order to establish if the same box could repress both promoters. This was the case for genes CC_1086 and CC_1087, CC_1531 and CC_1532, CC_1927 and CC_1928, CC_2589 and CC_2590, CC_2878.1 and CC_2879, CC_3038 and CC_3039, and CC_0382 and CC_0383 (Table 2; see Fig. S1 in the supplemental material). Among the divergently transcribed genes, we found that CC_3039 and the putative operon CC_3036-CC_3037-CC_3038, CC_1531 and CC_1532, and CC_0382 and CC_0383 were induced in a lexA null background. Some of the putative operons (CC_2332-CC_2333, CC_2879-CC_2880, CC_3036-CC_3037-CC_3038, and the ruvCAB operon) had similar variations of transcriptional levels, supporting the idea that they are arranged in operons that are part of the SOS regulon. In other cases, some of the putative operons presented clearly different levels of transcript (CC_1926 and CC_1927; CC_2879 and CC_2881), suggesting that they are not transcribed as operons.
In total, of the 44 genes tested, 35 showed increased expression in the lexA strain (Table 2), exhibiting at least twofold increased expression in the lexA strain compared to the wild type. Among them, there were some genes known to be part of the canonical SOS response in other proteobacteria, such as recA (CC_1087), lexA (CC_1902), uvrA (CC_2590), ssb (CC_1468), and recN (CC_1983). The ruvCAB operon was proposed to be part of the SOS regulon in all alphaproteobacteria (19), and all three genes (CC_3238, CC_3237, and CC_3236) are in fact induced in the lexA null background. We also observed that imuA, the first gene in the operon devoted to mutagenic DNA repair in C. crescentus, is among the most highly expressed in the lexA strain, confirming our previous results (25). Some of the genes identified in the approach have putative functions related to DNA metabolism, such as CC_2272 (encoding an endonuclease III family protein), CC_0382 (tag gene, DNA methyladenine glycosylase I), CC_2332 and CC_1330 (which exhibit a photolyase domain), and CC_3518 (UvrC-like domain). Most of the genes encode proteins of previously uncharacterized function, here determined for the first time to be involved in the proteobacterial DNA damage response. One of the surprises of this study was the finding that dnaB (a replicative helicase) and the ORF CC_2433 (encoding a conserved hypothetical protein) were downregulated in the strain containing the lexA deletion, in spite of containing SOS boxes with significant scores. This is indicative of a positive direct or indirect regulation of these genes by the LexA protein, a feature that is not commonly found in the SOS regulon of other bacteria, and highlights the importance of gene expression data in the validation of in silico predictions.
In our analysis, we did not observe any significant correlation between the box score and the relative expression obtained from the quantitative RT-PCR experiments (Table 2). This is an indication that, at least in C. crescentus, the level of repression of the LexA-regulated genes does not directly reflect the similarity of their SOS operator to the consensus pattern. Other factors, like intrinsic promoter strength and the positioning of the operator relative to sigma factor binding sites, are likely to contribute to the final level of induction.
In vivo functionality of the SOS operator.
The in silico analysis identified the model shown in Fig. 2 as the potential LexA binding site in C. crescentus, in good agreement with previous studies in several alphaproteobacteria (19). A high-score candidate binding site (Table 2; see Fig. S1 in the supplemental material) is present in the promoter region of the imuA gene, the first gene of an operon involved in mutagenic DNA repair in C. crescentus (25). In order to determine the role of the identified operator in the regulation of PimuA, we have conducted site-directed mutagenesis in the promoter region previously shown to drive high levels of expression after UV irradiation. We have introduced two base substitutions in the potential LexA binding site, as shown in Fig. 3, creating an operator-constitutive (Oc) mutant for the imuA promoter. The activities of both the wild- type and the mutagenized promoters (PimuA and PimuAOc) were measured in transcriptional fusions with the lacZ gene, both in the wild-type and lexA strains. As shown before, PimuA is highly UV inducible. Remarkably, the basal levels of transcription of PimuAOc are much higher than those observed for the wild-type promoter (
10-fold), showing that the operator identified in silico contributes significantly to the repression of imuA expression under physiological conditions (Fig. 3). It is also interesting to note that the levels of β-galactosidase expression achieved with PimuAOc in nonirradiated cells are even higher than those observed when cells carrying the wild-type promoter fusion are exposed to 45 J/m2 of UV light. Nonetheless, transcription from PimuAOc is still slightly stimulated by UV light, suggesting that residual binding of the repressor may still occur. These results confirm that the SOS operator identified in silico in this study and previously by others (19) is indeed responsible for the repression of the SOS genes in C. crescentus. In the lexA strain, both PimuA and PimuAOc show similar levels of basal transcriptional activity, which is not significantly modified after UV irradiation of cells.
|
|
| DISCUSSION |
|---|
|
|
|---|
We have confirmed the motif GTTCN7GTTC as the SOS operator in this organism, in good agreement with previous reports (19). In the imuA promoter, this sequence was shown to be responsible for a high level of repression, as determined by the site-directed mutagenesis experiments. Furthermore, this repression was shown to be mediated by LexA, since the operator is not functional in the lexA strain.
C. crescentus displays the core of SOS-regulated genes in proteobacteria, which is comprised of recA, lexA, ssb, uvrA, and ruvCAB (19). It is important to highlight that the most induced gene in this analysis was lexA, the negative regulator of the regulon. Among the genes shown to be part of the SOS regulon in this bacterium, some are interesting and reveal potentially new pathways that may be induced by DNA damage.
The CC_2272 gene encodes an endonuclease III family DNA glycosylase, responsible for the removal of pyrimidine adducts other than dimers and (6-4) photoproducts, the most representative of pyrimidine lesions created by UV irradiation (24). Another important gene is CC_0382, which encodes the DNA-methyladenine glycosylase I gene (tag). This protein catalyzes the removal of the cytotoxic lesion 3-methyladenine (3meA), induced by alkylation damage (41). To our knowledge, the induction of these two genes represents the first evidence of base excision repair genes regulated by the SOS response. The case of the tag gene is interesting if we consider that bacteria usually possess two DNA-methyladenine glycosylases, Tag and AlkA. The latter is described as being induced by sublethal doses of alkylating agents, while tag is constitutively expressed in E. coli (5). The regulation of tag by the SOS response adds a new layer of complexity in its regulation and in the removal of 3meA from the genome. Further examination of the SOS regulon in other bacterial species will reveal if this is a specific feature of C. crescentus physiology or a more widespread phenomenon.
Some other genes of unknown function, but which are potentially involved in DNA repair activities, were shown to be SOS regulated in C. crescentus. The CC_3518 gene encodes a protein with high similarity to the N-terminal end of UvrC endonucleases. In E. coli, the gene cho (ydjQ) encodes a 295-amino-acid protein with similarity to the N terminus of UvrC (33, 43, 62). Interestingly, the protein encoded by CC_3518, one of the most strongly SOS-upregulated genes in C. crescentus, also bears similarity to UvrC, although it is even smaller than Cho, consisting of only 123 amino acids. A BLAST analysis reveals that CC_3518 orthologs, with similar size, are widespread among bacteria. Another interesting family of genes includes CC_2332 and CC_1330 (35). They are both similar to the radical S-adenosylmethionine superfamily of proteins, which includes the splB gene of Bacillus subtilis that encodes the spore photoproduct photolyase. Possibly, these genes represent a different class of SOS-regulated bacterial photolyase genes. Interestingly, the Mycobacterium tuberculosis gene Rv2578c encodes a homolog of CC_2332 and CC_1330 and has been shown to be part of the SOS regulon in that organism (16), further supporting a role for these genes in the DNA damage response. CC_2332 is in a putative operon with another SOS-regulated gene, CC_2333, which is related to another base excision repair-related protein, uracil-DNA glycosylase (35).
On the other hand, dnaB and CC_2433 were consistently down-regulated in the lexA strain, suggesting that they are actually repressed upon SOS induction. For Rhodobacter sphaeroides, LexA has been proposed to both repress and activate transcription of the recA promoter (59). It is possible that in these two genes, LexA may act as a transcriptional activator. Since the lexA strain is severely affected in cell division and growth, the down-regulation of these genes might just reflect a pleiotropic effect of this gene disruption. However, it is interesting to note that the operators identified in both promoters are corrupted in the direct repeat, lacking one or both of the GTTC units (Table 2).
The case of dnaB is especially interesting, given that in previous reports it was shown to be damage inducible independently of RecA in E. coli (32); in C. crescentus, however, this gene is down-regulated in a LexA-dependent fashion. The implications of this fact are not completely clear. Considering the complex and coordinated cell cycle of C. crescentus, it would not be surprising to find a gene involved in replication initiation being repressed in the presence of DNA damage (simulated by the constitutive expression of the SOS-regulated genes in this lexA background), since it could act as an additional mechanism ensuring that replication initiation would be allowed only after the lesions are removed.
The gene comM (CC_0140), which had already been proposed to be part of the LexA regulon of alphaproteobacteria (19), was also confirmed by this analysis. It was annotated as a Mg2+ chelatase in Brucella suis and Brucella melitensis; thus, this gene could be involved in the regulation of polymerase fidelity synthesis during SOS activation, as has been proposed for members of this class of proteins in E. coli (64). In Haemophilus influenzae, the ComM protein was discovered on the basis of a transformation-deficient mutant that exhibited normal DNA uptake but possessed low transformation and phage recombination efficiencies (27).
Our analysis identified also some genes with known function that have never been associated with the SOS regulon in other organisms. One of these is CC_3225, which encodes a sensory box histidine kinase/response regulator. Two-component signal transduction systems are known to be the major cellular signaling molecules in prokaryotes. They are usually comprised of a sensor histidine kinase and a response regulator that mediates an appropriate cellular response to an endogenous or exogenous signal (46). The CC_2040 gene encodes an ATP-dependent RNA helicase, a member of the DEAD/DEAH family. C. crescentus possesses four members of this family of proteins, but only one seems to be LexA-regulated. These proteins are generally energy motors involved in the conformational change of RNAs, and as such, they are involved in many aspects of RNA metabolism, like transcription, ribosome biogenesis, translation, and RNA degradation (28); the term RNA helicase, however, has to be considered carefully, since for the vast majority of proteins, the exact biochemical activity has not been defined yet (57). We also identified the gene CC_3130, responsible for a glutamine synthetase family protein, as being regulated by LexA in C. crescentus. This enzyme is involved in nitrogen assimilation through the incorporation of ammonia in bacteria (49). It is still not clear how these genes would be involved in the celullar response to DNA damage, and thus further analyses are needed.
The results also indicated SOS regulation of a large number of hypothetical proteins, some of which did not display significant similarity to any other known protein in the microbial genomes sequenced so far. Representatives of this class of genes are some with highly increased expression in the lexA mutant, like CC_0627 (the second-most induced gene in the lexA strain [see Table 2]), CC_1927, CC_3356, CC_1531, CC_1532, CC_0383, and the first gene (CC_2879) of a putative operon of two genes. The CC_3467 gene, which encodes a conserved hypothetical protein related to proteins of unknown function in other bacteria, can also be included in this group. Another class of genes controlled by LexA in C. crescentus is that of the putative transcriptional regulators; this analysis identified CC_1054, CC_3036, and CC_3037 belonging to this group. CC_1054 is a member of the CopG transcriptional repressor family, CC_3036 belongs to the AlgR/AgrA/LytR family of transcriptional regulators, and CC_3037 is a helix-turn-helix-containing protein. Therefore, the indirect effects of SOS induction on global gene expression patterns are likely to occur.
Another interesting feature revealed by this work is the potential existence of an SOS-mediated divisional checkpoint in C. crescentus. In E. coli, the SulA protein inhibits cell division after SOS induction by blocking FtsZ polymerization (4, 54). As a result of this blockage, lexA knockouts are not viable, unless sulA is also disrupted. In contrast, in Bacillus subtilis, lexA knockouts are viable, although cells do show a filamentous aspect and poor growth; this filamentation is dependent on the yneA gene, which is unrelated to sulA (30). The sulA and yneA orthologs are not present in the C. crescentus genome (45) or in many other bacterial genomes (data not shown), and it will be of special interest to determine the genes responsible for this lexA-dependent divisional checkpoint in C. crescentus.
In total, 37 genes were identified as part of the SOS regulon in C. crescentus, including 35 genes that were induced and 2 genes that were repressed. Although the regulons in E. coli (13) and B. subtilis (2) are substantially bigger, the C. crescentus regulon contains more genes than the Pseudomonas aeruginosa (11) and Staphylococcus aureus (12) regulons, which are comprised of 15 and 16 genes, respectively. A table comparing the known SOS regulons of several bacterial species to that of C. crescentus is provided (see Table S2 in the supplemental material). Almost all SOS regulons described so far control the recA and lexA genes, as well as one or more recombination/repair functions and one or more Y family polymerases. We found in this study that C. crescentus is not an exception to this rule. However, some exceptions to this fact are known. In the bacteria "Dehalococcoides ethenogenes" (22), Bdellovibrio bacteriovorus (9), a "Magnetococcus" sp. (23), Petrotoga miotherma (38), Acidobacterium capsulatum (39), Deinococcus radiodurans (44), and Geobacter sulfurreducens (29), recA is not regulated by LexA. There are also lexA genes that are not autoregulated, as in Leptospira interrogans (15). The most dramatic situation has been found in Thermotoga maritima, where neither lexA nor recA is part of the SOS regulon (38); this is not commonly found in bacteria that have not suffered major genome reduction and ended up losing the lexA gene (Chlamydia pneumoniae, Mycoplasma pneumoniae, and Campylobacter jejuni, for example).
In conclusion, this work has further expanded the current knowledge of the prokaryotic responses to DNA damage, revealing a number of new genes which are part of the SOS regulon. These findings are especially relevant in the context of recent efforts focusing on unraveling the SOS response in several different bacteria, since it might play relevant roles in bacterial evolution and the development of antimicrobial resistance (3, 10, 11, 12, 42).
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Published ahead of print on 14 December 2007. ![]()
Supplemental material for this article may be found at http://jb.asm.org/. ![]()
Present address: Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX. ![]()
| REFERENCES |
|---|
|
|
|---|
irradiation. J. Bacteriol. 183:6951-6956.
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Appl. Environ. Microbiol. | Infect. Immun. | Eukaryot. Cell |
|---|---|---|
| Mol. Cell. Biol. | J. Virol. | Microbiol. Mol. Biol. Rev. |
| ALL ASM JOURNALS |