Previous Article | Next Article ![]()
Journal of Bacteriology, July 2002, p. 3823-3833, Vol. 184, No. 14
0021-9193/02/$04.00+0 DOI: 10.1128/JB.184.14.3823-3833.2002
Copyright © 2002, American Society for Microbiology. All Rights Reserved.
Department of Molecular Microbiology and Biotechnology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat-Aviv, Tel Aviv 69978, Israel
Received 7 March 2002/ Accepted 22 April 2002
|
|
|---|
|
|
|---|
Two regions of genes required for human macrophage killing and intracellular multiplication have been discovered in L. pneumophila (reviewed in references 44 and 51). Region I contains 7 genes (icmV, -W, and -X and dotA, -B, -C, and -D) (8, 10, 32, 50), and region II contains 17 genes (icmT, -S , -R, -Q, -P, -O, -N, -M, -L, -K, -E, -G, -C, -D, -J, -B, and -F) (3, 35, 41, 43, 50). Complementation analysis of these genes indicated that they are probably organized as nine transcriptional units (icmTS, icmR, icmQ, icmPO, icmMLKEGCD, icmJB, icmF-tphA, icmWX, and icmV-dotA) (8, 10, 35, 41, 43, 50). Most of these genes were also shown to be required for intracellular growth in protozoan host Acanthamoeba castellanii (45). Fourteen of the Icm and Dot proteins (IcmT, -P, -O, -L, -K, -G, -C, -D, -J, and -B and DotA, -B, -C, and -D) were found to have significant sequence similarity to Tra and Trb proteins from IncI plasmids colIb-P9 and R64 (29, 46). The icm/dot system is believed to serve as a translocation system that delivers effector proteins into host cells (34, 44).
At the present time there is very little information about the regulation of L. pneumophila virulence, as well as about regulatory factors that control the expression of the 24 icm and dot genes. So far, stationary-phase sigma factor rpoS has been shown to be involved in L. pneumophila virulence: a strain with a knockout in this gene lost its ability to grow in protozoan host A. castellanii (20) and was attenuated for intracellular growth in murine bone marrow-derived macrophages (4). However, this gene was found to be dispensable for growth in HL-60-derived human macrophages and in THP-1 cells (20). Another stationary-phase-related factor whose involvement in the regulation of L. pneumophila virulence was suggested was the relA gene product (21), but this gene was found to be dispensable for intracellular growth (55). In addition, the expression of only one out of nine icm::lacZ fusions was reduced in strains containing an insertion in the rpoS or the relA gene (55), indicating that other factors control the expression of the icm and dot genes.
We were interested in identifying DNA regulatory elements that control icm and dot gene expression. To address this goal, we used random and site-directed mutagenesis on the regulatory regions of nine icm and dot genes that were fused to the lacZ reporter. We identified 12 regulatory elements in the upstream region of eight icm and dot genes. Primer extension analysis indicated that seven of these sites constitute the -10 promoter elements of the icm genes; the other sites are expected to serve as binding sites for transcription regulators.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Plasmids used in this study
|
|
View this table: [in a new window] |
TABLE 2. Primers used in this study
|
The plasmids that were used as templates for the sequencing reactions analyzed together with the primer extension reactions are listed in Table 1. Plasmid pGS-Lp-82 was constructed by cloning a HindIII-EcoRV fragment containing the icmW gene, part of the icmV and icmX genes, and the regulatory region between icmV and icmW into the pUC-18 vector.
Site-directed mutagenesis. Site-directed mutagenesis was performed by the overlap extension PCR method (23). For each mutation, two primers that contain the mutation and that overlap one another by 20 bp were designed. The PCR template for all the mutations was the regulatory region of the gene of interest cloned in the pMC1403 vector (Table 1). PCR mutagenesis includes two steps. In the first step two PCR fragments were generated with the following pairs of primers: (i) a primer located on the vector, upstream from the regulatory region (pMC-amp), and one of the primers that contain the mutation and (ii) a primer located in the lacZ gene on the vector (pMC-lac) and the other primer that contains the same mutation on the complementary strand. The resulting two fragments were gel purified and used as templates in the second step, which includes a third PCR using the two primers located on the vector. The resulting PCR product was digested with BamHI and cloned into the pGS-lac-02 vector. All the mutations were confirmed by sequencing the whole regulatory region. The changes made were always A to C, T to G, and C to A.
PCR random mutagenesis. Random PCR mutagenesis was performed essentially as described before (18). It was performed on the icmF regulatory region by using the icmF reverse primer containing a BamHI site (icmF-Bam) and the icmF forward primer containing an EcoRI site (icmF-EI) (Table 2). The PCR mixture included either 50 mM dATP, 0.5 mM MnCl2, and 9.5 mM MgCl2 or 50 mM dCTP, 0.5 mM MnCl2, and 5.5 mM MgCl2, along with other deoxynucleoside triphosphates at 0.2 mM, 100 ng of template (pGS-reg-F2), 25 pmol of each primer, and 1 U of Super-Therm DNA polymerase. PCR under these conditions results in 1 to 3 bp changes per 100 bp of amplicon. The PCR product was digested with EcoRI and BamHI and cloned into the pGS-lac-02 vector digested with the same enzymes. The resulting colonies were screened on MacConkey plates, suspected colonies were isolated, and the mutated regulatory regions from some of them were sequenced.
ß-Galactosidase assays. ß-Galactosidase assays were performed as described elsewhere (33). L. pneumophila strains were grown on ABCYE (ACES buffered charcoal yeast extract) plates containing chloramphenicol for 48 h. The bacteria were scraped off the plate and suspended in AYE (ACES yeast extract) broth, and bacterial optical density at 600 nm (OD600) was calibrated to 0.1 in AYE. The resulting cultures were grown on a roller drum for about 18 h until reaching an OD600 of about 3.8 (stationary phase). To test the levels of expression at exponential phase, the cultures were diluted to an OD600 of 0.1 and grown for an additional 6 to 7 h until reaching an OD600 of about 0.7 (exponential phase). The assays were done with 50 or 100 µl of culture. The substrate for lacZ hydrolysis was o-nitrophenyl-ß-D-galactopyranoside.
Preparation of RNA. RNA preparation was performed as described elsewhere (42). The L. pneumophila JR32 strain was grown on ABCYE plates for 48 h. The bacteria were scraped off the plate and suspended in AYE broth, and the bacterial OD600 was calibrated to 0.1 in AYE. The resulting cultures were grown on a roller drum for about 18 h until reaching an OD600 of about 3.8. Then the cultures were diluted to OD600 of 0.1 in AYE and grown for additional 6 to 7 h until reaching an OD600 of about 0.7. These bacteria were used for RNA preparation. The cell pellets derived from 30 ml of bacteria were suspended in 3 ml of STE buffer (10 mM Tris [pH 7], 100 mM NaCl, 1 mM EDTA), and an equal volume of aqueous 95% hot phenol (65°C) was added. This mixture was shaken vigorously at 65°C for 10 min and centrifuged for 10 min at 11,000 x g. The aqueous phase was removed, and the phenol phase was reextracted with 3 ml of STE buffer. After centrifugation, the aqueous phase was removed and combined with that already collected, and the solution was extracted twice with hot phenol, once with phenol-chloroform-isoamyl alcohol (25:24:1), and twice with STE-saturated ether. The RNA was precipitated with 2 volumes of ethanol. After approximately 12 h of incubation at -20°C, the precipitate was recovered by centrifugation at 11,000 x g for 10 min. The RNA concentration was determined by measuring OD260.
Primer extension analysis.
The primers used for the analysis are listed in Table 2. A reverse transcriptase (RT) reaction was performed in RT buffer (50 mM Tris-Cl [pH 8.3], 8 mM MgCl2, 40 mM KCl, 1 mM dithiothreitol) containing deoxynucleoside triphosphates (1 mM concentration [each] of dATP, dCTP, dGTP, and dTTP), RNasin (10 U), 5 pmol of labeled ([
-32P]ATP) primer, and 5 U of avian myeloblastosis virus RT (Promega, Madison, Wis.). A total of 10 µg of the RNA preparation was used in a total reaction mixture volume of 6 µl. The reaction mixture was incubated for 5 min at 45°C and then incubated at 37°C for 30 min. After the addition of 4 µl of sequencing stop buffer, the samples were heated for 5 min at 90°C before being loaded on a sequencing gel.
DNA sequencing. The sequence was determined by the dideoxy termination method (40) using the SequiTherm cycle sequencing kit (Epicentre, Madison, Wis.). The templates for the sequencing reactions were plasmids containing the regulatory regions of the genes tested (Table 1). Sequencing was determined with the same primers used for the primer extension analysis.
|
|
|---|
Analysis of the expression levels of icm::lacZ fusions in L. pneumophila. It has been suggested that L. pneumophila pathogenesis is coordinated with entry into stationary phase (4, 11, 21). To test whether icm and dot genes that were shown to be required for intracellular growth are regulated in correlation with growth phase, we compared the expression levels of the nine icm::lacZ fusions at exponential and stationary phases.
Even though all the icm and dot genes were shown to be required for intracellular growth and host cell killing, their expression levels were found to be different from one another. As can be seen in Fig. 1, the icm::lacZ fusions can be divided into two groups according to their expression levels in the wild-type L. pneumophila JR32 strain. Four icm::lacZ fusions (icmR::lacZ, icmF::lacZ, icmV::lacZ, and icmW::lacZ) were found to have high levels of ß-galactosidase expression (above 1,000 Miller units [MU]), and two of them had higher levels of expression at stationary phase than at exponential phase (Fig. 1A). In contrast, five icm::lacZ fusions (icmT::lacZ, icmP::lacZ, icmQ::lacZ, icmM::lacZ, and icmJ::lacZ) were found to have low levels of ß-galactosidase expression (below 1,000 MU), and three of these fusions had higher levels of expression at stationary phase than at exponential phase (Fig. 1B). Only one of these fusions (icmP::lacZ) was shown to have lower levels of expression in strains containing an insertion in the genes coding for RpoS or RelA than in wild-type strains (55); the other fusions (icmT::lacZ, icmM::lacZ, icmR::lacZ, and icmF::lacZ), which had higher levels of expression at stationary phase than at exponential phase, are probably regulated by other factors.
![]() View larger version (21K): [in a new window] |
FIG. 1. Expression of icm::lacZ fusions in L. pneumophila at exponential and stationary phases. The expression of the nine icm::lacZ fusions (for the genes icmT, icmR, icmQ, icmP, icmM, icmJ, icmF, icmW, and icmV) in wild-type strain JR32 at exponential (gray) and stationary phases (white) was examined. ß-Galactosidase activity was measured as described in Materials and Methods. Four of the icm::lacZ fusions were found to have high ß-galactosidase activities (A), and five were found to have low activities (B). The results (Miller units [M.U.]) are the averages ± standard deviations (error bars) of at least three different experiments.
|
![]() View larger version (30K): [in a new window] |
FIG. 2. Expression of icm::lacZ fusions in E. coli at exponential and stationary phases. The expression of the nine icm::lacZ fusions (for the genes icmT, icmR, icmQ, icmP, icmM, icmJ, icmF, icmW, and icmV) in E. coli strain MC1061 at exponential (gray) and stationary phases (white) was examined. ß-Galactosidase activity was measured as described in Materials and Methods. Five of the icm::lacZ fusions were found to have high ß-galactosidase activities (A), and four were found to have low activities (B). The results (Miller units [M.U.]) are the averages ± standard deviations (error bars) of at least three different experiments.
|
Identification of -10 promoter elements of icm genes that have low expression levels. Careful analysis of the upstream regulatory regions of four icm genes (icmT, icmP, icmQ, and icmM) that were found to have low levels of expression in L. pneumophila, revealed a 6-bp (TATACT) putative consensus sequence (Fig. 3A). The distances of this sequence from the first ATG codons of the corresponding genes (32 to 74 bp), as well as the sequence itself, suggest that it might serve as the -10 promoter element for these genes. This sequence differs by only 1 nucleotide from the -10 promoter element recognized by E. coli vegetative sigma factor RpoD (TATAAT) (53). To examine the importance of this sequence in the expression of these icm::lacZ fusions, 2 nucleotides in each of these sequences were mutated and the effect of the change was examined. As can be seen in Fig. 3B, the change in either of these two positions (the second or third nucleotide of the putative consensus sequence) caused a dramatic reduction in the levels of expression of these icm::lacZ fusions. The mutation in the second position of the putative consensus (mutation 1 in Fig. 3) almost eliminated the expression of the lacZ reporter, and the levels of ß-galactosidase observed were close to the expression level of the promoterless lacZ vector, the negative control (pGS-lac-02) (data not shown).
![]() View larger version (27K): [in a new window] |
FIG. 3. Analysis of the regulatory regions of icm::lacZ fusions that have low levels of expression. (A) Sequences of the regulatory regions of icmP, icmT, icmQ, and icmM are shown. The putative consensus sequence is in boldface, and the distances to the first ATG are indicated. The nucleotides representing the 5' end of the mRNA are in boldface and underlined. Additional sequences that form potential inverted repeats with the consensus found are boxed. Arrows 1 and 2, nucleotides mutated. The changes made were always A to C and T to G. (B) Expression of icmT::lacZ, icmP::lacZ, icmQ::lacZ, and icmM::lacZ fusions and of two mutant versions of each fusion in L. pneumophila at exponential phase was determined. For each of the fusions, results for the wild-type (WT) regulatory region and for regulatory regions containing mutations in positions 1 and 2 are shown. ß-Galactosidase activity was measured as described in Materials and Methods. The data are the averages ± standard deviations (error bars) of at least three different experiments. (C) Mapping of the 5' end of the icmP transcript by primer extension. Primer extension was performed as described in Materials and Methods with a primer complementary to the 5' end of the icmP gene. G, A, T, and C (top), products of the sequencing reaction obtained by using the same primer. The sequence presented is that of the sense strand. Arrow, nucleotide representing the 5' end of the mRNA. Boldface and underlining are as described for panel A.
|
All the positions in the TATACT consensus are required for maximal expression. To determine if all the positions in the TATACT consensus are required for maximal expression, we performed a careful analysis of the icmQ consensus sequence (Fig. 4). Eight mutations in the icmQ consensus sequence were constructed (Fig. 4A): the 6 nucleotides of the TATACT consensus as well as 1 nucleotide on both sides of it were mutated. As can be seen very clearly in Fig. 4B, all six positions that are expected to be part of the consensus sequence were found to be required for maximal expression of the icmQ::lacZ fusion. The mutations upstream and downstream from the consensus sequence did not reduce the expression of the icmQ::lacZ fusion, and they determine the boundaries of the consensus.
![]() View larger version (21K): [in a new window] |
FIG. 4. Analysis of the regulatory region of the icmQ gene. (A) The sequence of the regulatory region of icmQ is shown (boldface, putative consensus sequence). The nucleotide representing the 5' end of the mRNA is in boldface and underlined. Arrows -1 to 7, nucleotides mutated. The changes made were always A to C, T to G, and C to A. (B) The expression of the icmQ::lacZ fusion (WT) and of the eight mutants from panel A in L. pneumophila at exponential phase was measured. ß-Galactosidase activity was measured as described in Materials and Methods. The results are the averages ± standard deviations (error bars) of at least three different experiments. (C) Mapping of the 5' end of the icmQ transcript by primer extension. Primer extension was performed as described in Materials and Methods with a primer complementary to the 5' end of the icmQ gene. G, A, T, and C (top), products of the sequencing reaction obtained by using the same primer. The sequence presented is that of the sense strand. Arrow, nucleotide representing the 5' end of the mRNA. Boldface and underlining are as described for panel A.
|
icmR contains at least three regulatory elements. Analysis of the icmR regulatory region identified two identical sequences (AAGATATATT) located next to each other. Because the 6 bp (TATATT) located at the 5' end of this sequence differ by only 1 nucleotide from the -10 promoter element described above (TATACT), we thought that these two sites might serve as regulatory elements of the icmR gene. To test this hypothesis, single as well as double mutations in the icmR regulatory region were constructed (Fig. 5A). As can be seen in Fig. 5B, the effect of the mutations in the icmR regulatory region was less pronounced than the effect of mutations in icmT, icmP, icmQ, and icmM. However, the combination of the two mutations at the third position (mutation 6 in Fig. 5B) caused a reduction of about 50% in the expression of the icmR::lacZ fusion, which was a more severe reduction than that due to each individual mutation at this position (mutations 2 and 4 in Fig. 5B).
![]() View larger version (23K): [in a new window] |
FIG. 5. Analysis of the regulatory region of the icmR gene. (A). The sequence of the regulatory region of icmR is shown, and the distance from the first ATG is indicated. The two putative consensus sequences are boxed, and the part of the consensus sequence (TATATT) similar to the consensus sequences found in the icmT, icmP, icmQ, and icmM genes is also indicated. The nucleotide representing the 5' end of the mRNA is in boldface and underlined, and the -10 promoter element is in boldface. Arrows 1 to 6, nucleotides mutated. The changes made were always A to C and T to G. (B) The expression of the icmR::lacZ fusion (WT) and of the six mutants from panel A in L. pneumophila at exponential phase was measured. ß-Galactosidase activity was measured as described in Materials and Methods. The data are the averages ± standard deviations (error bars) of at least three different experiments. (C) Mapping of the 5' end of the icmR transcript by primer extension. Primer extension was performed as described in Materials and Methods with a primer complementary to the 5' end of the icmR gene. G, A, T, and C (top), products of the sequencing reaction obtained by using the same primer. The sequence presented is that of the sense strand. Arrow, nucleotide representing the 5' end of the mRNA. Boldface and underlining are as described for panel A.
|
Identification of a regulatory elements of icm genes that have high expression levels. icmW, icmV, and icmF are the only three genes that were found to have high levels of expression in both L. pneumophila and E. coli (Fig. 1A and 2A). To find regulatory elements that are involved in the expression of these genes, the regulatory region of the icmF transcriptional unit was randomly mutated by PCR. Four clones that each contain a single mutation were identified, and all mutations mapped close to one another. Three mutations (from three independent PCRs) were found at the same position, and a fourth mutation was found five nucleotides downstream from it (Fig. 6A). These mutations caused a severe reduction in the level of expression of the icmF::lacZ fusion (data not shown).
![]() View larger version (24K): [in a new window] |
FIG. 6. Analysis of the regulatory region of icm::lacZ fusions that have high expression levels. (A) Sequences of the regulatory regions of icmW, icmV, and icmF. The distances to the first ATG are indicated. The region of sequence similarity is boxed, and the sequence (TATAGT) similar to those of the -10 promoter elements of the icmT, icmP, icmQ, and icmM genes is indicated. *, nucleotides in the icmF regulatory region in which random mutations were identified; arrows 1 and 2, nucleotides mutated by site-directed mutagenesis. The changes made were always A to C and T to G. (B) The expression of icmW::lacZ, icmV::lacZ, and icmF::lacZ fusions as well as of two mutant versions of each of these fusions in L. pneumophila at exponential phase was measured. For each of the genes, the wild-type (WT) regulatory region and the regulatory regions containing mutations in positions 1 and 2 are shown. ß-Galactosidase activity was measured as described in Materials and Methods. The data are the averages ± standard deviations (error bars) of at least three different experiments.
|
![]() View larger version (25K): [in a new window] |
FIG. 7. Analysis of the icmW and icmV regulatory regions. (A) The sequences of the regulatory regions of icmW and icmV are shown. These two genes are transcribed back to back, and the region is shown as double stranded. The 9-bp consensus sequence is boxed, and the internal 6-bp sequence (TATAGT) similar to the promoter elements found in the icmT, icmP, icmQ, and icmM genes is indicated. The nucleotides representing the 5' end of the mRNA are in boldface and underlined, and the -10 promoter elements are in boldface. Arrows 1 to 4, nucleotides mutated. The changes made were always A to C and T to G. (B) The expression of icmW::lacZ, and icmV::lacZ fusions as well as four mutant versions of each of these fusions in L. pneumophila at exponential phase was measured. For each of the genes, the wild-type (WT) regulatory region and the regulatory regions containing mutations in positions 1, 2, 3, and 4 are shown. ß-Galactosidase activity was measured as described in Materials and Methods. The data are the averages of at least three different experiments. (C) Mapping of the 5' end of the icmV transcript by primer extension. Primer extension was performed as described in Materials and Methods with a primer complementary to the 5' end of the icmV gene. G, A, T, and C (top), products of the sequencing reaction obtained by using the same primer. The sequence presented is that of the sense strand. Arrow, nucleotide representing the 5' end of the mRNA. Boldface and underlining are as described for panel A.
|
To determine the involvement of the sites mentioned above in the expression of icmV and icmW, four additional mutations in this region were constructed and their effect on the expression of these fusions was determined (Fig. 7B). In icmW, mutations in both sites (CTATAGTAT and TATACT) result in dramatic reduction in ß-galactosidase activity. In icmV, the mutations in the CTATAGTAT putative consensus had a minor effect on the expression of the lacZ gene but the change in the third position of the TATATT putative consensus reduced the expression to less then 20%.
To determine which of these sites, if any, constitute the -10 promoter elements of the icmV and icmW genes, the transcription start sites of both genes were determined, and the primer extension analysis of the icmV gene is presented in Fig. 7C. This analysis clearly indicates that TATACT in icmW and TATATT in icmV constitute the -10 promoter elements of these genes. The 9-bp consensus sequence (CTATAGTAT) identified probably serves as a binding site for a transcription regulator.
|
|
|---|
It has been suggested that two major stationary-phase-related factors (RpoS and RelA) are involved with L. pneumophila pathogenicity (4, 11, 21). However, the RpoS sigma factor was shown to be dispensable for intracellular growth in HL-60-derived macrophages and THP-1 cells (20), and the RelA regulatory factor was shown to be dispensable for intracellular growth in both amoebae and HL-60-derived human macrophages (55). In contrast, the icm and dot genes were shown to be required for intracellular growth in all the hosts examined (HL-60- and U937-derived human macrophages [41, 52], murine bone marrow-derived macrophages [50], and protozoan hosts A. castellanii [45] and Dictyostelium discoideum [47]). In addition, analysis of the effect of a strain containing an insertion in the rpoS gene (20) or the relA gene (55) on the expression of the nine icm::lacZ fusions described revealed that the expression of only one fusion (icmP::lacZ) was moderately reduced (55). This might explain the increase in ß-galactosidase activity observed with the icmP::lacZ fusion at stationary phase (Fig. 1), but it cannot explain the results for the icmT::lacZ, icmR::lacZ, icmM::lacZ, and icmF::lacZ fusions, in which higher levels of ß-galactosidase at stationary phase than at exponential phase were observed as well (Fig. 1).
One approach to identify regulatory factors that control the expression of a certain gene is to identify the DNA regulatory elements of the gene of interest and then to find regulatory factors that interact with these sites. To gain information about the DNA regulatory elements that control icm and dot gene expression, we analyzed their upstream regulatory regions by extensive site-directed and random mutagenesis as well as primer extension analysis. We were able to identify 12 DNA regulatory elements that are involved in the regulation of eight icm genes. Seven of these sites constitute the -10 promoter elements of the icm genes, and the other five sites (two in the icmR regulatory region and one each in the icmV, icmW, and icmF regulatory regions) probably serve as binding sites for regulatory proteins. The sequences of all these regulatory elements are summarized in Fig. 8.
![]() View larger version (30K): [in a new window] |
FIG. 8. Sequence alignment of the icm regulatory regions. The sequences are aligned according to the -10 promoter elements (boldface). The transcription start sites are in boldface and are underlined. *, mutations (29 in all). In icmP, icmT, icmQ, and icmM, sequence homologies between pairs of genes are boxed. In icmR, the 10-nucleotide direct repeat is boxed. In icmV, icmW, and icmF, the 9-nucleotide regulatory element is boxed. Arrows (indicating the center of symmetry), potential inverted repeats. The distance to the first ATG of each gene is indicated. CONC., consensus.
|
icmJ is the only icm gene that was found to have low expression levels, but the TATACT consensus was not found in its regulatory region (even when a 300-bp regulatory region was cloned upstream of the lacZ gene, the same low level of expression was observed [data not shown]). A potential site that has 2 nucleotides different from the consensus (TATCTT) was the closest match found, but, when the second and third nucleotides of this site were mutated, no change in the level of expression of the icmJ::lacZ fusion in L. pneumophila or E. coli was observed (data not shown).
icmR is the only gene that was found to have higher levels of expression in L. pneumophila than in E. coli. This gene was found to contain a -10 promoter element similar to those of icmT, icmP, icmM, and icmQ, as determined by primer extension (Fig. 5). In addition, we identified two identical regulatory elements (AAGATATATT) in the upstream region of this gene (Fig. 8) and showed that they play a role in icmR expression. These sites might constitute a binding site for an activator missing in E. coli. The 10-bp consensus sequence that was found in the icmR regulatory region was not found in any of the other icm regulatory regions. In addition, its -10 promoter element differs from all the other -10 promoter elements identified (the last nucleotide of the icmR promoter is G instead of T; Fig. 8). This result is in agreement with the observation that the icmR::lacZ fusion is the only fusion more highly expressed in L. pneumophila than in E. coli. This might indicate that the icmR gene is subjected to a unique regulation not present in the other icm genes.
By random mutagenesis and sequence alignment a 9-bp regulatory element (CTATAGTAT) (Fig. 8) was identified in the upstream regions of the three genes (icmF, icmV, and icmW) that were found to have high expression levels in both E. coli and L. pneumophila. This site was found to have an effect on the expression of these genes, but primer extension analysis indicated that it does not serve as a -10 promoter element. The -10 promoter elements that were determined by primer extension were found to be similar to the -10 promoter elements of the other icm genes (Fig. 8). The -10 promoter element and the regulatory elements identified in icmV and icmW were found to overlap one another in both genes and between the two genes (Fig. 7A). This organization might indicate that these two genes are subjected to complicated regulation.
As can be seen in Fig. 8, all the sites that were identified as -10 promoter elements have extensive homology to one another and are probably recognized by one of the L. pneumophila sigma factors. As most of the genome of L. pneumophila has already been sequenced (http://genome3.cpmc.columbia.edu/
legion/index.html), we looked for all the genes that have homology to known sigma factors and found that L. pneumophila contains homologs to at least six sigma factors (RpoD, RpoH, RpoF, RpoE, RpoS, and RpoN). The promoter sequence recognized by two of these sigma factors in L. pneumophila is already known (2, 22). RpoH and RpoF recognize sequences similar to the ones recognized by their E. coli homologs (CTTGAAA[11 to 16]CCCATnT for RpoH [n = A, G, T, or C] and TAAA[15]GCCGATAA for RpoF). These sequences are different from the ones found in the icm gene promoter regions, which might indicate that these two sigma factors are not involved in the recognition of the icm promoters. The sequence recognized by the L. pneumophila RpoE and RpoN sigma factors is not known. But it was shown that both the E. coli RpoE and its Pseudomonas aeruginosa homolog recognize the same promoter sequence (16) (GAACTT[16 to 17]TCTGA), and the sequence recognized by the RpoN sigma factor is known for many bacteria (TGGCAC[5]TTGC) (6). Because all these bacteria are evolutionarily closely related to one another (they are all
purple bacteria), it is most likely that these two sigma factors recognize similar promoter sequences in L. pneumophila. However, as these sequences were not found in the regulatory regions of the icm genes, we concluded that neither of these sigma factors is involved in the recognition of the icm promoters. Both the vegetative sigma factor, RpoD, and the stationary-phase sigma factor, RpoS, of E. coli recognize very similar -10 promoter elements (TATAAT for RpoD and CTATACT for RpoS) (53). This sequence is also similar to the sequence of the -10 promoter element that we found in the icm gene regulatory regions (TATAYT). However, as we recently demonstrated (55), a knockout of the rpoS gene moderately influenced the expression of only one (icmP::lacZ) out of the nine icm::lacZ fusions described. Taking all this information together we believe that the -10 promoter element of the icm and dot genes is recognized by the vegetative sigma factor (RpoD) of L. pneumophila. Further support for this assumption comes from the analysis of the icmQ -10 promoter element (Fig. 4). This analysis revealed that the first 2 nucleotides and the last nucleotide of this -10 promoter element are more important for expression then the other nucleotides. A similar situation was also found with the E. coli -10 promoter element of the vegetative sigma factor (31).
It is well known that promoters recognized by the vegetative sigma factor contain a conserved -35 promoter element in addition to the conserved -10 promoter element (53). Examination of the sequence located at the region where the -35 promoter element should have been located revealed that no obvious consensus sequence can be found in this region (beside the sequences found in the icmM and icmQ regulatory regions) (Fig. 8). However, this is not the first case where a -10 promoter element is present and a -35 element is missing. For two other bacteria, Helicobacter pylori (17) and Agrobacterium tumefaciens (14), a similar situation, which also involve virulence-related genes, was described. In A. tumefaciens it is known that a two-component regulatory system (virA and virG) is involved in the expression of the virulence-related genes (48).
However, two additional sites might be involved in the expression of genes beside the -10 and -35 promoter elements. An extended -10 promoter element was described for several systems (7, 12, 30). The sequence of the extension is composed of 2 nucleotides (usually TG), located 1 bp upstream from the -10 promoter element, and this sequence was shown to interact with region 2.5 of the vegetative sigma factor (5). A sequence identical to that of the known extended -10 promoter element was found only in the icmR promoter (Fig. 8), but it might be that in L. pneumophila other sequences constitute an extended -10 promoter that functions instead of a -35 promoter element in the icm genes. A second sequence that was found to be part of several promoters recognized by the vegetative sigma factor is called the UP element (19, 37). This sequence was found to be located upstream from the -35 promoter element in the region between nucleotides -59 and -42 with respect to the transcription start site, and it is an AT-rich sequence (AAA[A/T][A/T]T[A/T]TTTTNNAAAA) (15) that interacts with the alpha subunit of RNA polymerase (36). No sequence similar to the one presented was found in any of the icm promoter regions.
It is highly possible that additional regulatory elements, like the sequences described, that form potential inverted repeats with the -10 promoter element (Fig. 8) participate in the regulation of the icm genes. In addition, since we used translational fusions (for each gene, the codons for the first seven amino acids were included in the fusion) to determine the expression levels of the different icm genes, we cannot rule out contributions by posttranscriptional events to the differences in the expression observed.
We believe that the icm and dot genes are subjected to complicated regulation, which is required for the correct and successful function of their gene products. The fact that genes that are part of the same virulence system are expressed at different levels and contain similar as well as different DNA regulatory elements might indicate that a regulatory network that includes factors that respond to different environmental and growth conditions participates in their regulation. We began here to explore this regulatory network by finding 12 regulatory elements involved in the regulation of eight icm and dot genes and operons. The identification of these sites will allow us to find additional sites and regulatory factors that respond to changes in environmental signals and growth conditions and that participate in the regulation of the icm/dot virulence machinery.
The research was supported by the Charles H. Revson Foundation of the Israel Science Foundation (grant 45/00). G. Segal was supported by the Alon fellowship awarded by the Israeli Ministry of Education.
|
|
|---|
E-dependent promoters (sigmulon) in Pseudomonas aeruginosa and implications for inflammatory processes in cystic fibrosis. J. Bacteriol. 184:1057-1064.
28 of Legionella pneumophila restores flagellation and motility to an Escherichia coli fliA mutant. J. Bacteriol. 179:17-23.
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»