Previous Article | Next Article ![]()
Journal of Bacteriology, October 2002, p. 5307-5316, Vol. 184, No. 19
0021-9193/02/$04.00+0 DOI: 10.1128/JB.184.19.5307-5316.2002
Copyright © 2002, American Society for Microbiology. All Rights Reserved.
Arbeitsgruppe Genetik, Fachbereich Biologie/Chemie, Universität Osnabrück, D-49069 Osnabrück, Germany
Received 5 February 2002/ Accepted 24 June 2002
|
|
|---|
|
|
|---|
Most Scr+ enteric bacteria take up and phosphorylate sucrose by a phosphoenolpyruvate (PEP)-dependent sucrose:phosphotransferase system, which generates intracellular sucrose-6-phosphate (reviewed in references 29 and 30). Sucrose-6-phosphate is cleaved by a sucrose-6-phosphate hydrolase (invertase) into D-glucose-6-phosphate and D-fructose, which itself is phosphorylated by an ATP-dependent fructokinase. In Salmonella spp. mobile genetic elements, e.g., the conjugative plasmid pUR400 (36) or the conjugative transposon CTnscr94 (13), confer the ability to utilize sucrose, whereas in Klebsiella pneumoniae a highly similar scr regulon is encoded by the chromosome (38). Such optional genes often map in areas on the chromosome which highly diverge among different strains and which are hypervariable. These areas occur in strain-specific clusters of diverse size; the larger ones are called genomic islands or, to describe more precisely their functions, pathogenicity or metabolic islands, etc. (11, 13, 21, 28).
It was previously reported (6) that the E. coli wild-type isolate EC3132 in contrast to E. coli K-12 is capable of utilizing sucrose. However, growth on 0.2% sucrose was unusually slow, with a doubling time of 20 h. The corresponding csc (mnemonic for chromosomally encoded sucrose catabolism) genes were mapped in the equivalent of the 51-min region of K-12. Furthermore, it was shown that uptake and metabolism of sucrose in EC3132 is non-phosphotransferase system dependent (6, 31), and that instead sucrose is transported into the cells by a sucrose:H+ symporter named CscB (encoded by gene cscB). The transporter resembles other well-studied permeases of the cluster 5 of the major facilitator superfamily (MFS) (19), with the lactose permease (LacY) from E. coli K-12 being a prominent member.
In this report we describe the subcloning and further characterization of the other csc genes: cscK, which codes for a fructokinase; cscA, which codes for an invertase; and cscR, which codes for a csc-specific repressor from EC3132. A comparison of the csc coding region and adjacent genes with the genomic sequences of E. coli K-12 and the recently published sequences of the sucrose-positive, uropathogenic E. coli wild-type O157:H7 (12, 28), in addition to Salmonella enterica serovar Typhimurium (22), referred to herein as S. enterica, revealed a pronounced diversity in this region of the chromosome. Furthermore, we found evidence that the observed diversity was caused by the presence of the gene argW, which codes for an arginine-specific tRNA, which served as an integration site for mobile genetic elements.
Finally, we isolated and characterized mutants with chromosomal and plasmid-borne mutations that grow faster on sucrose than the wild type. Analysis of such adaptational mutations provided evidence that the basal expression level of cscB or the transport activity of the sucrose permease in the wild-type strain was not sufficient for an effective induction of the csc genes at sucrose concentrations below 5 mM. We speculate that the csc genes found in the E. coli strain EC3132 have been transferred "recently," in evolutionary terms, and that they are not completely adapted to their host.
|
|
|---|
Bacterial strains, plasmids, and media. The E. coli K-12 strains and plasmids used in this study are listed in Table 1. Cells were routinely grown either in standard phosphate minimal medium as described previously (6) supplemented with a 0.2% concentration of the indicated carbon sources, in Lennox broth without glucose and calcium ions, or in 2x TY medium as described in the work of Ausubel et al. (3). Utilization of various carbohydrates was screened on MacConkey plates (Difco) containing 1% of the indicated carbon source. The following antibiotics were used at the indicated concentrations: tetracycline, 10 mg/liter; ampicillin, 50 mg/liter; and chloramphenicol, 25 mg/liter. Transductions were carried out with P1 kc essentially as described previously (6).
|
View this table: [in a new window] |
TABLE 1. E. coli strains and plasmids used in this studya
|
|
View this table: [in a new window] |
TABLE 2. Phenotype and Csc enzyme activities of strain EC3132 and derivatives
|
(cscAop-lacZ), pJBL101 was digested with HindIII and the promoterless lacZ gene from plasmid pRU869 (40) was inserted. For
(cscKop-lacZ), pJBL101 was digested with EcoRV; for the fusion in cscB the plasmid was linearized by a partial digestion with HpaI. Both enzymes generate blunt ends. Therefore, the lacZ-carrying HindIII DNA fragment from pRU869 was isolated, blunt ended with Klenow DNA polymerase, and inserted into each csc gene, respectively. The appropriate orientations of all lacZ insertions were controlled by DNA sequencing. To measure repressor-independent, but cAMP-CrpA-dependent regulatory effects, cscR was deleted by digesting all plasmids with KpnI and StuI and recloning the csc genes carrying DNA fragments lacking the first 584 bp of cscR into the vector pSU19 (20). The obtained plasmids were named pAlacZ with the operon lacZ fusion in cscA, pKlacZ with the fusion in cscK, and pBlacZ with the fusion in cscB. Mutation analysis and DNA sequencing. DNA amplification of the csc genes was done according to the method of Saiki et al. (35). For the amplification of the complete csc regulon, the forward PCR primer cscB3 (5'-TTTCCGGTTGAGGAATATGG-3') and the reverse PCR primer cscR2 (5'-GGGCGTTGAGGATATCGTTGGATCTCGCCA-3'), which introduced an EcoRV restriction site (underlined), and the Expand Long Template PCR System from Roche Diagnostics, Mannheim, Germany, were used. Starting with 2 min at 93°C, the reaction profile consisted of 30 cycles of denaturing at 93°C for 15 s, followed by annealing at 50°C for 1 min and extension at 68°C for 4 min in a gradient thermocycler from Biometra Inc., Goettingen, Germany. Beginning with the 10th cycle, the elongation time was extended by 5 s to compensate for the loss of DNA polymerase activity.
The amplification of the isolated csc genes was performed on an air thermocycler (model 1605) from Idaho Technology Inc., Idaho Falls, Idaho, using the following reaction profile: 32 cycles of denaturing at 94°C for 1 s, annealing at 50°C for 1 s, and extension at 72°C for 45 s. The following PCR primer pairs were used: for cscB, cscB3 and cscB4 (5'-GCAGGTAATTCTGCAGCTGAATATTCCATTC-3'), which contained an artificial PstI site (underlined); for cscR, cscR1 (5'-TCGCTCTATGCCTCCCACGGA-3') and cscR2 (5'-GGGCGTTGAGGATATCGTTGGATCTCCGGT-3'), which contained an artificial EcoRV site (underlined); and for the csc promoter-operator regions, cscO1 (5'-AGGCAGTAGCCGCCCGTCTG-3') and cscO2 (5'-CGGCGCTCGTGAAGTTTTGC-3'), respectively. PCR products were directly purified using the Wizard PCR preps DNA purification system (Promega Corp., Mannheim, Germany). All DNA sequencing reactions were performed by the dideoxy chain termination method using the ALFexpress AutoRead or dATP labeling mix sequencing kit from Amersham Biosciences. The nucleotide sequences of both strands were determined after subcloning into the pSU19 (20) or the pGEM-T (Promega Corp.) vector using Cy-5 (5-N,N-diethyltetramethylindo dicarbocyanine)-labeled universal and reverse primers or unlabeled internal csc sequencing oligonucleotides priming about every 250 bp within the genes. Computer analysis was done with DNASIS sequencing analysis software (Hitachi) and by using the BLAST programs and database services provided by the National Center for Biotechnology Information, Bethesda, Md.
Identification of transcriptional start points. Primer extension experiments for the identification of the transcriptional start points upstream of cscK and cscA were performed as described previously (3). Total RNA was isolated from the cscR::cat(Con) derivative LJM115 during the mid-exponential growth phase on Lennox broth without glucose and calcium ions.
Transport and enzyme assays. Transport of [14C]sucrose (15 µM final concentration) and sucrose hydrolase (invertase) and fructokinase activities were measured in exponentially grown cells as described previously (6). Transport activities were calculated from the initial uptake rates. The assay of ß-galactosidase was performed according to the method of Pardee and Prestidge (27).
|
|
|---|
![]() View larger version (39K): [in a new window] |
FIG. 1. Detailed comparative map of the csc-dsd regions of the E. coli strains K-12, O157:H7, and EC3132. Genes are represented by boxes, with their orientations indicated by arrows. Regions of homology between E. coli K-12 and the two other strains are shown on a dark grey background; those between O157:H7 and EC3132 are shown on a hatched background. O157:H7-specific DNA is emphasized on a light grey hatched background. The heading numbers refer to the positions of the genes on the E. coli K-12 map (in minutes) and to the relative kilobase coordinates.
|
DNA sequence analysis of the csc gene cluster revealed four open reading frames: The previously described cscB gene, which codes for a sucrose permease (6), is located immediately downstream of the gene cscK. Both were cotranscribed from the promoter cscKp. cscK encodes a protein of 307 amino acid residues that exhibits high similarities to other fructokinases (e.g., 74% amino acid identity with ScrK from pUR400 [2]). Gene cscA, which codes for a sucrose hydrolase (invertase), is divergently oriented in respect to the cscKB operon. CscA consists of 477 amino acid residues and exhibits high similarities to other pro- and eucaryotic sucrose- or sucrose-6-phosphate-hydrolyzing enzymes. Downstream of cscA an open reading frame was identified for the LacI-GalR-type repressor protein CscR (23% identity with LacI and 26% identity with GalR, respectively). In pJBL103 the coding region for the amino-terminal 112 amino acids of CscR was deleted, explaining the constitutive csc gene expression from this plasmid (Table 2).
A further sequence analysis and a comparison with the equivalent DNA sequence of E. coli K-12 showed that the csc genes of EC3132 partially replaced the dsd gene cluster for the utilization of D-serine, thus explaining the Dsd- phenotype of all csc+ strains. The gene dsdC, which codes for the activator of the dsd regulon, and the 5'-coding region of gene dsdX which codes for a putative D-serine transporter (25), were deleted and replaced by the csc regulon in EC3132. The gene dsdA, which codes for a D-serine-deaminase, is still present but is inactive due to the missing activator DsdC. This particular genotypic exclusion had been observed before (1, 6), but its molecular basis was not clear.
On the opposite end, the csc gene cluster is flanked by the argW gene which codes for an arginine-specific tRNA. Whereas the upstream region of argW in EC3132 is almost identical to the equivalent segments of E. coli K-12 (5) and O157:H7 (12, 28), a comparison of the downstream regions of argW genes from the different E. coli strains indicated several major rearrangements. The existence of these changes suggests that bacteriophages or related genetic elements, which used argW for a site-specific integration into the host chromosome, were responsible for the horizontal gene transfer and stable integration of the csc genes (see Discussion). An overview of the genes in the csc region of EC3132 is given in Table 3.
|
View this table: [in a new window] |
TABLE 3. Genes in the csc region of EC3132d
|
csc'R derivative JM109/pJBL103, was fully constitutive for all tested Csc activities (Table 2). Similar effects were observed for the plasmid-encoded systems: JM109 harboring the wild-type csc genes on pJBL101 showed only low Csc enzyme activities, whereas cells harboring pKJL124 exhibited a 90-fold induction of the invertase and a 4.5-fold induction of the sucrose transport activity. Interestingly, the basal expression level of the CscB activity was increased more than threefold compared to the wild type. A similar result could not be observed for the fructokinase activity, probably because the true uninduced basal expression levels of cells harboring pJBL101 or pKJL124, respectively, were below the detection limits of the test, as for the wild-type strain. However, the fructokinase activity was clearly induced in JM109/pKJL124 in the presence of 0.2% sucrose. Cells harboring pJBL126 in contrast to pKJL124 exhibited a constitutive csc gene expression. Identification of the mutations leading to enhanced sucrose utilization. Constitutive expression of both cscKB and cscA operons could be caused by cscR(Con) mutations inactivating the repressor CscR. To test this hypothesis, the cscR alleles from the wild-type strain EC3132, mutant ECH1, and the mutated plasmid pJBL126 were amplified by PCR and subcloned into the plasmid pGEM-T. The corresponding plasmids were tested for their abilities to complement the cscR::cat mutation of LJM115. LJM115 carrying the wild-type cscR gene on the multicopy plasmid did not show any CscA invertase or sucrose transport activity even in the presence of the inducer sucrose, because of the disproportionately high levels of repressor molecules in these cells. In contrast, both cscR alleles from ECH1 and pJBL126 cloned on the same multicopy vector did not alter the constitutive csc gene expression of LJM115 (data not shown). DNA sequences of both inactive cscR alleles were determined. The cscR mutation of strain ECH1 was found to be a deletion of a single adenosine residue at bp 841 of the open reading frame. This altered gene encodes a 290-residue peptide (of 331 amino acids), in which the last 9 amino acids were changed by the frameshift. The cscR mutation of pJBL126 was identified as a single adenosine insertion at bp 102 of the cscR gene. The altered open reading frame encodes a truncated peptide of 47 residues, in which in addition the last 14 amino acids were altered compared to the wild type.
In contrast to these constitutive mutants lacking an active repressor, the mutants ECB1 and JM109/pKJL124 still exhibited inducible csc gene expression. To map the mutation of pKJL124 more precisely two hybrids between the mutant and the wild-type csc regulon of pJBL101 were constructed using a single NruI site in cscA (Fig. 2). Based on the analysis of the plasmid-encoded phenotypes and the Csc enzyme activities of JM109 harboring the different constructs (Table 2) the mutation could be mapped in the DNA segment, which encoded the cscBK genes and the csc promoter-operator region. The DNA sequence of this fragment was thus determined, and the mutation was identified as a T-to-C transition in the left half of the putative cscKo operator (GTTAAC/GTTAAC). As shown in Fig. 2B this particular palindrome motif is present twice between the two operons and most likely corresponds to the operators cscKo and cscAo.
![]() View larger version (40K): [in a new window] |
FIG. 2. Schematic illustration of the construction of csc hybrid plasmids (A) and nucleotide sequence analysis of the csc promoter-operator region (B). (A) The Csc phenotype for sucrose utilization was tested on MacConkey indicator plates with 1% sucrose (scored as 1+ to 3+, pink to deep-purple colonies). The genes are indicated by boxes, with the gene or promoter orientations shown by arrows. DNA of pJBL101 is represented by dark grey boxes, and DNA of pKJL124 is represented by light grey boxes. The position of mutation in pKJL124 is symbolized by an asterisk. (B) The putative -35 and -10 regions are underlined, as well as the putative ribosome-binding sites (RBS) and the start codons of cscK and cscA. Transcriptional start points (TK) are indicated by letters in boldface italic type, the putative csc operators are boxed, and the putative cAMP-CrpA binding sites are marked with diagonal lines. The mutation of pKJL124 is symbolized by an asterisk.
|
Effects of the Q353H substitution in CscB. CscB shares a high degree of similarity (31.2% identical residues over the entire length) to the lactose permease (LacY) from E. coli (6). However, as shown earlier the transport activities of the sucrose permease were significantly lower compared to LacY, and transport could be measured only after overexpression of the protein, e.g., by using a plasmid-encoded system (31). To test whether the Q353H exchange alone caused the increased transport activity, alleles both from the wild type and from mutant ECB1 were amplified by PCR and subcloned as PstI/EcoRV fragments into the expression vector pTM30 (24). This expression vector provides a regulated tac promoter-operator, a lacIq gene, and a ribosome binding site in an optimal position with respect to the start codon. The expression levels of cscB from these plasmid constructs were sufficient for measuring specific transport activities in JM109, even without induction by IPTG. Addition of IPTG led to an immediate growth inhibition and inactivation of CscB (data not shown). JM109 cells expressing the Q353H derivative of CscB from pTMB1 accumulated sucrose under identical conditions approximately twofold higher (537 ± 17 pmol per min and mg of protein) than cells expressing the wild-type protein from pTM3132 (206 ± 40 pmol per min and mg of protein) (means ± standard deviations) (Fig. 3).
![]() View larger version (19K): [in a new window] |
FIG. 3. Sucrose transport by JM109 harboring cscB expression plasmids. Time courses of active sucrose transport by JM109 expressing wild-type sucrose permease from pTM3132 (triangles) or the Q353H derivative of CscB from pTMB1 (circles) are indicated. Cells were grown aerobically at 37°C until mid-log exponential growth phase, harvested, washed with minimal medium, and immediately tested. The final sucrose concentration in the test was 15 µM. Mean values and standard deviations (error bars) from at least three different experiments are given.
|
lac CyaA- strain MC4100-5 (Table 4). The results confirmed the cAMP-CrpA dependence for both promoters. The low basal activities were enhanced for all four constructs by the addition of 0.5 mM cAMP by a factor of 6 to 8. To ensure that the enhanced ß-galactosidase activities were not caused by an increase of the plasmid copy numbers in the presence of cAMP, the intracellular plasmid DNA concentrations for each culture were simultaneously determined. The semiquantitative analysis revealed no detectable differences (data not shown) that could account for the observed differences in the ß-galactosidase activities. |
View this table: [in a new window] |
TABLE 4. cAMP-CrpA dependence of csc gene expressiona
|
|
|
|---|
Finally, in S. enterica LT2 (gi16763390 [22]) downstream of argW, both the dsd and csc gene clusters are replaced by a cluster of five genes for the transport and utilization of phosphoglycerate not found in strains of E. coli. The sequence homology of the DNAs of the different enteric bacteria starts again close to the glk gene for a glucokinase, which is located about 10 kb downstream of argW in Salmonella and about 42 kb downstream of argW in both E. coli K-12 and O157:H7 strains, respectively. Interestingly, S. enterica LT2 lacks the csc genes, while the dsd gene cluster is located in the equivalent of the 83-min region of E. coli K-12. The dsd and the csc genes thus do not belong to the set of housekeeping or backbone genes but rather belong to the group of optional genes that are highly mobile among the enteric bacteria.
We speculate that the very low efficiency of the csc genes and the corresponding permease CscB and metabolic enzymes indicate a relatively recent acquisition at around the time when the different strains of the enteric bacteria diverged. Therefore, the characterization of mutations which optimize these genes and enzymes in the new host should reveal mechanisms involved in its evolution. The analysis of suppressor mutants with improved growth rates corroborates the hypothesis that the sucrose transport in EC3132 could be the rate-limiting step for the induction of the csc genes. This means that the intracellular inducer concentration in the presence of 0.2% sucrose never reaches a threshold value, which would be necessary for the effective induction of the csc genes. Support for this idea comes from the fact that (as for ECH1 or pJBL126) the majority of adaptational mutations we have analyzed thus far were found in cscR (more than 60% [our unpublished results]). The two other classes of adaptational mutations seem to increase either the basal expression level of cscB (caused by an operator mutation in pKJL124) or the CscB activity (in ECB1), which should lead to a situation in which even at low sucrose concentrations in the medium the threshold value for induction could already be achieved.
Direct proof for the csc operators is still missing. However, several findings argue that the perfect palindrome sequences highlighted in Fig. 2 constitute the csc operators. CscR, like other repressors of the LacI-GalR family (42), possesses an N-terminal helix-turn-helix motif that is responsible for the interaction with the DNA. Essential for the specific recognition of the cognate operator is the second or "recognition" helix that crosses the major groove of the DNA almost parallel to the base pairs (9). By direct modifications of the operator sequence or direct amino acid exchange in the recognition helix of the lac repressor, rules for the interaction of specific amino acid residues with individual bases of the operator were defined (17). According to these rules arginine 20 in the recognition helix of CscR would be specific for a guanine residue in position 6 of the cognate operator, whereas methionine 16 could interact only with a thymine residue in position 4 of the operator. Both nucleotides are present in the putative csc operator sequences (G6T5T4A3A2C1/G'1T'2T'3A'4A'5C'6; numbering of the bases according to the lac operator). These findings confirm the rules of the DNA-protein interaction set up for this regulator family. Furthermore, the mutation found in pKJL124 is exactly located in the palindrome in front of the cscBK operon. This should cause a reduction in the affinity of CscR for the operator and therefore an increase of the basal expression level of cscB, which would result in an increase in the concentration of the intracellular molecular inducer. Sahin-Tòth et al. (32) reported the cloning and sequencing of cscA and the cscA/cscKB promoter-operator region from the sucrose-positive E. coli wild-type strain B-62. The published sequence (gb|AF084030) is almost identical to the sequence of EC3132 except that the equivalent cscKB operator exhibits a C'6-to-T'6 transition in the right half of the palindrome. A doubling time of 48 min on 0.2% sucrose was measured for this strain (39). Therefore, it is tempting to speculate that B-62 has already acquired an adaptation mutation that enables the cells to grow fast on relatively low sucrose concentrations. Finally, the fact that this particular palindrome was found twice in the csc promoter region might indicate a cooperative regulatory effect antagonistically to the two cAMP-CrpA activator binding sites located between the two operons. Further experiments are currently under way to characterize the interaction between CscR and its operators.
An increase of the transport activity of the sucrose permease (as in ECB1) should also lead to an improvement of the induction characteristics under relatively low sucrose concentrations. CscB belongs to cluster 5 of the MFS (19). In many aspects the sucrose permease shares properties with the permeases of the MFS, in particular with LacY. There are, however, some characteristic differences between CscB and the lactose permease. The transport activity of the wild-type sucrose permease, for example, is for unknown reasons about 10-fold lower than that of LacY (6, 31). A further difference between CscB and other permeases of this family is CscB's unusual substrate specificity. Only sucrose (31) and palatinose (6-O-
-D-glucopyranosyl-D-fructofuranose) (our unpublished results) were identified as substrates, as no detectable transport of lactose, melibiose, or other galactosides was observed even at high substrate concentrations (
10 mM). For LacY a detailed secondary structure model has been proposed (16). On the basis of primary sequence alignments of CscB and LacY, a topological model for the sucrose permease with 12 transmembrane helices was proposed and further tested (6, 33, 34; our unpublished data). CscB possesses five out of six highly conserved intramembrane charged residues, which seem to be involved in ligand binding and transport, and three out of four putative intramembrane charge pairs. This indicates that the relative positions of the transmembrane helices and therefore the overall topologies of these permeases are identical. The missing charge pair in respect to LacY (Asp237 in helix VII and Lys358 in helix XI of LacY) was introduced into the wild-type sucrose permease by site-directed mutagenesis of Asn234 (helix VII) and Ser356 (helix XI) (10). Whereas individual replacements of these residues abolished active sucrose transport, simultaneous replacement of Asn234 with Asp or Glu and replacement of Ser356 with Arg or Lys resulted in high activity. Interestingly, when expressed at a low rate, the CscB double mutant Asn234Asp-Ser356Arg was present in the membrane in a significantly greater amount than the wild type (10), which indicates that the import of CscB into the membrane and therefore its activity can be improved. For this reason it is tempting to speculate that the replacement of the Gln353 residue in helix XI by a His residue, which is in close proximity to Ser356, in the fast-growing derivative ECB1 might also cause an enhanced protein import into the membrane and/or an improved protein stability, which would explain the observed increased transport activity. Alternatively, one could speculate that the His353 residue might directly improve sucrose binding properties in the mutant protein of strain ECB1. Further experiments are necessary to test these hypotheses. Interestingly, the G+C content of the cscB gene was only 43.6%, whereas the G+C content for the other csc genes was 52%, which is in the range typical for E. coli. This could mean that cscB might have been the last gene that has become part of the csc regulon and that especially the sucrose transport is still in the process of adaptation.
This work was financially supported by the Deutsche Forschungsgemeinschaft through Sonderforschungsbereiche 171 and 431 and by the Fonds des Verbandes der Chemischen Industrie (VCI).
This work is dedicated to the memory of Pieter W. Postma, a highly acclaimed scientist in the field of bacterial physiology and genetics. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2010 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»