Previous Article | Next Article ![]()
Journal of Bacteriology, August 2003, p. 4806-4815, Vol. 185, No. 16
0021-9193/03/$08.00+0 DOI: 10.1128/JB.185.16.4806-4815.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
Laboratory of Sialobiology, Department of Pathobiology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61802,1 Institut de Biologie Physico-Chimique (UPR9073-CNRS), 75005 Paris, France2
Received 27 March 2003/ Accepted 15 May 2003
|
|
|---|
30 bp covering the promoter region. Native (nondenaturing) polyacrylamide gel electrophoresis, mass spectrometry, and chemical cross-linking indicated that NanR forms homodimers in solution. The region protected by NanR contains three tandem repeats of the hexameric sequence GGTATA. Gel shift analysis with purified hexahistidine-tagged or native NanR detected three retarded complexes, suggesting that NanR binds sequentially to the three repeats. Artificial operators carrying different numbers of repeats formed the corresponding number of complexes. Among the sugars tested that were predicted to be products of the nan-encoded system, only the exogenous addition of sialic acid resulted in the dramatic induction of a chromosomal nanA-lacZ fusion or displaced NanR from its operator in vitro. Titration of NanR by the nan promoter region or artificial operators carrying different numbers of the GGTATA repeat on plasmids in this fusion strain supported the binding of the regulator to target DNA in vivo. Together, the results indicate that GGTATA is important for NanR binding, but the precise mechanism remains to be determined. |
|
|---|
|
|
|---|
Construction of the nanA-lacZ translational fusion. A 438-bp PCR fragment was amplified with oligonucleotides Nan1 and Nan2 (26), generating a DNA fragment including 232 bp with the TAA stop codon from the 3' end of nanR, the 121-bp nanR-nanA intergenic region, and 85 bp of nanA that was inserted into the SmaI site of the fusion vector pRS/NM482 (pRS414 with its cloning region replaced by the in-phase codons of pNM482) to produce a translational nanA-lacZ fusion. The fusion gene was transferred to bacteriophage lambda by in vivo recombination as described previously (33) and used to lysogenize JM101 and the nanR6 derivative IBPC1016. A DNA fragment covering the nanA insert and lacZ fusion junction was PCR amplified from chromosomal DNA of a lysogen by using oligonucleotide Nan2 and a primer specific for the lacZ gene. The sequence of the insert and nanA-lacZ junction was verified using the Amersham Thermosequenase kit.
Purification of NanR. The nanR (yhcK) open reading frame (ORF) was engineered with 5' NdeI and 3' BamHI cleavage sites using primers 5'-GGGAATTCCATATGGGCCTTATGAACGCATTTG-3' and 5'-CGCGGATCCCGTTTCTTTTTGTTGGTGGTCTG-3', respectively (restriction sites are underlined). Amplicons generated by PCR from the pTZ(Nan1-3) NanR-overproducing plasmid (26) were digested and cloned into the pET21c vector, fusing the ORF to a C-terminal hexahistidine tag. This plasmid was transformed into strain BL21(DE3) Star, and the culture was induced at an A600 of 0.5 with 1 mM IPTG (isopropyl-ß-D-thiogalactopyranoside). The cells were harvested after 2 h of further incubation, and the pellets were frozen overnight at -20°C. Cell lysis was accomplished by thawing the pellets and sonicating them in 50 mM Tris (pH 8.0)-500 mM NaCl- 15 mM imidazole. Hexahistidine-tagged NanR (NanR-His6) was purified to electrophoretic homogeneity by nickel-nitrilotriacetic acid chromatography (Qiagen, Valencia, Calif.) as described by the manufacturer. Cloning was carried out by C&P Biotech, Thornhill, Ontario, Canada.
To construct a plasmid that would permit overproduction of native NanR, nanR was amplified using a forward (5'-ATGAACGCATTTGATTCGCAAACCG-3') and reverse (5'-GCCGGCCCCGGGTTATTTCTTTTTGTTGGTGGTCTGACC-3') primer pair (the engineered XmaI site is underlined). The pBAD24 (14) expression vector was digested with NcoI, blunt ended, and then digested a second time with XmaI. The purified amplicon was ligated to the doubly digested pBAD24 vector, and the transformed plasmids expressed in strain JM109 were screened for overproduction of NanR by polyacrylamide gel electrophoresis after induction with 0.2% L-arabinose. One construct was sequenced by the W. M. Keck Center for Comparative and Functional Genomics, University of Illinois, to verify insertion of the wild-type nanR gene and saved as pSX675. To purify native NanR, JM109 harboring pSX675 was grown in 1 liter of Terrific broth to an A600 of 0.4 and induced with 0.2% L-arabinose for 3 h with vigorous aeration. The cell pellet was resuspended in binding buffer that contained 20 mM Tris-HCl, pH 8.0, 1 mM EDTA, 5% glycerol, 1 mM dithiothreitol, 0.01% Triton X-100, and 100 mM NaCl. The cells were disrupted by sonication, and particulate material was removed by centrifugation at 50,000 x g in a TLA 100.5 rotor of a Beckman TL-100 ultracentrifuge.
Target DNA (246 bp), including 39 bp to the end of nanR with its terminating codon, 86 bp to the beginning of the translational start site of nanA, and the entire 121-bp intergenic region, was amplified with a forward (5'-CGACGCAGACTCGCTTTATC-3') and reverse (biotin-labeled 5'-GGCACGCTTTCGGTCAGACC-3') primer pair. The soluble protein extract containing overproduced native NanR was incubated for 10 min at room temperature with the biotinylated DNA, and the protein-DNA complexes were subsequently bound to µMACS Streptavidin MicroBeads (Miltenyi Biotec, Auburn, Calif.). The iron filings contained within a µ column trapped the paramagnetic complexes, and the pure native NanR was eluted with high salt after the column was stringently washed to remove unbound polypeptides according to the manufacturer's instructions. N-terminal amino acid sequencing of the purified native NanR was carried out by the Protein Services Facility, University of Illinois, using automated Edman chemistry.
Determination of NanR molecular weight. Determination of the NanR-His6 molecular weight was carried out under nondenaturing polyacrylamide gel electrophoresis conditions essentially as described by Ferguson (11). Nondenatured protein molecular weight markers were purchased from Sigma, and the analysis was conducted according to Sigma Technical Bulletin MKR-137.
To determine the molecular mass of NanR-His6 by gel permeation chromatography, a column containing Sephacryl S-100 was equilibrated in 10 mM Tris buffer, pH 8.0, containing 100 mM NaCl and used to obtain a standard curve from the elution volumes of RNase A (13.7 kDa), chymotrypsin A (25 kDa), ovalbumin (43 kDa), and bovine serum albumin (66 kDa). The void volume of the column was determined using Blue Dextran. Matrix-assisted laser desorption-ionization- time of flight (MALDI-TOF) spectrometry was performed with 100 µg of NanR-His6 on an Applied Biosciences Voyager System 4066 mass spectrometer equipped with a nitrogen laser operating at 337 nm for desorption-ionization at an accelerating voltage of 25 kV and a grid voltage at 90 to 93% of the acceleration. The analysis was performed by the Mass Spectrometry Facility, University of Illinois. Cross-linking was initiated by the addition of ethylene glyco-bis succinimidylsuccinate (Pierce Chemical Co., Rockford, Ill.) in dimethyl sulfoxide to a final concentration of 10 mM, with 2.5 µg of purified native or hexahistidine-tagged NanR in a total volume of 20 µl. The reaction mixtures were incubated at room temperature for 1 h and quenched by adding 0.1 M glycine (final concentration) prior to analysis by denaturing polyacrylamide gel electrophoresis.
ß-Galactosidase assay. Cells were usually grown in E minimal salts medium (9) with 0.4% glycerol and 0.5% Casamino Acids (NZ amine; Sigma). ß-Galactosidase activity was measured as described previously (21), using 0.1% sodium dodecyl sulfate and 2 drops of chloroform for cell permeabilization. Data from at least two independent experiments are presented in Miller units.
S1 nuclease protection and DNase I footprint analysis. Total RNA was purified by the hot-phenol method, and S1 nuclease transcript mapping was carried out as previously described (25). The probe used was the 438-bp Nan1-2 PCR fragment in which the Nan1 oligonucleotide had been labeled with [32P]ATP and polynucleotide kinase. DNase I footprint analysis was carried out as described previously (25) using the same probe as for the S1 experiment.
Mobility gel shift analysis. The 246-bp fragment described above was labeled at both ends with [32P]ATP and polynucleotide kinase and purified by centrifugation through Micro Bio-Spin 30 chromatography columns (Bio-Rad, Hercules, Calif.). Typical 10-µl (total volume) reaction mixtures contained binding buffer (25 mM HEPES and 50 mM sodium glutamate, pH 8.0), 0.1 ng of labeled DNA, 20 ng of poly(dA-dT), and either purified NanR or extract at the concentrations indicated. After the mixture was incubated at room temperature for 10 min, 1.5 µl of a 50% (wt/vol) sucrose solution was added, and the sample was loaded onto a 5% polyacrylamide gel in 0.5x TBE (45 mM Tris, pH 8.3, 45 mM sodium borate, 1 mM EDTA). Samples were run at a constant 100 V for 1.5 h, and the dried gels were exposed overnight at -80°C against Kodak X-OMAT AR film and Fisher Biotech Intensifying Screen for detection of radiolabeled bands by autoradiography. The dissociation constant (Kd) of the interaction between NanR (monomer concentration) and the labeled 246-bp DNA fragment was determined by liquid scintillation spectrometry of excised free DNA according to the method of Carey (8).
Computer-assisted (in silico) analyses. The National Center for Biotechnological Information public database was screened for homologues of nan gene products using the BLAST algorithm (1). Molecular modeling of FadR (protein database number 1HW2) against the N terminus of NanR and estimation of dimer width were kindly performed by James Murray (Oxford University, Oxford, United Kingdom) using the RasMol program (available at www.umass.edu).
|
|
|---|
![]() View larger version (17K): [in a new window] |
FIG. 1. Genetic organization of E. coli nan pathway of sialate catabolism. (A) ORFs and their transcription directions are indicated by large arrows. The bent arrows indicate known or putative promoters. (B) Abbreviations for products, reactants, and enzymes are given in the text. The vertical line represents the E. coli inner membrane.
|
|
View this table: [in a new window] |
TABLE 1. Effect of carbon source on nanA induction
|
60 bp upstream of the longest nanA transcript (Fig. 3). This distance is typical for a classical class I CAP-controlled operon (7).
![]() View larger version (104K): [in a new window] |
FIG. 2. S1 nuclease transcript mapping and DNA footprint analysis of the nanR-nanA intergenic region. (A) The DNA fragment used for the analyses was the 439-bp PCR Nan1-2 (Fig. 3) labeled at the Nan1 end within the nanA ORF. Lanes 1 to 7, S1 mapping of nanA transcripts (25 µg of total RNA) isolated from JM101 (lanes 1, 3, and 5) and IBPC1016 (nanR6) (lanes 2, 4, and 6) grown on glycerol (lanes 1 and 2), glucose (lanes 3 and 4), and Neu5Ac (lanes 5 and 6). Lane 7, control (25 µg of tRNA). Lanes 8 to 11, DNA sequencing reactions on the Nan1-2 fragment using the Nan1 oligonucleotide as a primer. Lanes 12 to 18, DNase I footprinting on the Nan1-2 fragment (lanes 12 to 17 show the binding of purified NanR-His6 at 0.9, 1.7, 3.5, 7.0, 14, and 41 nM; lane 18 shows the free DNA). (B) The DNA fragment used for DNase I footprinting was the 245-bp Nan1-BstN1 fragment labeled at the Nan1 end. Lane 1, free DNA; lane 2, DNA with crude extract of NanR-overproducing strain ( 50 µg of total protein/ml); lane 3, same as lane 2 with 20 nM CAP and 200 µM cyclic AMP (cAMP); lane 4, 5 nM CAP and 200 µM cAMP; lane 5, 20 nM CAP and 200 µM cAMP. The numbers on the right are sizes (in base pairs) of pBR322 after digestion (shown at left) with MspI.
|
![]() View larger version (22K): [in a new window] |
FIG. 3. Intergenic region of the E. coli chromosome between nanR and nanA. The nucleotide sequence between the end of nanR and the translational start codon (boxed) of nanA is indicated. The bent arrows indicate transcriptional start site(s). The CAP and NanR binding sites are indicated, and the nucleotides known or proposed to constitute these sites are boxed. The -10 region is underlined. The first line of nucleotides below the intergenic region gives the FadR consensus pseudopalindrome, and the second line gives the FadR subfamily consensus binding site (29). Lowercase letters indicate bases that are variable within the consensus sequence. The oligonucleotides (Nan1 and Nan2) were previously described (26) and were used to amplify the 439-bp DNA fragment that includes the 121-bp intergenic region.
|
![]() View larger version (65K): [in a new window] |
FIG. 4. Purification of native NanR. Lane 1, molecular mass markers with masses (in kilodaltons) on the left. Lane 2, 31 µg of induced soluble extract. Lane 3, 3.5 µg of purified native NanR with the amino acid sequence of the first 10 residues on the right. The N-terminal methionine shown in parentheses is apparently removed in vivo.
|
Identification of the NanR binding site. When an extract containing overproduced NanR was incubated with a 246-bp radiolabeled DNA fragment covering 39 bp 5' from the termination codon of nanR to 86 bp 3' from the translational codon of nanA and including the 121-bp intergenic region (Fig. 3), a concentration-dependent ladder of three retarded bands was detected by gel mobility shift analysis (Fig. 5A, lanes 2 to 4). An extract prepared from the nanR null mutant IBPC1017 carrying the nanR7 mutation did not confer a mobility shift (Fig. 5B, lane 2). Adding purified NanR-His6 to the IBPC1017 nanR extract reestablished gel shift activity (Fig. 5A, lane 3). The same result was observed when native NanR was used for the reconstitution (not shown). Incubating purified NanR-His6 in the presence of radiolabeled promoter without added extract produced the same gel shift pattern (Fig. 5C) as the crude extract containing native NanR (Fig. 5A). The apparent Kd (5.6 ± 3.4 nM) of the interaction between NanR-His6 and its operator (measured by quantifying the disappearance of free DNA) is within the range found for other transcriptional regulators. Purified native NanR produced the same pattern of gel-shifted bands as its tagged derivative and was found to have a Kd of 2.5 nM, which is within the range calculated for NanR-His6.
![]() View larger version (45K): [in a new window] |
FIG. 5. Gel mobility shift analysis of NanR. (A) Extract of JM101 harboring pTZ(Nan1-3) was added to the 246-bp labeled DNA fragment and subjected to gel shift analysis. Lane 1, DNA fragment alone; lanes 2 to 7, serial 10-fold dilutions of the extract (0.5 mg of total protein/ml) starting at a 1/10 dilution. (B) Gel shift analysis with extract of the nanR7 mutant IBPC1017 (lane 2) and with the exogenous addition of 26 nM NanR-His6 (lane 3). Lane 1, DNA fragment alone. (C) Gel shift activity using purified NanR-His6 was determined at different dilutions. Lane 1, DNA fragment alone. Lanes 2 to 10 contained 44.2, 17.7, 8.8, 6.6, 4.5, 2.2 (duplicate lanes), 1.7, and 0.8 nM NanR-His6, respectively.
|
30-bp region (Fig. 2B, lane 2). Purified CAP protected a region corresponding to the consensus CAP binding motif shown in Fig. 3 (Fig. 2B, lanes 4 and 5), and both CAP and NanR can bind simultaneously (Fig. 2B, lane 3). To further investigate the interaction of NanR with its operator, we used purified NanR-His6 for footprint analysis. As shown in Fig. 2A (lanes 12 to 18), the purified protein protected a region of DNA that includes the three direct repeats of the sequence GGTATA separated by 3 or 2 bp (Fig. 3). Testing dilutions of the repressor shows that NanR binds sequentially to three sites, which is consistent with the presence of the three retarded bands in the gel mobility shift assay. The region protected corresponds to the three direct GGTATA repeats bound by native NanR and shows that the distal site 1 has the highest affinity. The 5' end of the major nanA transcript lies within the middle repeat, so that bound NanR would be expected to prevent RNA polymerase binding. The simplest interpretation of the gel shift and protection assays is that NanR binds its operator by covering approximately three turns of the helix with a stoichiometry of three NanR molecules (presumably homodimers) per target DNA molecule. Assuming that the three bands in the gel shift shown in Fig. 5 result from the successive binding of NanR to target DNA, a logarithmic plot of the molecular weights of these complexes (assuming either a monomeric or dimeric form of NanR and no effect on mobility from DNA bending) versus the relative mobilities of the complexes yields a straight line. This logarithmic relationship between mobility and predicted stoichiometry has been observed for other DNA binding proteins, although it does not rigorously exclude conformational isomerization of the complexes as an alternative explanation (12). Direct determination of the NanR stoichiometry in each band or a cocrystal structure of NanR with its operator will be necessary to confirm the mechanism of NanR binding.
To further demonstrate that NanR specifically recognizes the nanA promoter region in vivo, we cloned the 246-bp DNA fragment used for gel shift analysis into pGEM to generate plasmid pSX676. This construct was transformed into the nanA-lacZ derivative of wild-type strain JM101, resulting in a 50-fold induction compared to an untransformed control (Table 2). Transformation with vector alone resulted in a twofold induction, suggesting minimal nonspecific interaction between NanR and the relatively high intracellular concentration of plasmid DNA. We conclude that vector containing the nanA promoter region titrates intracellular NanR, leading to induction of the nanA fusion.
|
View this table: [in a new window] |
TABLE 2. In vivo titration of NanR
|
100 nM) of purified native NanR (Table 3). Binding was not detected at the relatively low NanR concentration used for the previous gel shift analysis (Fig. 5C); thus, it was unclear whether the interaction of NanR and a single motif reflects specific binding. However, operators with two or three tandem repeats produced the same number of gel-shifted bands as the number of repeats of the motif (Table 3). When two different DNA fragments with three repeats were tested, the number of residues flanking the GGTATA motifs did not influence NanR binding. Thus, operators ranging in length from 49 to 246 bp with three copies of the GGTATA motif invariably produced three gel-shifted bands (Table 3 and Fig. 5). When operators with four or six repeats were tested, the number of bands detected was equal to or one greater than the number of repeats (Table 3). Binding of purified native NanR to operators with four or six repeats displayed the same concentration dependence as binding to the natural operator shown in Fig. 5C, with Kds near 1 and 0.5 nM, respectively. The combined results of these artificial-operator-binding experiments indicate that the GGTATA motif is an important component of NanR target recognition under in vitro conditions. |
View this table: [in a new window] |
TABLE 3. Binding of NanR to artificial operators with various numbers of tandem GGTATA repeats
|
Inefficient displacement of NanR by Neu5Ac in vitro. On the basis of induction of aldolase (NanA) by exogenous addition of Neu5Ac to the medium or induction of the permease encoded by nanT in a nonpolar nanA mutant strain that produces Neu5Ac biosynthetically (endogenous induction), it was previously concluded that sialic acid was the physiological nan inducer (38, 39). This conclusion is supported by the present results shown in Table 1. On the basis of these results, we expected Neu5Ac to efficiently displace NanR from its operator in vitro. However, Neu5Ac, ManNAc, GlcNAc, GlcN, and Man at 8 mM concentration in the binding reaction did not appreciably affect the gel shift pattern (not shown). Similarly, adding 1 mM Neu5Ac to the upper buffer chamber had little effect on NanR binding, indicating that the lack of displacement of NanR from its sites is not due to the negatively charged Neu5Ac being entirely stripped from NanR during electrophoresis.
More than 90% of Neu5Ac in solution exists as the ß isomer with axial C-2 hydroxyl extending up from the ring. Mutarotation (ring opening and closing) slowly interconverts this form to the
isomer, which is the form that exists in all known glycoconjugates except the sialyl donor, CMP-sialic acid, where the sugar is in the ß configuration. A variety of physical techniques have shown that an enzyme like sialate aldolase (NanA) "chooses" the
isomer from solution as its substrate, effectively driving the mutarotation of the predominant ß form (10). We therefore preincubated Neu5Ac with native NanR, and as shown in Fig. 6, this treatment resulted in at least partial displacement of NanR from the DNA. If the NanT permease is also specific for the
isomer, then under physiological conditions, this form would be preferentially delivered into the cell, which could explain the rapid induction kinetics of the operon (38, 39) and our inability to duplicate the process in vitro. A physical analysis of the actual form of Neu5Ac bound to NanR or present intracellularly will be necessary to confirm our hypothesis that the
isomer of Neu5Ac is the physiological inducer.
![]() View larger version (132K): [in a new window] |
FIG. 6. Gel mobility shift analysis after preincubation with Neu5Ac. Purified native NanR at 2.3 (lanes 2 and 3) or 0.5 (lanes 4 and 5) nM was incubated (lanes 3 and 5) or not (lanes 2 and 4) with 10 mM Neu5Ac for 1 h at room temperature prior to gel shift analysis using the 246-bp DNA fragment. Lane 1 shows the DNA fragment alone.
|
|
|
|---|
Members of the FadR/GntR family of "winged" HTH proteins consists of two domains, a highly conserved N-terminal DNA binding domain containing the HTH motif (17) and a more divergent C-terminal domain involved in dimerization and effector binding (29). FadR is the only member of the GntR superfamily whose structure has been solved in three dimensions in complex with its DNA operator (34, 41). This structure demonstrates an unusual interaction of FadR with its 17-bp pseudopalindromic DNA binding site, in which the recognition helices of both HTH motifs of the FadR homodimer interact with the same major groove. This is in contrast to other prokaryotic HTH proteins, like CAP (32) or LacI (20), which contact two adjacent major grooves. In addition, FadR uses a winged-helix (WH) motif to contact the adjacent minor grooves. Although most HTH transcriptional regulators recognize palindromic or pseudopalindromic targets, we have provided evidence that NanR recognizes three tandem GGTATA repeats. Though relatively uncommon in bacteria, this type of interaction is common in eukaryotic regulators of the WH class, where the wing confers specificity, usually by stabilizing the interaction between the DNA residues present in the major groove of the target and the so-called recognition helix through interactions between the wing and an adjacent minor groove (5, 13). However, the WH family is extremely versatile (18), including monomers, homodimers, and heterodimeric protein-DNA complexes. One member of the family, RFX1, has been shown to use its wing to make contact with the major groove, and the recognition helix was found to be overlaid on the minor groove (13). The WH motif also may participate in protein-protein interactions (13).
Three-dimensional modeling of the NanR N terminus with the equivalent FadR domain indicates topological similarity between the predicted structure of NanR and the corresponding structural elements of FadR. This alignment had an overall relatively poor root mean square deviation of 2.4 Å, which is consistent with
26% sequence identity. However, we note that 9 of the 13 FadR amino acid residues lying within hydrogen-bonding distance of the DNA target are potentially shared with the NanR N terminus (Fig. 7). This could imply that NanR makes contacts with DNA similar to those made by FadR, with the nonidentical amino acids conferring specificity on the NanR operator. However, our in vitro data suggest that the basic unit of NanR recognition is the GGTATA repeat. The three models that can most plausibly account for our results are that (i) NanR specifically recognizes each of three tandem direct repeats, (ii) each GGTATA site is itself a pseudo-inverted repeat (GGTA:TAnc; the colon indicates the location of the pseudo-inverted repeat; "nc" indicates nonconserved residues), and (iii) each adjacent pair of GGTATA repeats contributes an inverted repeat, forming a FadR-like pseudopalindrome (Fig. 3) so that NanR binds to two overlapping inverted repeats.
![]() View larger version (27K): [in a new window] |
FIG. 7. Alignment of FadR and NanR N termini. The N termini of NanR (top sequence) and FadR (bottom sequence) were aligned by eye, and identities are boxed. The dashes represent an arbitrary gap to maximize the alignment. The arrows indicate FadR residues within hydrogen-bonding distance of DNA targets in the cocrystal (35). The solid arrows indicate amino acids that are identical in FadR and NanR, while the dotted arrows indicate those amino acids without homology to NanR that could be responsible for specific recognition, as described in the text.
|
100 Å. Note that the footprint of the NanR binding to the operator shows protection of >30 bp, implying that the sequence protected by each NanR molecule extends beyond the GGTATA repeat. Model 1 also includes the possibility that NanR binds to DNA as a monomer, as in the case of WH proteins of the OmpR family, which bind to tandem repeats with a monomer binding to each successive major groove (4, 15, 16). Since we know nothing about the kinetics of NanR dimerization, we cannot be certain that the functional (binding) form of NanR is the homodimer. Possibly three monomers bind and oligomerize on the DNA with a trimer as the functional unit. Ferguson analysis (11) or the use of radiolabeled NanR should allow us to distinguish the possible binding stoichiometries of each complex. In model 2, the GGTATA motif is considered a pseudopalindrome, GGTA:TAnc, so that NanR should bind as a dimer but possibly interacts asymmetrically with one side of the major groove. In model 3, the specific NanR recognition sequence covers two GGTATA motifs. The observation that a FadR consensus pseudopalindromic operator can be identified overlapping the first two GGTATA repeats (Fig. 3) could argue that this palindrome is the biologically relevant motif. This idea is supported by the observation that a minimum of two repeats are necessary for efficient band shift in vitro or for operator titration of the nanA-lacZ fusion in vivo. However, it is difficult to reconcile this model with the rest of our in vitro results, which clearly implicate all three repeats in binding. Recently, revised models for Fur binding to the previously defined Fur Box (19 bp) have been proposed (3, 19). These models are consistent with the cocrystal structure of the related protein DtxR (28, 40) and the structure of the recently determined FurR homodimer (27), indicating that both bind as homodimers to overlapping 15-bp palindromes on opposite sides of the DNA helix. If this model is relevant to NanR binding (model 3), then the FadR pseudopalindrome could constitute one of the overlapping palindromes (Fig. 3). However, no second overlapping palindrome is obvious in the nanA operator. Understanding the diversity of the FadR WH family clearly requires additional cocrystal structures.
Using site-directed mutagenesis of the repeats and the promoter titration system described here, altering any of the three repeats should decrease NanR binding, while mutagenesis of any two repeats should abrogate activity. It will also be interesting to determine the effects of increasing the distance or altering the orientation between repeats to further test our hypotheses about the NanR binding mechanism. In this regard, note that it has been shown that the same protein (CytR) can bind to both direct and inverted repeats when the spacing between the repeats is altered (24).
The inability of one GGTATA motif to titrate NanR indicates that the repressor does not recognize a single repeat unit under in vivo conditions. As this hexameric sequence is expected to occur many times throughout a >4-Mb genome, failure to recognize a single motif makes sense physiologically. However, if each retarded band in the mobility shift analysis represents the corresponding addition of one extra NanR molecule, the complex with the greatest mobility (the smallest number of NanR molecules bound) should correspond to binding of one equivalent of NanR. Possibly the "cage effect" in the band shift experiment produces sufficiently high concentrations to allow detection of the first NanR complex (Fig. 5C). Alternatively, binding of the first NanR molecule could require either recognition of at least two direct repeats or, perhaps, participation of the FadR subfamily-type consensus palindrome (Fig. 3). Binding of successively greater numbers of NanR molecules might then involve recognition of the tandem repeats, with the added potential for protein-protein interactions stabilizing the complex. This mechanism would be a hybrid of models 2 and 3. The binding of NanR to artificial operators, especially those with more than three GGTATA motifs, is consistent with this hypothesis (Tables 2 and 3). The regularly spaced ladder of NanR-DNA complexes shown in Fig. 5 was observed with crude wild-type extract or an extract from a strain overproducing NanR, as well as purified NanR-His6 or native NanR. This should exclude possible artifacts due to the peptide tag or extra components supplied by the extract and indicates that the affinity of NanR for its target and possible NanR-NanR interactions are the only factors involved in DNA binding. Although we cannot exclude the possibility of positional or configurational isomers to explain the banding pattern, studies in systems with known stoichiometries indicate that the primary factor affecting DNA mobility during electrophoresis is the molecular weight of the bound protein (12). Moreover, the DNA footprint analysis (Fig. 2B) would seem to require binding of at least two NanR molecules to explain the protection pattern. Binding of multiple regulatory units to transcriptional control sites is a mechanism for fine tuning gene expression in response to environmental conditions (15) and for increasing the range of induction, as seen in the lacZYA system. Our results demonstrate that the high level of nanA induction which is achieved through loss of NanR by mutation or titration or the presence of Neu5Ac is comparable to that produced by LacI binding to its three separated operators (22). However, binding of multiple LacI molecules requires a DNA looping mechanism, while NanR achieves a similar regulatory range by binding to adjacent sites.
Although both our present and previous results (30, 38, 39) clearly implicate Neu5Ac as the nan inducer, we were initially surprised by the failure to duplicate the in vitro equivalent of induction by incubating NanR with Neu5Ac. However, the temporal dependency of the in vitro induction suggests that the thermodynamically minor
isomer of Neu5Ac might be the true inducer. Because we have not detected orthologues of NanR in organisms with putative nan systems that are not closely related to E. coli, nan expression in different organisms may involve alternative regulators and inducing signals. In other words, the coordination between Neu5Ac synthesis or acquisition and its catabolism in different species may require distinct regulatory circuitry and involve diverse effector molecules. The relative ease of genetic manipulation provided by E. coli offers an attractive system to further explore the role of sialic acid metabolism in host-microbe interactions and provides the basis for similar studies in other pathogens or commensals.
K.A.K. was supported in part by a College of Agricultural, Consumer and Environmental Sciences Value-Added Predoctoral Fellowship and an American Heart Association (Midwest Affiliate) Predoctoral Fellowship. This work was supported by National Institutes of Health grant R01 AI42015 (E.R.V.) and by the CNRS and the Université Paris 7 to UPR9073 (J.P.).
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»