Previous Article | Next Article ![]()
Journal of Bacteriology, May 2005, p. 3496-3501, Vol. 187, No. 10
0021-9193/05/$08.00+0 doi:10.1128/JB.187.10.3496-3501.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Department of Biochemistry, McGill University, 3655 Promenade Sir William Osler, Montreal, Quebec, Canada H3G 1Y6
Received 14 November 2004/ Accepted 11 February 2005
|
|
|---|
-helix and a flexible C terminus. NMR titration experiments suggest that the ß1-ß2 and ß3-ß4 loops and the C-terminal helix are important elements in RNA binding. Even though the ß3-ß4 loop contains a highly conserved RNA-binding motif, GxxG, typical of KH domains, our structure excludes CsrA from being a member of this protein family, as previously suggested. A mechanism for the recognition of mRNAs downregulated by CsrA is proposed. |
|
|---|
Intracellular levels of CsrA are regulated by two untranslated RNA molecules, CsrB and CsrC, that act as antagonists by sequestering CsrA and preventing its binding to target mRNAs (13, 17, 28). CsrA binding to both CsrB and CsrC seems to be mediated by a highly repetitive sequence element, 5'-CAGGA(U,C,A)G-3', located in the loops of predicted CsrB/C hairpins (17, 28). It has been proposed that CsrA exists in equilibrium between CsrB/C and CsrA-regulated mRNAs, implying that CsrB/C levels are a key determinant of CsrA activity in the cell.
CsrA homologs have been recognized for important roles in the regulation of stationary-phase gene expression in other bacterial species. The CsrA homolog (RsmA) of Erwinia species regulates a variety of genes involved in soft-rot disease of higher plants (8, 9). csrA and csrB in Salmonella enterica serovar Typhimurium regulate genes involved in epithelial cell invasion by this species (2). In Pseudomonas aeruginosa, the Csr (Rms) system controls the quorum-sensing systems Las and Rhl, which regulate several of its virulence factors.
CsrA is a 61-amino-acid dimeric protein previously thought to be related to RNA-binding proteins containing the KH motif (18). In this work we describe a nuclear magnetic resonance (NMR)-based model of the CsrA dimer from Escherichia coli and some of its RNA-binding properties. Furthermore, a mechanism for the recognition of mRNAs down regulated by CsrA is discussed.
|
|
|---|
1 mM protein in 50 mM sodium acetate buffer, 300 mM NaCl at pH 4.5. For preparation of 13C- and 15N-labeled and unlabeled protein samples, equal amounts of purified 13C- and 15N-labeled and unlabeled proteins were mixed in the presence of 6 M urea overnight. Urea was removed by extensive dialysis against NMR buffer.
Gel filtration.
The oligomeric state of CsrA was determined using gel filtration (Hiload 16/60 Superdex 75; Pharmacia Biotech). Regeneration-induced CNP homolog (RICH) (53.8 kDa), RICH in 1 mM dithiothreitol (26.9 kDa), and gamma-ear protein (13.8 kDa) were used as standards. Samples were run with a flow rate of 1 ml/min at room temperature in NMR buffer as described above. CsrA eluted from the column at a predicted molecular mass of
18 kDa as expected for a dimer.
NMR spectroscopy. NMR experiments were recorded at 303 K on a Bruker Avance 600-MHz spectrometer. Backbone and side chain assignments of CsrA were determined in HNCACB, CBCA(CO)NH, edited [15N/13C]TOCSY-HMQC, [13C]HCCH-TOCSY, and [13C](h)CCH-TOCSY experiments. NOE data for the structure determination were obtained from homonuclear NOESY, 15N-edited or 13C-edited 3D NOESY experiments. Backbone assignments at pH 7.5 were determined using HNCA and CBCA(CO)NH experiments. The intermolecular NOEs were detected using a filter-edited 3D NOESY spectrum and a pair of identical 13C-edited 3D NOESY with and without decoupling in the indirect 1H dimension (100-ms mixing time). A 13C/15N-labeled/nonlabeled protein sample (1:1) was used for these experiments. 1H-15N residual dipolar coupling constants were measured from comparison of IPAP-HSQC experiments recorded on CSRA with and without 2.5% C12E5-hexanol (24) For the measurement of dipolar couplings we used 50 mM sodium acetate buffer at pH 4.5 and 0.5 mM CSRA. All NMR spectra were processed using either XWINNMR version 2.5 or 3.1 (Bruker Biospin) or NMRPipe (10). Evaluation of spectra and manual assignments were completed with NMRView (15). Pulse-field gradient self-diffusion experiments were done according to the method reported by Ekiel et al. (12).
Structure calculations.
CNS 1.1 software (4) was used to generate an initial fold of CsrA with a basic set of 122 NOEs manually assigned from NOESY spectra (104 intramolecular and sequential NOEs). Hydrogen bond constraints were introduced to secondary structure regions as determined by chemical shift analysis, characteristic NOE patterns, and analysis of amide exchange rates. Dihedral restraints (
and
) were obtained using the TALOS program (7). These calculations generated a fold that was used as a model template for automated assignments by ARIA1.1 (21). The final structure of CsrA was calculated with a total set of 710 constraints collected from the experiments described earlier. Noncrystallographic symmetry restraints were used to keep both subunits in the dimer with the same conformation. In the final round of calculations, CNS 1.1 was extended to incorporate residual dipolar coupling (RDC) restraints for further refinement using the torsion angle space. The axial and rhombic components were defined from a histogram of measured RDCs (5) and optimized by a grid search method (6). Ten structures were selected based on the lowest overall energy and least violations to represent final structures. PROCHECK-NMR was used to generate Ramachandran plots to check the protein's stereochemical geometry (16). A summary of the structural statistics for CsrA is shown below in Table 1.
|
View this table: [in a new window] |
TABLE 1. Structural statistics for CsrA
|
0.7 mM CsrA) in the presence of different amounts of ligand concentrations in the 0 to 2.0 mM range. As high concentrations of imidazole improve the solubility of CsrA at physiological pH, the protein sample and RNA stock solutions were prepared in 500 mM deuterated imidazole, 300 mM NaCl at pH 7.5. The RNA sequences used were 5'-ACCUGCACACGGAUUGUGUGGUUC-3' (glg25), 5'-CACACGGAUUGUGUG-3' (glg15), and 5'-CAGGAUG-3' (CsrB consensus sequence) and were synthesized in the Core DNA & Protein Services, University of Calgary. Protein structure accession numbers. The coordinates of CsrA have been deposited in the RCSB under PDB code 1Y00, and the NMR assignments are under BMRB accession 5&link_type=GEN">11855.
|
|
|---|
4.5), CsrA does not aggregate into higher-order forms but remains as a dimer (Fig. 1A). Gel filtration data were also confirmed by NMR self-diffusion experiments (Fig. 1B) (12). At pH 4.5, CsrA had a diffusion coefficient of 0.93 x 106 cm2/second, in agreement with the formation of a dimer at low pH (apparent molecular mass of 18.6 kDa). The apparent molecular mass for the aggregate at pH 7.5 was
29.3 kDa. NMR experiments for determining the solution structure of CsrA were performed at pH 4.5.
![]() View larger version (25K): [in a new window] |
FIG. 1. Structure determination of the CsrA dimer. A. Gel filtration chromatogram of CsrA. Protein standards were RICH (53.8 kDa), RICH in 1 mM dithiothreitol (26.9 kDa), and gamma-adaptin ear protein (13.8 kDa) as indicated. CsrA eluted as an 18-kDa protein, consistent with a dimeric form. B. Pulse-field gradient-self-diffusion experiments for CsrA. The slopes in the plot are proportional to the diffusion coefficient (Ds). C. Representative two-dimensional strips of 13C edited NOESY experiments with and without carbon decoupling in a 1:1 sample of 13C/15N-labeled/unlabeled protein. Peaks from the carbon-coupled experiment are shown to the left for both sets of strips. NOEs resulting from intermolecular interactions appear as singlets in both experiments, while intramolecular NOEs are doublets in the uncoupled experiment.
|
, Cß, and H
(29) and the analysis of sequential and short-range NOE connectivities involving NH, H
, and Hß protons indicated that the CsrA monomer is composed of five ß-strands and a short
-helix. The unstructured C terminus is unfolded as shown by measurement of backbone {1H}-15N heteronuclear NOEs (Fig. 2E). Analysis of 13C-edited NOESY experiments recorded in conjunction with and without carbon decoupling on a 1:1 mixture of 13C/15N-labeled/nonlabeled CsrA allowed us to determine intermolecular NOEs and the hydrogen bond network defining the CsrA dimer (Fig. 1C).
![]() View larger version (62K): [in a new window] |
FIG. 2. Structure of CsrA. (A) The CsrA sequence from E. coli K-12 (gi:161306043) is aligned with homologous proteins from S. enterica (gi:16766132), Yersinia pestis (gi:16123457), E. carotovora (gi:50122288), Vibrio cholerae (gi:9654977), Buchnera aphidicola (gi:21672665), Pseudomonas fluorescens (gi:38489882), Legionella pneumophila (gi:54296805), and Haemophilus influenzae (gi:16272754). Completely conserved acidic, basic, polar, and hydrophobic residues are red, blue, green, and gray, respectively. The location of secondary structure elements is shown on top. (B) Stereo view of the backbone atoms for residues 1 to 55 of 10 selected conformers. The two subunits are magenta and blue. (C) Ribbon depiction of CsrA. (D) Topology diagram of the CsrA structure showing the connectivity between strands in the two ß-sheets. (E) Values of the {1H}-15N heteronuclear NOE for backbone amides, showing the unstructured C terminus.
|
20%), with proton and carbon nuclei resonating within a narrow chemical shift range. The dimeric nature of CsrA contributed further to this ambiguity. However, the high content of an antiparallel ß-sheet within CsrA allowed the structure of CsrA to be defined using relatively sparse NMR-derived restraints (Fig. 2 and Table 1).
Each CsrA monomer is composed of five strands, ß1 to ß5, corresponding to residues 2 to 6, 10 to 15, 18 to 23, 30 to 35, and 41 to 43. Residues 46 to 50 fold into a short
-helix followed by an unstructured C terminus (residues 51 to 61). In the dimer, strands ß1 and ß5 of one monomer hydrogen bond to ß4' and ß2' of the other monomer, forming a mixed antiparallel ß-sheet (Fig. 2B to D). Packing of these two mixed ß-sheets forms the core of CsrA.
In spite of the low sequence similarity, it was proposed that CsrA was a member of the KH domain family, a group that comprises a diverse series of RNA-binding proteins (18). The characteristic signature of this protein family is the presence of a
30-amino-acid segment that expands around a conserved GxxG core sequence (where x is any amino acid, with a preference for basic residues) (1). In CsrA, the GxxG motif has the sequence GVKG (residues 24 to 27) and is located in the loop connecting strands ß3 and ß4. Our structure proves that CsrA is not a member of the KH family of proteins, which have a characteristic ß
ßß
topology (19, 20) that differs from that of CsrA. However, it is still likely that the GxxG sequence in CsrA is involved in the recognition of the GGA triplet present in all CsrA-binding sites (3).
RNA binding of CsrA. Charged residues in CsrA are grouped into well-defined clusters on the protein surface (Fig. 3A). The main basic patch comprises residues R6, R7, K26, R31, and the side chain amides of N28 and Q29, defining a putative RNA-binding site. Residues E10, E45, and E46 and D16, E17, and E39 give rise to well-defined acidic patches located on the side and bottom of the CsrA molecule. Electrostatic interactions between these basic and acidic patches may explain CsrA aggregation at high concentrations.
![]() View larger version (45K): [in a new window] |
FIG. 3. Surface properties of CsrA. (A) Surface potential of the CsrA structure. Blue and red colors indicate positive and negative electrostatic potential, respectively. (B) Superposition of 15N-HSQC spectra of CsrA in the absence (blue) and presence (red) of the glg15 RNA (5'-CACACGGAUUGUGUG-3'). (C) Mapping of chemical shift changes (from panel D) onto the CsrA structure. Residues with large chemical shift changes are red or pink, residues with small changes are white, and residues that could not be quantified are gray. (D) Measured chemical shift changes versus residue number from the RNA titration, calculated using the equation [( H)2 + (0.2 · N)2]1/2. Secondary structural elements are shown on top. Blank spaces represent peaks that could not be traced with certainty.
|
Toeprint analyses have identified the position of bound CsrA on target mRNAs (3, 11). In the case of the glgCAP transcript, RNA digestion and gel mobility assays were performed on a 134-nucleotide untranslated leader containing the CsrA-binding site. Binding of CsrA protected both the single-stranded glgC Shine-Dalgarno (SD) sequence and the glgCAP hairpin further upstream from cleavage by RNase T1 and Pb2+ (3). Structural changes seem to occur in the hairpin RNA, as CsrA binding enhanced the cleavage of the sequence in the stem-loop protected in the unbound form.
In light of our structural data, we postulate that the CsrA dimer presents its GxxG motif to simultaneously recognize both GGA sequences in the SD sequence and the upstream hairpin loop. Since SD sequences are present in most bacterial mRNAs, CsrA likely requires two signals to recognize the correct transcripts to be regulated. The upstream hairpin loop in glgCAP transcript may act as an allosteric activator for CsrA binding to the downstream SD sequence. Further, CsrA seems to bind the GGA sequence with higher affinity when present as part of a hairpin loop than in single-stranded sequences. This is supported by experiments with reverse transcriptase where the SD-CsrA interaction was not sufficiently strong to disrupt the reverse transcriptase complex (3). Our titration experiments substantiate the finding that CsrA binds preferentially to hairpin loop structures. In addition, the GGA sequence in the hairpin loop affects the affinity of CsrA binding to the SD (3). Furthermore, conformational changes seem to occur upon binding to RNA as evidenced by our own data and footprinting studies and RNA structure mapping that demonstrated that the base of the glgCAP leader RNA hairpin is disrupted when CsrA is bound (3). Binding affinity is probably higher for the hairpin due to the reduced conformational entropy associated with this structure.
We propose that CsrA binds to its target mRNAs in two steps. First, the CsrA dimer recognizes the hairpin loop upstream of the SD and binds to the GGA sequence by using one of the GxxG loops. At this point, it is likely that both RNA and CsrA experience a conformational change that increases CsrA's affinity for the downstream single-stranded SD sequence. CsrA then binds to the SD sequence through the second GxxG motif, thereby preventing transcription and rendering the RNA more susceptible to degradation.
Note. While this paper was under review, the X-ray structure of the CsrA homolog from P. aeruginosa was released (RCSB PDB code 1VPZ). The structures are very similar (RMSD of 1.71 and a Dali Z-score of 10.0) and have identical strand topologies.
We thank M. Cygler, M. Bachetti, D. Elias, G. Kozlov, T. Moldovenau, and N. Siddiqui for assistance and helpful discussions.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»