Previous Article | Next Article ![]()
Journal of Bacteriology, May 2008, p. 3344-3352, Vol. 190, No. 9
0021-9193/08/$08.00+0 doi:10.1128/JB.01800-07
Copyright © 2008, American Society for Microbiology. All Rights Reserved.
,
Department of Biophysics, Molecular Biology and Genetics, University College of Science, Kolkata-700009, India,1 Structural Genomics Section, Saha Institute of Nuclear Physics, Kolkata-700064, India2
Received 14 November 2007/ Accepted 24 February 2008
|
|
|---|
|
|
|---|
If the same RNA can fold a large number of proteins which are structurally unrelated, the question of specificity arises in the interaction of the unfolded proteins with the RNA. Here we have deciphered the RNA-protein interactions at the nucleotide-amino acid level.
To locate the interacting nucleotides of domain V, we used primer extension analysis and determined the positions where the extensions were blocked. We found that the same set of nucleotides interacts with four different proteins tested: bovine carbonic anhydrase (BCA), lactate dehydrogenase (LDH), malate dehydrogenase (MDH), and chicken egg white lysozyme. Using matrix-assisted laser desorption ionization-time of flight/time of flight (MALDI-TOF/TOF), we also tested whether the RNA interacted with specific amino acids. In the two proteins tested, BCA and lysozyme, the amino acids involved were specific for each protein and could even be identical between the two proteins. The disposition of the amino acids on the three-dimensional structure shared some common features. The RNA-protein interactions thus appear to have significant specificity.
|
|
|---|
Unfolding and refolding of proteins. BCA, LDH, MDH, lysozyme, and human carbonic anhydrase I (HCA-I) were denatured with guanidine hydrochloride (7, 11, 31, 35), and the loss of their secondary structures was verified by circular dichroism spectra (16). For all enzymes except LDH, 6 M guanidine hydrochloride was used for denaturation. For LDH, 1 M denaturant was used. The denatured enzymes were diluted 100 times (final concentration, 300 nM) in the refolding buffer (50 mM Tris-HCl, pH 7.5, 100 mM NaCl, 10 mM magnesium acetate) in the absence or in the presence of 23S or domain V rRNA and were incubated for 30 min at 25°C to allow folding. The molar concentration of RNA was twice that of the enzymes. The residual amount of guanidine hydrochloride had no effect on the activities of the enzymes. The activities of refolded proteins were determined, following published protocols (7, 11, 31, 35), and expressed as a percentage of activity of the same amount of native protein. For lysozyme the activity assay was not done, but cross-linking with domain V for primer extension (see below) was done with it.
UV cross-linking of domain V-refolding protein complexes. Three hundred microliters of sample, containing 125 nM domain V RNA and 600 nM unfolded protein (to ensure that all RNA molecules were bound to proteins), was irradiated on a glass dish in ice with 254-nm UV (GS Gene Linker; Bio-Rad) at a distance of 6 cm for 2 min. The irradiated samples were precipitated by salt-ethanol and washed with 70% ethanol.
Primer extension on cross-linked domain V-protein complexes.
Primer BG32 (5'ACCCCGGAATTCGCGCCCACGGCAGATAGG3') was annealed to cross-linked domain V-protein complexes (about 3 to 5 µg). Annealed primers were labeled with [
-32P]dCTP, following the 3-deoxynucleoside triphosphate method, at 55°C using ThermoScript reverse transcriptase (Invitrogen). Labeled primers were extended at 58°C after the addition of all four deoxynucleoside triphosphates in excess by the same enzyme for about 45 min. The products were precipitated, washed with 70% ethanol, and analyzed on a 6.5% polyacrylamide gel in 8 M urea next to a sequencing ladder of domain V rRNA genes. It was obtained using the same primer by Thermo Sequenase DNA polymerase (Thermo Sequenase cycle sequencing kit; USB).
Sample preparation for MS. The mass spectrometry (MS) samples were prepared as described previously (38, 39). UV-irradiated samples were precipitated, air-dried, and dissolved in 8 M urea, 50 mM Tris-HCl (pH 8.0), 1 mM EDTA, and 1 mM dithiothreitol, heated for 10 min at 70°C, and cooled to room temperature. The sample was diluted 10-fold with 50 mM Tris-HCl (pH 8.0) and then digested with trypsin (Gold MS grade; Promega) with an enzyme/substrate ratio of 1:25 for 16 h at 37°C. The samples were precipitated and dissolved in 50 mM Tris-HCl (pH 8.0) and 2 mM EDTA. They were then digested with RNase A and RNase T1 for 2 h at 50°C, followed by trypsin for 16 h at 37°C. Digestion was stopped by direct injection of 500 µl sample onto a µRPC_C18_ST_4.6/100 column (GE Healthcare) mounted in an AKTAbasic 100 system and run using the Unicorn software. For reverse-phase chromatography, the solvents were as follows: solution A, 0.1% (vol/vol) trifluoroacetic acid (TFA) in double-distilled H2O; solution B, 0.085% (vol/vol) TFA in acetonitrile. The column was equilibrated with a 5x column volume (CV) of solution B, and the following gradients were applied in steps with a flow rate of 0.1 ml/min: (i) isocratic elution of the injection peak at the rate of 5% solution B for 2x CV, (ii) 5% to 100% solution B for 20x CV, and (iii) isocratic wash with 10% B until the baseline level was reached. Five hundred microliters of fractions were collected, and the presence of cross-linked peptide-RNA was checked by absorbance at 250 nm. Peak fractions were pooled, vacuum dried, and then mixed with the appropriate matrix for MS and tandem MS (MS/MS) analysis.
MS.
MS was carried out in an Applied Biosystems AB4700 MALDI-TOF/TOF mass spectrometer. Each 0.5-µl sample was mixed on a stainless-steel plate with an equal volume of either 10 mg/ml 2,5-dihydroxy benzoic acid or 5 mg/ml
-cyano-4-hydroxycinnamic acid (recrystallized and dissolved in 50% acetonitrile-0.1% TFA) matrix solutions. It was then dried at room temperature and subjected to MS and MS/MS. PMF were obtained in positive reflector mode. The instrument was operated in the delayed extraction mode with a delay time of 200 ns. Spectra were obtained by accumulation of 2,500 and 6,000 consecutive laser shots, respectively, in MS and MS/MS mode, and the laser intensities used were in the range of 4,000 to 5,000. Close external calibration for MS was performed using 4700 Cal mix (Applied Biosystems). Peak harvesting was carried out using the 4000 series Explorer (Applied Biosystems) software. Collected spectra were processed using Data Explorer software V.4.6 for advanced baseline correction (peak width, 32; flexibility, 0.5; degree, 0.1), noise reduction (correlation factor, 0.8), and deisotoping. Processed spectra were checked for the presence of tryptic peptides and their possible modifications for the representative proteins using the MassSorter v.1.05.04 software program (3). The search parameters included 1 missed cleavage and an error tolerance of ±50 ppm for PMF or ±0.2 Da for the MS/MS ion search.
|
|
|---|
![]() View larger version (32K): [in a new window] |
FIG. 1. Folding of proteins in the presence of rRNA components. (A) Unfolded proteins are folded in the absence and presence of 23S rRNA components. 1, 2, 3, and 4 represent the folding of BCA, HCA-I, MDH, and LDH, respectively. Bar graphs represent the mean folding values (± standard deviations) from three independent experiments. (B) Bar graphs showing the activity of native BCA after incubation without or with different components of 23S rRNA. Bar graphs represent the mean folding values (± standard deviations) from three independent experiments.
|
![]() View larger version (48K): [in a new window] |
FIG. 2. Primer extension analysis on domain V RNA cross-linked to proteins. (A) Primer extension analysis on domain V RNA after UV irradiation in the absence (lane 1) or in the presence (lanes 2 to 5) of protein. The proteins used were BCA, lysozyme, MDH, and LDH in lanes 2 to 5, respectively. (B) Secondary structure of the central loop of E. coli domain V rRNA. The boxes include the nucleotides that are identified to be cross-linked with proteins. The position of the cDNA primer used for primer extension analysis is indicated by the solid line at the 3' end of domain V. (C) Electrophoretic mobility shift assay showing RNAs UV irradiated without or with BCA. Lanes 1, 3, and 5 represent domain IV, domain II, and domain V of 23S rRNA UV irradiated without BCA, respectively. Lanes 2, 4, and 6 represent the same RNA molecules UV irradiated in the presence of unfolded BCA, respectively. Note that only domain V RNA shows a mobility shift in the presence of unfolded protein (lane 6).
|
-32P]UTP-labeled domain V, II, and IV regions of 23S rRNA synthesized in vitro to UV cross-linking in the absence and in the presence of unfolded BCA. The UV-irradiated samples were electrophoresed through a 5% native polyacrylamide gel (Fig. 2C). In the presence of the protein, the mobility of only domain V was shifted. This shows that unfolded proteins cross-link specifically with domain V RNA. A few shorter RNA species appeared in the lane of domain V RNA (Fig. 2C, lanes 5 and 6) due to premature termination of transcription, which is normal in cases of transcripts with profuse secondary structures. However, the electrophoretic mobility of these short transcripts was not shifted, apparently because they could not bind the protein. Strategy for identification of amino acids that interact with domain V by MALDI-TOF MS. We used the method improvised by the group of Urlaub and Luhrmann to identify oligopeptide stretches that cross-link to domain V RNA during the refolding reaction (24, 34, 37-39). Subsequently, the cross-linked amino acid residues of the oligopeptides were identified. BCA and lysozyme were chosen for these studies since they function as monomers.
UV-irradiated samples were digested with trypsin and then treated with an excess of RNase A and RNase T1 to hydrolyze the RNA, so that the peptides remain attached to a single nucleotide or short oligonucleotides only. The sample was fractionated, and the peak fractions at 250 nm were analyzed for peptides or peptides attached to a nucleotide(s). An accurate analysis of the molecular mass of such cross-linked heteroconjugates, by comparison with the theoretical masses of all possible tryptic fragments of the protein, should provide the composition of the cross-linked peptides. Peptide fragments generated from contaminants such as trypsin, keratin, RNase, etc., were excluded from the analysis.
Accurate mass determination of cross-linked peptide by high-resolution MALDI-TOF spectra. To identify the modified peptide stretches (due to the attached nucleotide), all experimental data were compared with the theoretical data in the presence of custom modifications for our study (nucleotides of RNA and their fragments, such as adenosine [A], A-phosphate, A-ribose, ribose only, etc.) using the MassSorter software program (3). A few modifications and the molecular weights are given in Table S1 in the supplemental material. For both BCA and lysozyme, analyses were performed separately through MassSorter. Five modified stretches were found in both cases. For example, the oligopeptide stretch 148 VGDANPALQK 157 (m/z 1012.48) in the tryptic pool of BCA remains attached with one cytosine (molecular weight, 305) residue, giving rise to a high-intensity spectrum at m/z 1317.78 (Fig. 3A). The interacting stretches of BCA and lysozyme obtained from MassSorter analysis are shown in Table S2 in the supplemental material. MS results clearly showed specific stretches of BCA and lysozyme interacting with domain V during folding.
![]() View larger version (40K): [in a new window] |
FIG. 3. MS and MS/MS analysis to show that the peptide stretch 148 VGDANPALQK 157 of BCA remains cross-linked with one cytosine residue of domain V RNA through its Asn152 residue. (A) MS spectrum of the signal at m/z 1317.78, derived after addition with a cytosine (C) residue (molecular weight, 305) from the "parent ion" at m/z 1013.48 (VGDANPALQK) (M+H)+. (B) The sequence of the corresponding peptide, with a scheme of the y and b fragmentation series. The asparagine residue within the box is cross-linked with the C moiety. (C) MS/MS spectrum of modified VGDANPALQK (m/z = 1317.78, i.e., attached with C) recorded with a 2,5-dihydroxy benzoic acid matrix. The spectra show an almost complete series of y-type and b-type fragments (b5/y5 to b9/y9 shown). Ion b5 is attached with a C residue, as revealed by the m/z value, while y6 is also attached with a C residue, so Asn152 is cross-linked with the C residue of domain V. (D) Complete list of b and y ions is presented here. b1 to b4 and y1 to y5 indicate unmodified ions. b5* to b9* and y6* to y9* represent modified ions attached with a C residue (the asterisk is used to depict modified ions).
|
|
View this table: [in a new window] |
TABLE 1. Contact sites between unfolded BCA or lysozyme and domain V RNA during foldinga
|
![]() View larger version (28K): [in a new window] |
FIG. 4. MS analysis of VGDANPALQK cross-link to show the RNA sequence where it binds through the cytosine residue. (A) Spectral range (m/z 1000 to 2600) which includes the peptide stretch alone (m/z 1012.48) and its different cross-linked form. (i) The VGDANPALQK peptide only and the same peptide cross-linked with one C residue are shown. (ii) MS spectrum of the signal at m/z 2276.20, derived from the VGDANPALQK peptide, where it remains attached with the UGUC nucleotide stretch of domain V. (iii) The same peptide, while attached to the UUGUC nucleotide stretch of domain V, gives a signal at m/z 2580.48. (B) The deduced oligonucleotide sequence of domain V RNA is shown; the last C residue of the 3' UUGUC 5' stretch interacts with the peptide.
|
![]() View larger version (79K): [in a new window] |
FIG. 5. Atomic resolution crystal (3.5 Å) structure of the E. coli 50S ribosomal subunit (PDB identifier 2AW4) showing the nucleotides involved in protein folding. Nucleotides U2473, U2491, C2551, A2560, and A2587, which interact with the unfolded protein, are situated at the entry mouth (shown by an arrow) of the tunnel and are depicted as CPK models. The figure was created using the BIOSYM package (Insight II; Biosym/MSI).
|
![]() View larger version (44K): [in a new window] |
FIG. 6. Surface residues of BCA are shown which interact with domain V RNA. (A) Ribbon diagram of BCA (PDB identifier 1V9E) showing five residues (labeled and marked in red) that interact with RNA during folding. Four of these amino acids are located in the loop regions. The cartoon was drawn using the PyMOL software program. (B) Surface view (using the GRASP software program) of two approximately opposite faces of the BCA molecule where the interacting peptide stretches 1 SHHWGYGK 8 (red), 58 MVNNGHSFNVEYDDSQDK 75 (green), 80 DGPLTGTYR 88 (blue), 148 VGDANPALQK 157 (purple), and 253 QVRGFPK 259 (cyan) are located. (C) Ribbon diagram (using PyMOL) of lysozyme (PDB identifier 1GXV) showing five residues (labeled and marked in red) that interact with RNA during folding. (D) Surface view (using PyMOL) of two opposite faces of lysozyme where the interacting peptide stretches 6 CELAAAMK 13 (red), 14 RHGLDNYR 21 (green), 34 FESNFNT QATNR 45 (blue), 62 WWCNDGR 68 (purple), and 115 CKGTDVQAWIR 125 (cyan) are marked.
|
|
|
|---|
The structural overlap of the two functions, protein synthesis and folding, on the PTC (8, 25-27, 30) suggests that they coevolved in the RNA world. The secondary structure of domain V has been preserved throughout evolution, and some of the bases in its large central loop are invariant (16). We find that most of the bases that contact proteins are in the loops and bulges, which agrees with the current view on RNA-protein interaction.
The specific nucleotides in PTC implicated in protein folding map on the crystal structure of the 50S subunit in the cavity at the entry mouth of the exit tunnel that goes through the 50S subunit (Fig. 5). We measured the affinity constants of unfolded BCA for 70S and 50S particles to be 9 x 107 M–1 and 9 x 108 M–1, respectively. The corresponding values for native BCA were 2 x 106 M–1 and 4 x 106 M–1, respectively (unpublished data). Therefore, the newly synthesized proteins, released from the 70S ribosome, have a strong preference to bind to the 50S subunit. In fact, in recent publications (4, 5), we have shown that full-length proteins remain associated with free 50S subunits when the 70S ribosome was used for folding in vitro (5) and when the proteins were synthesized in a cell-free E. coli translation mix or in live E. coli cells (4). These results suggest that PTC-mediated protein folding could be physiologically relevant.
The binding of full-length nascent proteins to the 50S subunit must follow the splitting of the 70S ribosome, which takes place after complete synthesis of the protein. The proteins most likely exit in a largely unfolded state from the exit tunnel because it is too narrow to accommodate proteins with tertiary folds (2, 26). Upon release from the tunnel, the proteins can bind to free 50S subunits in the cytosol. Alternatively, the nascent protein could bind to the PTC of the ribosome from which it was synthesized, but this would require going around the 50S subunit to the PTC at the tunnel entry point, which seems unlikely.
Another possibility, that all proteins might not exit from the tunnel, is suggested from the following observations. The folding of globin (heme binding site formation) takes place on the PTC, and this intermediate is too large to go through the tunnel and has to leave via the interface between the two subunits (22). Some nascent peptide chains cross-link to 16S rRNA (10). The tail spike protein of phage P22 remains associated with the 30S subunit after the subunits dissociate (12). The nascent proteins could also get distributed in branches from the exit tunnel seen in the rear side of the large subunit (36). In a cryoelectron microscopy study, three nascent proteins, Ig1, Ig2, and green fluorescent protein, were found to be largely concentrated in the entry cavity of the tunnel (PTC region) in E. coli ribosomes stalled following translation (18). In these and in many earlier studies (6), the full-length protein is found to exit from a point which is at a distance from the PTC. Whether the exit is from within or outside the tunnel thus remains unsettled.
Earlier we showed that folding of a number of proteins, including β-galactosidase and the DnaK chaperone in vitro and in vivo and firefly luciferase translated in vitro, could be suppressed by antibiotics that bind to domain V (4, 9, 17). These results lead us to believe that the folding of full-length proteins by domain V RNA in vitro is a good mimicry of the folding in vivo.
In the primer extension experiments, we found that in most cases the nucleotide bases in PTC interacted directly with the amino acids of unfolded proteins; only in a few cases could the interaction be through the phosphate moiety of the nucleotides (see Table S2 in the supplemental material). Some of the ambiguities in the results of primer extension experiments, like the binding of folding proteins to more than one (consecutive) nucleotide, were cleared by the MS studies. Among the amino acids of the two proteins (BCA and lysozyme) studied here, asparagine interacted with RNA the most frequently, with the other amino acids being lysine, glutamine, leucine, and glycine. This agrees with the list of amino acids that are prevalent among those interacting with nucleotides of RNA (21, 32). We have not detected any negatively charged amino acid interacting with RNA.
The amino acids that interact with domain V were found to be present mostly on random coils on the surfaces of the crystal structures of globular proteins (Fig. 6). The forces and interactions working on the polypeptide when it folds in an aqueous environment can give it the right shape if topological and steric problems are overcome. We can imagine three major steps where domain V RNA could help.
(i) A major deficit that must be overcome during folding is the large positive-configurational entropy of random coils (15). Reducing this entropy by clamping the amino acids in random coils on the outside of the protein could be the major contribution of domain V.
(ii) Another major problem is the unfavorable standard free energy of transfer associated with dehydration of hydrophobic functional groups as they are pulled into the interior of folded polypeptides (33). This effect can be largely modulated by slowing down the process by reducing the temperature to give the surrounding water a more organized structure. We found that folding by ribosome and domain V RNA is very efficient at low temperatures. It is general knowledge that the proteins retain their native conformation much longer at low temperatures.
(iii) A third advantage is trapping of the folding protein at the entry mouth of the 50S ribosomal tunnel (18) through the external random coils with the specific nucleotides of PTC. This would considerably reduce the overall entropy of the system.
Once these disruptive processes are tamed, the internalized secondary structures might engage in a jigsaw puzzle to proceed toward native conformation. We are trying to delete/mutate the amino acids which bind to specific nucleotides to get more insight into the process of ribosome-assisted protein folding. From studies of only two proteins, we found two identical amino acid-nucleotide pairs out of five such pairs. Whether this is fortuitous or not would be made evident by studying such interactions with a few more proteins.
This research was supported by grants from DBT, CSIR, FIST, UGC-DSA, and UGC-COSIST. D.S. and A.D. are CSIR research fellows.
Published ahead of print on 29 February 2008. ![]()
Supplemental material for this article may be found at http://jb.asm.org/. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»