Previous Article | Next Article ![]()
Journal of Bacteriology, August 2006, p. 5993-6001, Vol. 188, No. 16
0021-9193/06/$08.00+0 doi:10.1128/JB.00460-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington,1 Biosciences Division, Los Alamos National Laboratory, Los Alamos, New Mexico2
Received 3 April 2006/ Accepted 2 June 2006
|
|
|---|
-helix (A61 to A75) nestled against its side. Hydrophobic interactions between residues in the
-helix and ß-strands 3 and 4 hold the
-helix near the ß-sheet core. The electrostatic potential on the solvent-accessible surface is primarily negative with the exception of a positive arginine pocket composed of residues R18, R70, and R74. Steady-state {1H}-15N heteronuclear nuclear Overhauser effects indicate that the protein's core is rigid on the picosecond timescale. The absence of amide cross-peaks for residues G13 to H19 in the 1H-15N heteronuclear single quantum correlation spectrum suggests that this region, a loop between ß-strands 1 and 2, undergoes motion on the millisecond to microsecond timescale. Dali searches using the structure closest to the average structure do not identify any high similarities to any other known protein structure, suggesting that the structure of Rv2302 may represent a novel protein fold. |
|
|---|
|
|
|---|
0.8, protein expression was induced by making the broth concentration 1.0 µM in isopropyl-ß-D-1-thiogalactopyranoside. Following 4 to 8 h of culture growth at 28°C, the cells were harvested and then frozen at 80°C. The thawed cell pellet was resuspended in 35 ml of lysis buffer (50 mM sodium phosphate, 0.3 M NaCl, 10 mM imidazole, pH 8.1), brought to 0.2 µM in phenylmethylsulfonyl fluoride, and passed three times through a French press (SLM Instruments, Rochester, NY). The cell debris was removed by centrifugation for 45 min at 24,000 x g in a JA-20 rotor (Beckman Instruments, Fullerton, CA). Following filtration through a 0.45-µm-pore-size membrane (Corning Incorporated, Corning, NY), the supernatant was loaded onto a 20 ml Ni-nitrilotriacetic acid affinity column (QIAGEN, Valencia, CA) and washed stepwise with buffer (0.3 M NaCl, 50 mM sodium phosphate, pH 8.1) containing increasing concentrations of imidazole (5 to 500 mM). The Rv2302 fraction that eluted at 250 mM imidazole (
30 mg/liter) was dialyzed overnight at 4°C in 4 liters of thrombin cleavage buffer (150 mM NaCl, 20 mM Tris-HCl, pH 8.4). After the volume was reduced to
3 ml (Amicon Centriprep-3), the N-terminal polyhistidine tag was removed by overnight incubation at room temperature with
1 mg of thrombin (Fisher BioReagents, Fair Lawn, NJ) and 3 µl of 1.0 M CaCl2. The cleaved protein was then purified on a Superdex75 HiLoad column (Amersham Pharmacia Biotech, Piscataway, NJ) that simultaneously exchanged it into NMR buffer (100 mM KCl, 20 mM potassium phosphate, 2.0 mM dithiothreitol, pH 7.1). Using a flow rate of 1.0 ml/min, approximately 20 mg of Rv2302 eluted at 80 min, a retention time characteristic of a protein with a calculated monomeric molecular mass of 8,591 Da (Rv2302 plus the three N-terminal residues, GSH, that remain after thrombin cleavage). Sodium dodecyl sulfate-polyacrylamide gel electrophoresis showed the protein to be greater than 98% pure.
Optical spectroscopy.
Circular dichroism (CD) data were collected on an Aviv Model 62DS spectropolarimeter calibrated with an aqueous solution of ammonium d-(+)-camphorsulfonate. The measurements were obtained on an Rv2302 sample (
0.06 mM) in NMR buffer and in a quartz cell of 0.1 cm path length. A thermal denaturation curve for Rv2302 was obtained by recording CD spectra at intervals of 2.5°C from 5 to 80°C and plotting the ellipticity at 220 nm. Wavelength scans for Rv2302 were recorded between 200 and 250 nm at 25°C, 80°C, and 25°C (postheating). Each wavelength spectrum was the result of averaging two consecutive scans with a bandwidth of 1.0 nm and a time constant of 1.0 s. The wavelength spectra were processed by first subtracting a blank spectrum followed by baseline correction and noise reduction.
NMR spectroscopy and resonance assignments.
All NMR experiments were collected on samples with concentrations of
2.0 mM at 25°C using Varian 600- and 500-Inova spectrometers equipped with triple resonance probes and pulsed-field gradients. The data were processed and analyzed with Felix (MSI, San Diego, CA). All chemical shifts were referenced to DSS (2,2-dimethyl-2-silapentane-5-sulfonate; 0 ppm) using indirect methods (41).
The 13C, 1H, and 15N chemical shifts of the backbone and side chain resonances were obtained from the analysis of sensitivity-enhanced two-dimensional 1H-15N (18, 45) and 13C-1H heteronuclear single quantum correlation (HSQC) (16, 18) spectra and three-dimensional (3-D) HNCA, HN(CO)CA, CBCA(CO)NH (18, 30), HNCACB, HNCO, HCCH-total correlated spectroscopy (TOCSY) (19), HCC-TOCSY-NNH (26, 29), and CC-TOCSY-NNH (12, 25, 29) spectra. Distance restraints were obtained from a suite of multidimensional nuclear Overhauser effect spectroscopy (NOESY) experiments using a mixing time of 150 msec: 3-D 13C- and 15N-edited NOESY-HSQC (18, 33, 45) and 4-D CC-NOESY-HMQC (40). Deuterium-exchange studies were performed by lyophilizing an NMR sample and redissolving in 99.8% D2O (the 1H-15N HSQC spectrum of a lyophilized sample redissolved into 10% D2O-90% H2O was essentially identical to the 1H-15N HSQC spectrum obtained prior to lyophilization). Two-dimensional 1H-15N HSQC spectra were recorded 0.5, 2, 4, and 96 h after the exchange. Steady-state {1H}-15N heteronuclear NOE values were measured from the ratios of 1H-15N HSQC cross-peak volumes in spectra recorded in the presence (Isat) and absence (Iunsat) of three seconds of proton presaturation prior to the 15N excitation pulse (NOE = Isat/Iunsat) (9).
Structure calculations.
Structures were calculated with the CNS program (version 1.1) (2) using the experimentally derived restraints listed in Table 1. Pseudoatom corrections were added to the upper bound of the following stereochemically unassigned methylene protons and methyl groups: 1.0 Å for methylene protons, 2.0 Å for chemically equivalent aromatic protons, 1.5 Å for methyl protons, and 2.4 Å for pairs of methyl groups in Leu and Val residues. The latter pseudoatom corrections were applied only to the methyl groups for 5 out of the 11 Val residues and 1 of the 2 Leu residues because the others were stereospecifically assigned using a biosynthetically directed 13C-labeled sample and observing the carbon-carbon splitting of the Pro-R methyl group in the 13C-1H HSQC spectrum (31). Dihedral angle restraints for phi (
) of 57 ± 25° (
-helices) and 139 ± 40° (ß-strands) were obtained through the chemical shift index analysis of the carbon and proton chemical shifts (42). One of the three Pro residues (P36) was determined to be in the cis conformation on the basis of a large difference in the Pro 13Cß and 13C
chemical shift (34) and NOE pattern. Deuterium exchange experiments revealed a subset of 24 cross-peaks in the 1H-15N HSQC spectrum that still remained 30 min after the exchange. This subset of more slowly exchanging protons corresponded to amides in ß-strands. When the acceptor oxygen for the slowly exchanging amide was identified from preliminary structural ensembles, hydrogen bond restraints (1.8 to 2.3 Å and 2.8 to 3.3 Å for the NH-O and N-O distances, respectively) were introduced into the structure calculations. Identical hydrogen bond restraints were added for backbone amide protons in an
-helical region (residues 65 to 75) identified in the preliminary structural ensembles (and confirmed by chemical shift index analysis). Note that the latter amides all exchanged with deuterium within 30 min, consistent with the observation that hydrogens in
-helix hydrogen bond networks generally exchange before hydrogens in ß-strand hydrogen bond networks (8). When no persistent distance or dihedral angle violations were observed in the initial ensemble of calculated structures of lowest energy, dihedral angle restraints for psi (
) of 47 ± 30° (
-helices) and 140 ± 40° (ß-strands), based on the elements of secondary structure identified in the structural ensemble and TALOS calculations (7), were introduced into the calculations. The final set of 55 calculations, using the restraints compiled in Table 1, generated an ensemble of 25 low-energy structures (in terms of total and NOE energies). The structures in this ensemble were refined with explicit water (24) using force constants of 500 and 2,000 kcal for the NOE and dihedral restraints, respectively. This final ensemble was then used to calculate a mean structure and average root mean square deviation (RMSD) values to the mean structure. Structural quality was assessed using PROCHECK-NMR (23). Structural similarity searches were performed using the DALI server (www.ebi.ac.uk/dali/interactive.html) (13).
|
View this table: [in a new window] |
TABLE 1. Summary of the structural statistics for RV2302a
|
|
|
|---|
0.06 mM sample collected at 25°C in the same buffer used to collect NMR data. The double minimum at
215 and
208 nm and projected maximum at a wavelength of <200 nm are characteristic of a structured protein with a mixture of ß-sheet and
-helical content (14). The dotted line in Fig. 1A is the CD spectrum of the same sample at 80°C. Relative to the spectrum collected at 25°C, the minimum at
215 nm has increased and red-shifted to
218 nm, while there is only a slight increase and small blue shift in the minimum at
208 nm. These observations at 80°C indicate that Rv2302 is more unstructured at elevated temperatures. However, the absence of an extrapolated negative minimum at
198 nm and a positive maximum at
218 nm indicates that Rv2302 is not entirely random coil at 80°C and still has some ß-sheet structure. When the sample is cooled to 25°C again, the CD spectrum, indicated by the dashed line in Fig. 1A, is very similar to the original CD spectrum collected prior to heating (solid line). This indicates that the temperature-induced effects to the structure of Rv2302 are largely reversible.
![]() View larger version (17K): [in a new window] |
FIG. 1. (A) CD spectra of Rv2302 (0.06 mM) in NMR buffer collected at 25°C (solid line), 80°C (dotted line), and 25°C postheating (dashed line). (B) CD thermal melt for Rv2302 (0.06 mM) in NMR buffer. Data points were collected at 220 nm in intervals of 2.5°C between 5 and 80°C.
|
60°C, at which point a plateau is reached with an inflection point at approximately 45°C. Because the CD spectra in Fig. 1A suggest that at 80°C the protein is not entirely unstructured and will refold upon cooling back to 25°C, the plateau in Fig. 1B may not represent a fully unstructured protein. If this is true, then the inflection point in Fig. 1B may represent an initial stage in protein denaturation. Because ß-sheets are typically more robust than
-helices (8), if the inflection point in Fig. 1B represents a first stage in the denaturation of Rv2302, then it may reflect an unraveling of the C-terminal
-helix. Solution state NMR. The elution time of Rv2302 on a size exclusion column was consistent with a globular 8.6-kDa protein, and this was corroborated by the 1H-15N HSQC spectrum for Rv2302 shown in Fig. 2. The line widths and chemical shift dispersion of the HSQC cross-peaks are characteristic of a folded, monomeric protein of molecular size in the 10-kDa range. Further evidence that Rv2302 behaved as a monomer in solution is the lack of intermolecular NOEs in any of the NOE experiments (3-D 13C- and 15N-edited NOESY-HSQC and 4-D CC-NOESY-HMQC). As indicated in Fig. 2, all the 1H-15N HSQC cross-peaks were unambiguously assigned except for the single cross-peak labeled with a question mark (tentatively assigned to the side chain amide of R40). Side chain and amide resonances could not be observed, or assigned, for seven residues between G13 and H19, corresponding to a loop (L1) between ß-strands 1 and 2.
![]() View larger version (23K): [in a new window] |
FIG. 2. An 1H-15N HSQC spectrum of Rv2302 with the assigned cross-peaks labeled. Unassigned cross-peaks are identified with a question mark, and weak cross-peaks are identified with an "x." The spectrum was collected at 25°C in NMR buffer (100 mM KCl, 20 mM potassium phosphate, 2.0 mM dithiothreitol, pH 7.1) at a 1H resonance frequency of 600 MHz.
|
-
pairs for Rv2302 are found in the most favored regions, and 20% are within additionally allowed regions (calculated using PROCHECK-NMR) (23). The Ramachandran statistics improve to 84 and 15% for the
-
pairs in the most favored and additionally allowed regions, respectively, when only the ordered residues in the ensemble are analyzed.
The 25 calculated structures in the ensemble converge well, as shown by the statistics in Table 1. The RMSD of the structured regions in the ensemble to the mean structure is 0.57 Å for the backbone atoms (N-C
-C=O) and 1.12 Å for all heavy atoms. The degree and location of convergence of the 25 calculated structures in the ensemble are graphically depicted in Fig. 3A. For each residue of each structure in the ensemble, the mean pairwise RMSD to the mean structure is plotted. The values were generated by moving a window of three residues along the sequence, calculating the mean pairwise RMSD (Å) to the average structure, and plotting the value over the central residue. The figure illustrates that all five ß-strands and the
-helix are well defined, with mean pairwise RMSDs per residue in the 0.2 Å range for all but one residue (V11). Two of the four loops are less ordered, with mean pairwise RMSDs per residue approaching 0.8 Å; however, the four-residue loop between ß-strands 3 and 4 is well defined.
![]() View larger version (39K): [in a new window] |
FIG. 3. (A) Plot of the mean pairwise RMSDs to the average structure for each residue in the Rv2302 ensemble in Table 1. The data were generated by moving a window of three residues along the sequence and plotting the mean pairwise RMSD (Å) over the central residue. (B) Backbone {1H}-15N heteronuclear NOE values for Rv2302. Proline residues are indicated by asterisks. The top axis shows the residue number, and the bottom axis shows the elements of secondary structure: solid bar, ß-sheet; open oval, -helix; L, loop.
|
-helical and ß-sheet secondary structure, and this observation is corroborated by the NMR-based structure determined for Rv2302 shown schematically in Fig. 4A and with Molscript (21) in Fig. 4B. The core of the structure is a five-strand antiparallel ß-sheet with a conspicuous ß-sheet twist (5) that is more clearly illustrated in Fig. 4C. The length and relative orientation of the five ß-strands are illustrated in the secondary structure diagram in Fig. 4A. The absence of 1H-15N HSQC cross-peaks for most of the residues between ß-strands 1 and 2 (L1) suggests that this region is not rigidly structured in solution and qualitatively implies that these residues are undergoing conformational exchange on an intermediate timescale (millisecond to microsecond) (4). The regions between the other ß-strands in the sheet are more ordered relative to L1, as suggested by the lower pairwise RMSDs of these loop residues in the ensemble structures to the mean structure (Fig. 3A). Indeed, Fig. 3A indicates that the loop between ß-strands 3 and 4 is very well defined. Nestled against the backside of the ß-strand core is a 15-residue
-helix. A hydrophobic core consisting of the side chains of residues A72 and A75 in the
-helix and the side chains of residues V50 and T48 (methyl group) in ß-strand 4 and W41 and V39 in ß-strand 3 appears responsible for holding the
-helix near the ß-sheet. Indeed, a number of long-range NOEs were observed between the side chains of these residues. While the
-helix is drawn continuously between residues 61 and 75 in Fig. 4B, there is actually a slight bend in the helix that is more evident in the view shown in Fig. 4C.
![]() View larger version (25K): [in a new window] |
FIG. 4. (A) Secondary structure diagram of Rv2302. The -helices are drawn as red ovals and the ß-strands as solid blue arrows with the residue number of the beginning and the end of each element shown. (B) Molscript ribbon representation of the average structure of the ensemble of calculated structures for Rv2302 produced using MOLMOL (20). The ß-strands are shown in blue, and the -helices are red. (C) View of the protein looking directly down upon the ß-sheet core highlights the twist in the ß-sheet. The protein, drawn using PyMOL, is rainbow colored (ROYGBIV [red, orange, yellow, green, blue, indigo, violet]), starting from the C terminus.
|
180°. One face is highlighted by a negatively charged (red) surface, and the other face is dominated by two positively charged (blue) pockets. The largest pocket of positive charge is composed of three arginine residues, R18, R70, and R74 (Fig. 5B). While R70 and R74 are part of an
-helix, R18 is in the unstructured loop between ß-strands 1 and 2 that the 1H-15N HSQC suggests has motion on the intermediate NMR timescale. Because intermediate timescale motion is often associated with activity at, or near, an active site (15), R18 may have a mechanistic role in the biochemical function of Rv2302.
![]() View larger version (72K): [in a new window] |
FIG. 5. (A and B) PyMOL-generated maps of the electrostatic potentials at the solvent-accessible surface of Rv2302 showing views of the protein that differ by an 180° rotation of the molecule about the vertical axis. Positively charged surfaces are shown in blue, and negatively charged surfaces are red. Three positively charged arginine residues, R18, R70, and R74, are clustered together to form an arginine pocket. (C) PyMOL-generated ribbon representation of the Rv2302 structure. The regions with the highest ConSurf (11) scores from Fig. 6 are shown in magenta (9) and pink (8), and the three side chains that comprise the arginine pocket are shown in blue. (D) PyMOL-generated surface representation of panel C rotated approximately 180° highlights a conserved surface-exposed region composed of loops 12 and 14.
|
Identification and mapping of the conserved regions of Rv2302. A BLAST search of the Institute for Genomic Research Comprehensive Microbial Resource data bank results in only four proteins with 58% or more sequence conservation and 40% or more sequence identity with residues M1 to A71 of Rv2302. CLUSTAL W (39) sequence alignment of these four proteins relative to Rv2302 is illustrated in Fig. 6 along with the ConSurf (11) preliminary identification of the most conserved residues in this group, residues contained in ß-strands 1 through 4 and loops 2 and 4. Loops 2 and 4 both contain five consecutive, highly conserved residues of consensus sequence: G(S/T)PPY and PGPD(A/S), respectively. In Fig. 5C and D the residues with the two highest ConSurf scores, 9 (magenta) and 8 (pink), are highlighted on the ribbon (5C) and surface (5D) structure of Rv2302 in orientations that differ by 180°. The conservation in the ß-strands results in a highly conserved surface on one face of the ß-sheet, as shown in Fig. 5C. On the opposite face (Fig. 5D), the conserved residues in loops 2 and 4 occupy a significant patch on the protein's surface. Further genomic sequencing of additional organisms and biochemical studies on Rv2302 are necessary to determine if these putative conserved regions have any role in the biochemical function of Rv2302. Interestingly, the three arginine residues that form a pocket on the surface of Rv2302 are not conserved in the aligned sequences and occupy a region adjacent to the highly conserved surfaces. Hence, if the arginine pocket has a biological function in Rv2302, it may be unique to Mycobacteria.
|
View larger version (16K): [in a new window] |
FIG. 6. Sequence alignment of Rv2302 (residues M1 to A71) with four other closely related sequences using the program CLUSTAL W (39). The color scheme is as follows: hydrophobic, red; hydrophilic, green; acidic, blue; basic, magenta. Sav, Streptomyces avermitilis (Sav6750); Sco, Streptomyces coelicolor (Sco0174 and Sco1589); Tfu, Thermobifida fusca YX (Tfu1961). Residue positions with the highest ConSurf scores (11), 9 (asterisk) and 8 (caret), are indicated below the alignment. The helical and ß-strand regions of Rv2302 are identified by red ovals and blue rectangles, respectively.
|
Because the Dali search failed to identify a likely biochemical function for Rv2302, the program ProKnow was used to infer a possible biochemical role for the protein (32). ProKnow uses the primary amino acid sequence along with the known, or predicted (using DASEY), structure to identify and score the most likely biochemical functions of a protein (http://www.doe-mbi.ucla.edu/Services/ProKnow/biolatlas.phl). This program predicts that Rv2302 is a DNA binding protein, having a Bayesian score of 0.71, an evidence rank of 2.7, and 6 clues (Bayesian score range of 0 to 1 [best], evidence rank range of 0 to 6 [best], and clue range of 0 to 9 [best]).
Microarray studies of M. tuberculosis implicate Rv2302 in a couple of biological responses. In one study the Rv2302 gene was upregulated 4.14-, 4.84-, and 6.58-fold in response to 4, 24, and 96 h of starvation, respectively (1). Identification of such genes is important because M. tuberculosis is known to exist for long periods in a nongrowing, drug-resistant state, and proteins that are upregulated following exposure to conditions that mimic this latent state may be potential targets for new M. tuberculosis drugs. In a second study, the Rv2302 gene was upregulated 1.83-fold relative to wild-type cells in the double-deletion mutant of two heat-shock regulons, HrcA and HspR (37). Partial disruption of the regulatory circuits that affect an overexpression of heat-shock proteins in M. tuberculosis appears to play an important role in pathogenesis, impairing the ability of M. tuberculosis to establish a chronic infection (36).
Conclusions.
Rv2302 is a small, 80-residue polypeptide with no known biological function that folds into a five-strand antiparallel ß-sheet with a 15-residue C-terminal
-helix nestled onto one side. While antiparallel ß-sheets are common, a Dali search of this structure versus all known structures in the Research Collaboratory for Structural Bioinformatics Protein Data Bank failed to identify any other proteins with a similar structure and/or configuration of antiparallel ß-strands, indicating that the Rv2302 fold has not been observed before. CD spectroscopy suggests that Rv2302 is rather robust, being able to refold back into its native structure after being heated to 80°C. If unusual molecular dynamics, relative to the molecule as a whole, plays a role in the protein's function, then loop 1 between ß-strands 1 and 2 may be important. Well-defined regions of positive and negative charges on the surface of Rv2302 may provide an interface for electrostatic associations between a substrate with a well-defined electrostatic surface. For example, ProKnow predicts that Rv2302 binds DNA, and a small positively charged region is observed on the surface of Rv2302 that may be a potential surface to bind to the negatively charged phosphodiester backbone of DNA. Microarray studies of M. tuberculosis response to starvation and overexpression of heat shock proteins show that Rv2302 is upregulated, suggesting that it may play a biochemical role in starvation survival and the heat shock response. However, further studies are necessary to uncover the biochemical function of Rv2302. If a function for Rv2302 is found, the structure presented here will contribute to a molecular understanding of its function and potentially speed up the conception and development of new therapies to fight the spread of tuberculosis around the world.
We thank Debnath Pal at the UCLA-DOE Institute for Genomics and Proteomics for assistance with ProKnow.
|
|
|---|
-helical coiled-coils controls protein folding and stability. J. Biol. Chem. 278:35248-35254.
-helices, twisted ß-sheets and ß-turns. Biophys. Chem. 31:77-86.[CrossRef][Medline]
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»