Previous Article | Next Article ![]()
Journal of Bacteriology, July 2003, p. 4031-4037, Vol. 185, No. 14
0021-9193/03/$08.00+0 DOI: 10.1128/JB.185.14.4031-4037.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
John Toedt,1 Edward Eisenstein,1 Andrew J. Howard,2 and Gary L. Gilliland1*
Center for Advanced Research in Biotechnology, University of Maryland Biotechnology Institute and the National Institute of Standards and Technology, Rockville, Maryland 20850,1 Center for Synchrotron Radiation Research and Instrumentation, Biological, Chemical, and Physical Sciences Department, Illinois Institute of Technology, Chicago, Illinois 606162
Received 13 January 2003/ Accepted 31 March 2003
|
|
|---|
-helical coiled coil. The C-terminal domain features a six-stranded half-barrel that curves around an
-helix. The crablike three-domain structure of YchF suggests the binding site for a double-stranded nucleic acid in the cleft between the domains. The structure of the putative GTP-binding site is consistent with the postulated guanine specificity of the protein. Fluorescence measurements have demonstrated the ability of YchF to bind a double-stranded nucleic acid and GTP. Taken together with other experimental data and genomic analysis, these results suggest that YchF may be part of a nucleoprotein complex and may function as a GTP-dependent translation factor. |
|
|---|
-helices, where the P-loop follows the N-terminal (central) ß-strand, and the Walker B motif resides in the adjacent ß-strand.
According to the most recent classification of P-loop NTPases (19), YchF belongs to the TRAFAC class that includes translation factors (hence the name of the class), Ras-related GTPases, heterotrimeric G proteins, septins, dynamins, and motor ATPases such as kinesin and myosin. The members of the class are characterized by a conserved threonine in the region between the Walker A and B motifs. This residue binds the Mg2+ required for NTP hydrolysis. Structurally, the distinctive feature of the TRAFAC class is an antiparallel ß-strand next to the Walker B strand. From the functional point of view, these proteins are typical molecular switches that change the conformation upon NTP hydrolysis. Two regions are particularly sensitive to the presence of the
-phosphate, the loop preceding the antiparallel ß-strand (switch I) and the loop following the Walker B strand (switch II).
The presence of a glycine-rich motif in the switch II region as well as the conserved motif YXFXTXXXXXG in the switch I region are distinctive features of the Obg superfamily (19). Besides the YchF family, it includes four other protein families: Obg, DRG, Ygr210, and NOG1. The YchF and Ygr210 families form one branch of the superfamily with the YchF proteins present in bacteria and eukaryotes and the Ygr210 proteins present in archaea and fungi. The NTPase domains of all members of the superfamily share significant sequence similarity. For instance, YchF and YhbZ (the Obg homolog) from E. coli are 32% identical over a fragment of 210 residues. However, the rest of the structure is quite different, as are the regulatory pathways these proteins are involved in (6).
The specificity of G proteins to guanine is determined by the conserved NKXD motif, also known as G4 (5), located at the end of strand ß6 (Fig. 1). A peculiar feature of YchF is the lack of the conserved lysine in this sequence and the replacement of the aspartate with glutamate. This is a unique exception among the TRAFAC GTPases which, in the absence of experimental data, raises questions regarding the nucleotide specificity of YchF.
![]() View larger version (58K): [in a new window] |
FIG. 1. Ribbon presentation of the polypeptide fold of YchF. The -helical domain is shown in green. Disordered loops are shown in white. The NTP-binding P-loop is shown in purple. The figure was produced with Molscript (18) and Raster3D (23).
|
|
|
|---|
Crystallization and structure determination. YchF crystals were obtained at room temperature in hanging drops from 0.1 M Bicine (pH 7.5), 0.3 M magnesium acetate, and 13% polyethylene glycol 8000. They belong to the space group P21 with the following cell parameters: a, 61.0; b, 93.6; c, 85.2 Å; and ß, 100.1°. There are two molecules in the asymmetric unit.
X-ray diffraction data from one native crystal (Table 1) were collected at 100°K on the IMCA-CAT beamline 17-ID at the Advanced Photon Source (Argonne, Ill.) with a Mar charge-coupled device detector. Heavy atom derivatives were measured on a Bruker rotating anode generator with a Mar345 image plate detector. All data were processed with HKL2000 (30). The structure was solved by multiple isomorphous replacement with uranium (5 mM UNO3 soaked for 1 h) and gold [9 mM KAu(CN)4 soaked for 1 h] derivatives. A single uranium site and two gold sites were located by SOLVE (39) and were used in phase refinement with SHARP (10). The atomic model was built by using O (15) and refined with REFMAC (26) to an R factor of 0.208 for all data in the resolution range of 10 to 2.4 Å (Rfree = 0.247 for 5% data). Crystallographic calculations were performed with the CCP4 program suite (8). No restraints on the noncrystallographic symmetry were imposed during refinement. Refinement statistics are given in Table 1. One residue, Val287 in chain B, has unusual main-chain torsion angles (
= 28.1°;
= -110.2°) that correspond to the epsilon region of the Ramachandran plot. The electron density for this residue is clear and unambiguous. Atomic coordinates and structure factors have been deposited in the Protein Data Bank (2) under the accession code 1JAL.
|
View this table: [in a new window] |
TABLE 1. X-ray data and refinement statisticsa
|
The fluorescence of 2'(3')-O-(2,4,6-trinitrophenyl)GTP (TNP-GTP) (Molecular Probes) was measured with an excitation wavelength of 408 nm and a maximum emission spectrum of 555 nm. The reaction mixture contained 20 mM Tris-HCl (pH 7.5), 50 mM NaCl, and other components as indicated in the text. The titration curve was plotted as an increase in fluorescence with respect to the protein-free experiment versus the nucleotide concentration by using Kaleidagraph (Synergy Software).
|
|
|---|
-helices on both sides. The strand order is 2-4-1-5-6-7, with strand ß2 antiparallel to the other strands. A 23-residue loop following ß2 forms part of the C-terminal (C) domain, which also includes residues 279 to 363. Domain C is dominated by the mixed ß-sheet curved as a half-barrel around helix
12. An 80-residue polypeptide following strand ß5 folds into the
-helical (A) domain consisting of two long
-helices (
5 and
6) and one short
-helix (
7). The loops connecting the three domains (Fig. 1) are not visible in the electron density, suggesting not only the flexibility of the loops but also the mobility of the corresponding domains.
As a P-loop NTPase, YchF is characterized by the presence of two conserved sequence motifs involved in the binding of the phosphates of NTP (Fig. 2). The Walker A sequence G9XXXXGKS16 forms the P-loop between strand ß1 and the following
-helix. The Walker B sequence D72XXG75 resides in the adjacent ß-strand, ß4, and contains an aspartic acid (Asp72) usually involved in the coordination of a Mg ion. Neither NTP nor Mg is observed in the present structure, although Mg2+ was in the crystallization solution.
![]() View larger version (21K): [in a new window] |
FIG. 2. Amino acid sequence of the H. influenzae YchF in one-letter code. Residues conserved in all family members with no more than one exception are enclosed in boxes. Residues with conservative replacements in two or three family members are in bold letters. Secondary structure elements calculated with DSSP (16) are indicated above the sequence.
|
![]() View larger version (40K): [in a new window] |
FIG. 3. Guanine recognition site in Ras (Protein Data Bank entry 121P), a typical G protein (a), and YchF (b). GTP is shown in purple, as observed in Ras and as modeled in YchF. The lysine residue of the consensus sequence NKXD (Lys117 in Ras) is spatially replaced with Phe107 in YchF.
|
Binding of GTP by YchF was assessed by measuring fluorescence of the nucleotide labeled with TNP. In the absence of Mg2+, the fivefold increase in fluorescence was observed upon the addition of 10 µM protein to 10 µM nucleotide. Titration of the nucleotide at a fixed protein concentration produced a hyperbolic curve that gave a binding constant for TNP-GTP of about 5 µM (Fig. 4).
![]() View larger version (18K): [in a new window] |
FIG. 4. Fluorescence of TNP-GTP in the absence and presence of GTP. The protein was present at a concentration of 2.5 µM. The increase in the fluorescence intensity of TNP-GTP with respect to that in the protein-free solution is plotted over the TNP-GTP concentration. The fluorescence intensity was recorded at 25°C at a wavelength of 555 nm, and the excitation wavelength was 408 nm.
|
The effect of Mg2+ on the nucleotide binding was monitored in a separate experiment (Fig. 5). First, the complex with TNP-GTP was formed in the absence of Mg2+, which was visualized as an increase in fluorescence (Fig. 5, curve 2). Then Mg2+ was added to the reaction mixture, which caused an immediate quenching of fluorescence (Fig. 5, curve 3). We attribute the effect of the inhibition of nucleotide binding to the excess of Mg2+ rather than to the hydrolysis and release of the product by YchF. Inhibition of GTP binding at millimolar Mg2+ concentrations is typical for most of the G proteins (14, 22, 25, 36), though not for all of them. For instance, the Caulobacter crescentus CgtA protein, a member of the Obg family, shows a 60-fold increase in affinity to GTP (1.2 versus 65 µM) in the range of 5 to 15 mM Mg2+ and a decrease at higher concentrations of Mg2+ (20).
![]() View larger version (17K): [in a new window] |
FIG. 5. Fluorescence spectra of TNP-GTP in the absence and presence of YchF and Mg2+. Line 1, 2.5 µM TNP-GTP; line 2, 2.5 µM TNP-GTP plus 2.5 µM YchF; line 3, 2.5 µM TNP-GTP plus 2.5 µM YchF plus 2 mM Mg2+. The emission spectrum was recorded at 25°C, and the excitation wavelength was 408 nm.
|
7 of domain A and helix
12 of domain C, which face each other, are likely to be key elements of nucleic acid binding. The distance between them as measured between C
atoms of Gln189 and Lys303 is 20.4 Å, which is about the size of the nucleic acid duplex.
![]() View larger version (80K): [in a new window] |
FIG. 6. Electrostatic surface potential calculated with GRASP (28). Positive charges are blue, and negative charges are red. The orientation of the molecule is as shown in Fig. 1.
|
![]() View larger version (20K): [in a new window] |
FIG. 7. Quenching of fluorescence of tryptophan upon addition of dsDNA to the YchF protein. The protein concentration was 100 nM. Line 1, no dsDNA; line 2, 50 nM (per base pair) dsDNA; line 3, 100 nM (per base pair) dsDNA.
|
|
|
|---|
C and D loops and the variable stem of tRNASer.
Domain C has a topology typical for the ubiquitinlike superfamily of
+ß proteins (SCOP) (27). Besides ubiquitin-related proteins, the family includes Ras-binding domains of different effectors and, most notably, the N-terminal domain of threonyl-tRNA synthetase (ThrRS) (32). The similarity of the latter to YchF is striking: the root mean square (RMS) deviation of 62 common C
atoms is only 2.0 Å. The domain is not involved in the binding of the cognate tRNAThr, and its function remains unknown. At the sequence level, the N-terminal domain of ThrRS represents the conservative TGS domain (PF02824 in the Pfam database) (1), which was also found in guanosine polyphosphatase SpoT and some GTPases (42).
Thus, both domains A and C appear to be related to RNA-binding proteins, and their presence in YchF suggests a similar feature for this protein. The ability of YchF to bind dsDNA further supports the contention that some nucleic acid may be a natural substrate for YchF. The putative nucleic acid-binding surface of YchF may not be limited to helices
7 and
12, which are positioned to embrace the nucleic acid duplex. The extended binding surface of domain C may include His309 and Asp311 adjacent to helix
7 and a cluster of polar residues Arg318, Glu320, Lys337, Glu338, and Arg360 on the outer surface of the domain. Most of these residues are conserved. All together, they form a continuous patch on the domain surface, suggesting a possibility that nucleic acid winds around the C domain. Given the small size of the domain, it would require a sharp bending of the nucleic acid, which is unlikely for linear dsDNA but possible for the loopy structure of RNA.
Domain G has all the features of the GTP-dependent molecular switches (4, 34). These proteins exist in two major conformations, and the switch between them depends on the GTP-GDP exchange. Two regions adjacent to the nucleotide site transmit the signal between the active center and periphery of the protein. The switch I region includes strand ß2 and the preceding loop. A threonine residue (Thr36 in YchF) at the tip of the loop binds the
-phosphate of GTP and Mg2+ and moves away from the active site upon GTP hydrolysis (the switch I region is completely disordered in the present nucleotide-free structure). A glycine residue (Gly75 in YchF) of the Walker B motif is a key element of switch II. Located at the end of the ß-strand, it binds the
-phosphate and promotes a conformational rearrangement of the following loop upon GTP hydrolysis. Glycine is conserved in this position to maintain the restrained conformation of the chain (
/
= 134/160° in YchF). The following glycine-rich loop forms a short
-helix (
3). The high level of sequence conservation of both switch regions in the YchF family (Fig. 2) indicates their functional importance, which is likely related to the regulatory function typical for G proteins.
Ras-like G proteins are characterized by a relatively low intrinsic rate of GTP hydrolysis (37). Structural and mutagenesis studies on the H-ras p21 protein have suggested a key role of Gln61 in activating a water molecule for a nucleophilic attack on the
-phosphate of GTP (31). It was also noted that the Q61E mutant of H-ras p21 had a 20-fold-greater rate of GTP hydrolysis than the wild-type protein, whereas the substitution of Gln61 by other amino acids reduced the GTPase rate (11, 12). Gln61 immediately follows the Walker B motif in p21. While Gln61 is conserved in the ras-like G proteins, this position is occupied by a histidine residue in elongation factors (which is in accord with their lower GTPase activities) and by hydrophobic residues in the other members of the TRAFAC class (19), suggesting that they lack an efficient catalytic machinery for GTP hydrolysis. With the highly conserved leucine residue (Leu76) in this position, the YchF structure does not indicate any alternative residue that may act as a general base in catalysis. It seems likely that YchF and other members of the family are not GTPases but rather sensors of the relative levels of GTP and GDP.
The putative nucleic acid-binding domains A and C are in direct contact with the switch regions of domain G, so that the conformational changes upon GTP-GDP transition may be transmitted to the nucleic acid site and regulate the nucleic acid-binding ability of the protein. The 23-residue loop protruding from domain G (residues 45 to 67) is part of domain C and is also closely associated with the switch I region. The coiled coil of domain A is in contact with the glycine-rich helix
3, which is part of switch II. The long helical structure of domain A implies a lever-like mechanism of conformational changes associated with nucleic acid binding. The involvement of the coiled coil in the binding a nucleic acid is expected due to a number of conserved basic residues at the tip of the
-hairpin.
Based on the analysis of the sequences, YchF has been assigned to the Obg superfamily, which includes four other families of GTPases (19). Obg is an essential gene that is involved in DNA replication in C. crescentus (21) and Streptomyces griseus (29) and is associated with the ribosome (35). Several members of the superfamily, including YchF, possess the TGS domain related to the RNA-binding proteins (42). These observations, taken together with the universal phyletic distribution of the proteins, allowed Leipe and coauthors to suggest that these proteins may be uncharacterized translation factors (19).
Another indication that YchF is involved in translation comes from the genome-scale project aimed at the characterization of multiprotein complexes in Saccharomyces cerevisiae (13). By using tandem-affinity purification (TAP), it was shown that the yeast homolog of YchF, YBR025c, interacts with the components of the multisubunit translation elongation factor eEF1.
Furthermore, a relation of YchF to ribosomal proteins can be deduced from the analysis of the gene expression in E. coli under conditions of genotoxic stress. YchF appears to be among the genes downregulated in response to the DNA damage (17). Changes in gene expression after treatment of E. coli culture with mitomycin C were assessed with DNA array technology. The expression profile of ychF in these experiments correlates with the genes related to protein biosynthesis, including those coding for ribosomal proteins, which are also downregulated.
Finally, in many bacterial genomes including both gram-negative and gram-positive species, ychF is located next to the pth gene, which codes for peptidyl-tRNA hydrolase. This protein ensures the recycling of peptidyl-tRNAs produced through abortion of translation. The phylogenetic pattern of pth is similar to that of ychF, suggesting that the proteins may be functionally related. Northern blot experiments showed that both pth and ychF are cotranscribed in a bicistronic transcript (9).
The exceptionally high level of amino acid conservation typical for ribosomal proteins suggests a key role for YchF in the cell. Most of the conserved residues are implicated in binding GTP and nucleic acid as discussed above. On the other hand, the amino acid conservation in the YchF family extends far beyond the GTP-binding motifs and is not limited to the putative nucleic acid-binding site. Of particular interest is the cluster of acidic residues in the fragment containing residues 241 to 258. These residues form a negatively charged surface centered on helix
10 in the G domain. Although most of them are not strictly conserved, the overall negative charge of the surface is preserved. The surface may be involved in interactions with other components of the putative ribonucleoprotein complex. One possibility is that helix
10 mimics a nucleic acid duplex and thus competes for a binding site on an effector protein. An example of such a molecular mimicry is a C-terminal
-helix in DinI, a negative regulator of the SOS response in E. coli (40). By competing for the DNA-binding site of RecA, DinI interferes with the activation of RecA.
The crystal structure of the YchF homolog from yeast has been recently solved by Lima and coworkers (Protein Data Bank entry 1NI3). The two structures are very similar and can be superimposed with the RMS deviation of 2.3 Å over 306 common C
atoms. The similarity is particularly strong in the TGS (C-terminal) domains, the RMS deviation for which is only 0.8 Å. The largest deviation between the structures, up to 15 Å for the equivalent C
atoms, is in the switch II region. This glycine-rich fragment forms a helix (
3) in the E. coli protein, whereas in the yeast protein it lacks any secondary structure. Although the difference may be partly due to crystal contacts that involve this fragment in the yeast structure, it indicates the conformational flexibility of switch II, which is well documented for other G proteins. In spite of different crystallographic environments, both structures reveal the switch I region and the linker between domains G and A as the most mobile parts of the protein molecule, as can be judged from their high atomic B factors. Apparently, the flexibility of these functionally important parts of the molecule is an intrinsic feature of the protein rather than a crystallographic artifact.
The crystal structure of YchF is the first three-dimensional structure obtained for a protein family widely represented in all forms of life. Analysis of the structure and available experimental data support the contention that YchF may function as a GTP-dependent nucleic acid-binding protein. Its involvement in translation is a distinct possibility. These results provide a basis for further studies of this undoubtedly important protein family.
Certain commercial materials, instruments, and equipment are identified in this manuscript in order to specify the experimental procedure as completely as possible. In no case does such identification imply a recommendation or endorsement by the National Institute of Standards and Technology nor does it imply that the materials, instruments, or equipment identified is necessarily the best available for the purpose.
Present address: Xencor Co., Monrovia, CA 91016. ![]()
|
|
|---|
B stress response transcription factor cofractionate with ribosomes. J. Bacteriol. 182:2771-2777.
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»