Previous Article | Next Article ![]()
Journal of Bacteriology, March 2007, p. 1899-1904, Vol. 189, No. 5
0021-9193/07/$08.00+0 doi:10.1128/JB.01435-06
Copyright © 2007, American Society for Microbiology. All Rights Reserved.

Unité de Biochimie Structurale and CNRS URA 2185, 25 rue du Docteur Roux, 75724 Paris, France,1 Synchrotron Soleil, L'Orme des Merisiers, Saint Aubin, BP 48, 91192 Gif-sur-Yvette, France,2 Plate-forme de Cristallogenèse et Diffraction des Rayons X, 25 rue du Docteur Roux, 75724 Paris, France,3 Unité de Génétique Moléculaire Bactérienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris, France4
Received 10 September 2006/ Accepted 5 December 2006
|
|
|---|
|
|
|---|
The product of the M. tuberculosis gene Rv0813c is one of these hypothetical proteins confined to the order Actinomycetales. The inspection of mycobacterial genomes reveals a conserved arrangement of the genetic locus around Rv0813c, but this conservation provides no further functional hints, since neighboring genes also lack reliable functional annotation. In an attempt to provide new elements for function discovery, we have undertaken the structural characterization of Rv0813c. Here, we show that Rv0813c is a representative of a new family of bacterial proteins strongly resembling the eukaryotic fatty acid-binding proteins, and we discuss the structural and functional implications of these observations.
|
|
|---|
Initial crystallization screenings were carried out on sitting drops (200 nl of protein, 5.5 mg/ml, plus 200 nl of the crystallization solution equilibrated against 150 µl of the well solution) using a Cartesian Technologies nanoliter dispenser workstation. Initial hits were manually reproduced and optimized. The best crystals were grown in 2 M NaCl, 5% (vol/vol) dioxane, and 100 mM HEPES, pH 6.5, at 18°C and reached a size of 200 by 200 by 300 µm (50 by 50 by 100 µm for the SeMet-labeled protein) in 3 days.
X-ray diffraction data were collected on beamline ID29 at the European Synchrotron Anomalous Diffraction (Grenoble, France) from frozen crystals (100 K) in mother liquor plus 25% (vol/vol) glycerol. A 1.7-Å-resolution data set was collected from a single native crystal, and complete SAD data were collected from a SeMet-labeled crystal for the X-ray wavelength corresponding to the peak of the selenium K edge. Data reduction was carried out with the programs MOSFLM and SCALA from the CCP4 software package (3). The two selenium sites were readily located with the programs SHELXD (25) and SOLVE (26). SAD phases were calculated to 2.6-Å resolution and extended to 1.7 Å using the program RESOLVE. The resulting electron density map (Fig. 1A) allowed the automatic tracing of over 80% of the polypeptide chain with the program ARP/WARP (20). Refinement of the native structure was carried out with REFMAC (21) alternated with rounds of validation and rebuilding with O (17). The final model included protein residues 27 to 226 and three molecules of dioxane, which was present in the crystallization solution.
![]() View larger version (30K): [in a new window] |
FIG. 1. (A) Experimental electron density map showing the triad Arg-X-Tyr and the bound dioxane molecule. (B) Overall structure of Rv0813c, color coded from the N terminus (blue) to the C terminus (red). Charged residues in the protein loops that define the entry to the cavity are labeled. (C) Molecular surface of the internal binding cavity, color coded according to electrostatic potential, showing the bound dioxane molecule and ordered solvent molecules. The entrance of the cavity (top) is open to the solvent.
|
Protein structure accession numbers. The experimental structure factors and the refined coordinates for the final model (Table 1) have been deposited in the Protein Data Bank with accession code 2FWV.
|
View this table: [in a new window] |
TABLE 1. Crystallographic data for Rv0813 crystals
|
|
|
|---|
-helix that extends into the solvent and is increasingly disordered toward its N terminus (no density is observed for the first 26 residues in the crystal structure). This
-helix is followed by a largely exposed proline-rich loop (residues 40 to 50) that also contains several hydrophobic side chains and is engaged in crystallographic contacts, raising the question of whether this domain could function as a docking site for protein-protein interactions. The 10-stranded ß-barrel is well defined in density, except for the solvent-exposed loop 132 to 144, which is presumably disordered in the crystal structure. A short 310-helix seals the "bottom" end of the barrel, whereas the "top" end comprises five protein loops that protrude into the solvent in an open configuration, allowing access of a putative ligand to the central cavity. The internal surface of the central cavity (Fig. 1C) is defined by an array of hydrogen-bonded polar side chains. Thus, the guanidinium group of Arg190 forms a salt bridge with Glu126 (Fig. 1A) and is also engaged in H bonds to Tyr192 and Glu20. On the other side of the pocket, Gln92 makes H-bonding interactions with the hydroxyl groups of Tyr202 and Ser108. Other H-bonding interactions connect Glu126 to Trp106 and Glu204 to Arg206. At the bottom of the cavity, the phenol group of Tyr192 binds a small ligand, which displays the chair conformation of a six-membered nonplanar ring in the experimental electron density map (Fig. 1A) and was modeled as dioxane, an additive included in the crystallization buffer. An extended array of well-ordered solvent molecules fills the remaining space of the cage (Fig. 1C), further stabilized by hydrogen-bonding interactions to several polar side chains (Arg206, Glu157, Tyr155, His152, His116, Tyr88, and Arg112).
Four of the five loops that define the entry to the cavity at the top of the ß-barrel (Fig. 1B) extend into the solvent and contain two or more charged side chains each. Thus, loop 178 to 186 contains two basic residues (Arg178 and Arg180), loop 207 to 215 contains two acidic residues (Asp208 and Asp210), loop 82 to 87 contains three charged groups (Arg82, Asp85, and Asp87), and loop 112 to 121 contains another three (Arg112, Asp118, and Glu121). However, many of these charged side chains point toward the interior of the protein, making it unlikely that these charged residues could fulfill a role similar to those in FABPs, which mediate interactions of the protein with the lipid bilayer to facilitate the transfer of small hydrophobic ligands, such as fatty acids (7, 15, 23).
Rv0813c belongs to a new bacterial family of FABP-like proteins.
A search for amino acid sequence similarities in public databases (as of December 2005) revealed that Rv0813c homologs (Fig. 2) are found in mycobacteria (amino acid sequence identities of >80% to Rv0813c), nocardia (65%), and corynebacteria (40 to 50%). More distantly related proteins retain the 10-stranded ß-barrel and the short 310-helix that seals the bottom of the ß-barrel but lack the N-terminal
-helical region of Rv0813c. This larger group includes several proteins of bacterial and eukaryotic origin, with sequence identities of 28 to 35% to Rv0813c. Indeed the three-dimensional (3D) structures of two of these proteins, Arabidopsis thaliana At1g79260 and M. tuberculosis Rv2717c (Protein Data Bank codes 2A13 and 2FR2) (Fig. 3A), have been recently determined within the context of structural-genomics programs. Although these proteins lack the N-terminal domain of Rv0813c, their 10-stranded ß-barrels are very similar (root mean square deviations of 1.2 to 1.4 Å for 133 matched C
positions). The above observations suggest that the 10-stranded beta barrel may be a structurally conserved domain seen in many other proteins among prokaryotes and eukaryotes.
![]() View larger version (95K): [in a new window] |
FIG. 2. Multiple alignment of Rv0813c homologs. Secondary-structure elements are indicated for Rv0813c (above) and Atlg79260 (below). Invariant residues are in white (red background), and those conserved in at least 70% of the sequences are in red (boxed). Genes and species abbreviations: Rv0813, M. tuberculosis Rv0813c; Nf, Nocardia farcinica; Ml, Mycobacterium leprae; Ma, Mycobacterium avium subsp. paratuberculosis; Cj, Corynebacterium jeikeium; Cd, Corynebacterium diphtheriae; Ce, Corynebacterium efficiens; Cg, Corynebacterium glutamicum; Os, Oryza sativa; Pa, Propionibacterium acnes; Nsp, Nocardioides sp.; Tf, Thermobifida fusca; Sa, Streptomyces avermitilis; Sc, Streptomyces coelicolor; Fsp, Frankia sp.; Kr, Kineococcus radiotolerans; Asp, Arthrobacter sp.; Bl, Brevibacterium linens; Lx, Leifsonia xyli; Mtb, M. tuberculosis Rv2717c; At, Arabidopsis thaliana Atlg79260.
|
![]() View larger version (45K): [in a new window] |
FIG. 3. (A) Structural superposition of Rv0813c (red) with At1g79260 (2A13; green) and Rv2717c (2FR2; light cyan). (B) Structural superposition (93 core Ca positions) of Rv0813c (red) with human muscle FABP (1HMT; blue).
|
-helical insertion between the first and second ß-strands that covers the entrance to the internal binding site. So far, proteins belonging to the FABP family have been found only in eukaryotes, where they are thought to function in the uptake, transport, and storage of retinoids and fatty acids, as well as in signaling, although their precise cellular functions remain unclear (11, 12, 14, 18, 19, 30). The 10-stranded antiparallel ß-barrel of Rv0813 superposes well (root mean square deviation, 2.3 to 2.5 Å) with those of CRBPs, CRABPs, and FABPs, as shown in Fig. 3B for human muscle FABP (29), and also shares the short irregular 310-helix (Rv0813c residues 63 to 71) immediately preceding the first ß-strand. Sequence comparisons suggest that this helix may represent a general motif that primes the folding of the 10-stranded ß-barrel and may serve as a signature for identifying additional Rv0813c-like proteins. On the other hand, Rv0813c differs from eukaryotic FABPs in some features. For instance, the mycobacterial protein lacks the gap observed between the fourth and fifth ß-strands of the FABP ß-barrel, which was proposed to serve as an entrance to the fatty acid-binding site. In Rv0813c (and in other bacterial homologs), these two strands are occluded within the dimer interface (see below) and are thus not accessible to a putative ligand. Instead, the top of the barrel (opposite the conserved 310-helix) is open to the solvent in Rv0813c, which lacks the double
-helix insertion that caps the entrance to the ligand-binding site in FABPs. Extensive crystallographic and nuclear magnetic resonance studies of FABPs show that these proteins usually bind a single fatty acid molecule with its hydrophilic group deep within the cavity, although in some cases, two ligand molecules are found to occupy the internal cavity (22, 27). Bound fatty acids are completely enclosed in the interior of the ß-barrel with their carboxyl groups forming direct or water-mediated H bonds to three polar amino acids, which always include at least one arginine (13). Two of these three binding residues are usually found within an Arg-X-Tyr motif. A similar motif, Arg190-Leu191-Tyr192, is also found at the bottom of the ß-barrel cavity in Rv0813c, where the phenol group of Tyr192 is engaged in H-bonding interactions with a dioxane molecule (Fig. 1A). There are important differences, however, since the side chain of Arg190 might be unavailable for ligand binding, as it is locked into a conserved salt bridge with Glu126. Also, the Arg-X-Tyr motif in Rv0813c is located on the 8th strand of the ß-barrel, rather than in the 10th, as seen in FABPs, and the Rv0813c internal cavity is more negatively charged than those in FABPs, resembling more closely those of the CRBPs.
Several glycine residues are strictly conserved in the ß-strands of Rv0813c homologs (Fig. 2), probably because of structural constraints required to allow bending of the corresponding ß-strands. These include three glycine residues in the largely conserved motif GXWXGXG (residues 73 to 79 in ß-strand 1), Gly91 in strand 2, Gly128 in ß-strand 4, Gly161 in ß-strand 6, and Gly193 in ß-strand 8. Interestingly, only Gly73 and Trp75 of the GXWXGXG motif correspond in part to the PROSITE signature patterns of FABPs and lipocalins (PROSITE entries PS00214 and PS00213, respectively) (10). A hydrophobic pattern, HHXHXXHH (where H stands for a hydrophobic residue, usually Leu), is also conserved in the 310 helix preceding the first ß-strand. More importantly, several residues defining the internal cavity are also largely conserved (Fig. 2), suggesting that these proteins might bind similar ligands. They include the salt bridge between Glu126 and Arg190, as well as the residues in contact with the dioxane molecule in the Rv0813c structure (Tyr192, Tyr202, and Gln92) and others located close to the putative binding site (Trp106, Ser 108, Glu157, Ala219, and Leu221). In one homolog (P. acnes), residues Gln92 and Ser108, which are H bonded to each other in Rv0813c, are both replaced by Cys, suggesting the formation of a disulfide bridge. Unlike the FABP family, however, neither the long loops above the ß-barrel that protrude into the solvent nor the residues defining the entry to the binding site contain any conserved patch of residues.
A dimer occluding
2,100 Å2 from each monomer at the interface is observed in the Rv0813c crystal structure (Fig. 4A), and the residues defining this interface are well conserved in bacterial Rv0813c homologs (Fig. 4B). This dimeric arrangement is compatible with a putative ligand-binding function of the protein, since the entrance to the cavity in each monomer remains fully accessible from the solvent. To investigate the oligomeric state of Rv0813c in solution, we carried out dynamic light-scattering and gel filtration studies of the recombinant protein. As shown in Fig. 4, the results clearly indicated that Rv0813c behaves as a dimer in solution. Since the homologs At1g79260 and Rv2717c also crystallize as similar dimers, the above-mentioned results strongly suggest that, as in eukaryotic FABPs (12), the formation of homodimers might be a common feature of this bacterial protein family.
![]() View larger version (49K): [in a new window] |
FIG. 4. (A) The crystallographic Rv0813c dimer. (B) The monomer-monomer interface, shown in red, for one monomer. The labeled residues correspond to highly conserved amino acid positions in the bacterial-protein family, which account for over 70% of the dimer interface. (C) Monomodal size distribution dynamic light scattering profile of recombinant Rv0813c. The mean value of the molecular radius (3.2 nm) is consistent with a dimeric form ( 50 kDa) of the protein in solution. (D) Gel filtration chromatography of Rv0813c. The chromatograph was obtained in a SMART system (Amersham Pharmacia) with a Superdex 200 column at a flow rate of 40 ml/min. The retention time (39.7 min) is between those of bovine serum albumin (67 kDa; 36.5 min) and Chymotrypsin A (25 kDa; 44.3 min), consistent with a dimeric form of Rv0813c in solution.
|
This work was partially supported by grants from the Institut Pasteur (GPH-Tuberculose), the National Genopole Network (contract RG-2002-08), and the European Commission (SPINE, contract no. QLG2-CT-2002-00988, and X-TB, contract no. QLK2-CT-2001-02018).
Published ahead of print on 15 December 2006. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»