Previous Article | Next Article ![]()
Journal of Bacteriology, April 2007, p. 3290-3295, Vol. 189, No. 8
0021-9193/07/$08.00+0 doi:10.1128/JB.01937-06
Copyright © 2007, American Society for Microbiology. All Rights Reserved.
,
Infectious and Inflammatory Disease Center, Burnham Institute for Medical Research, 10901 N. Torrey Pines Rd., La Jolla, California 92037,1 The Scripps Research Institute, 10550 N. Torrey Pines Rd., La Jolla, California 920372
Received 21 December 2006/ Accepted 7 February 2007
|
|
|---|
|
|
|---|
A majority of yyc operons (except in streptococci) include two other genes, yycH and yycI (19, 26). Their protein products share no detectable homology with any other proteins in the databases, as evidenced from a BLAST search. Deletion of the genes coding for either of these proteins in B. subtilis gives identical phenotypes, with a 10-fold upregulation of the YycFG system. Furthermore, YycG, YycH, and YycI have been shown to form a ternary complex. Both YycH and YycI localize to the periplasm and are anchored to the membrane via a single N-terminal transmembrane helix (25, 26). Therefore, YycH and YycI appear to act together to negatively regulate the YycG histidine kinase.
In contrast to the high sequence conservation of the YycG kinase and the YycF response regulator, YycH and YycI orthologs are poorly conserved. We recently solved the crystal structure of YycH, which revealed a three-domain architecture and a novel fold (27) but gave no clues to the function of the protein. Here we present the crystal structure of YycI at a 2.9-Å resolution and discuss insights into its biochemical function derived from the structure.
Cloning, expression, and purification of YycI. The B. subtilis yycI gene encoding amino acids 31 to 280 was PCR amplified utilizing two oligonucleotides (restriction sites in boldface): yycIf (5'-GTTTTTTCAAAAACATATGGCTACCGGCAAAG-3') and yycIr (5'-TTGCAAGCGGATCCTTGTCTCACTCACTCC-3').
The resulting PCR fragment was digested with NdeI and EcoRI restriction enzymes and cloned into the same sites of vector pET28a (Novagen), generating pJS32. This vector expresses the yycI gene devoid of its N-terminal transmembrane helix and translationally fused to an N-terminal thrombin-cleavable His6 tag. Plasmid pJS32 was transformed into BL21(DE3) (Stratagene) for production of native YycI and into the methionine auxotroph B834(DE3) (Stratagene) for production of selenomethionine-labeled YycI.
The overexpression and purification protocol for unlabeled and selenomethionine-labeled YycI was identical to that previously described for YycH (27). Following purification of His-tagged YycI, thrombin was added at 1 U/mg YycI, and the mixture was dialyzed against three changes of buffer B (5 mM Tris-HCl [pH 8], 50 mM NaCl, 20 mM dithiothreitol) over a 14-h period. Thrombin, cleaved His tag, and undigested protein were removed by addition of p-amino-benzamidine agarose (Sigma) and Ni-nitrilotriacetic acid resin, respectively. This procedure yielded more than 15 mg/liter of pure YycI protein.
Crystallization and structure solution of YycI(31-280). YycI(31-280) was crystallized at 22°C by the hanging drop vapor diffusion method using selenomethionine-labeled protein concentrated to 15 to 30 mg/ml in 5 mM Tris-HCl buffer (pH 8.0) plus 50 mM NaCl and a reservoir solution consisting of 18% polyethylene glycol 1500, 100 mM citrate (pH 3.2), and 200 mM LiSO4. Crystals grew to a size of 0.2 by 0.2 by 0.4 mm3 in 7 to 14 days, and adopted space group P21 with cell dimensions a = 60.8 Å, b = 161.9 Å, c = 180.2 Å, and ß = 90.9°. Twenty percent glycerol was used as a cryoprotectant for data collection.
A three-wavelength data set to a 2.9-Å resolution suitable for MAD (multiwavelength anomalous dispersion) phasing was collected at
100 K at beamline 12.3.1 of the Advanced Light Source and processed with HKL2000 (20). A heavy-atom search was performed with SHELX-D (28), and the top 72 sites were used as input to autoSHARP (3) for phasing and density modification with SOLOMON (1) and DM (6). Mean figures of merit before and after density modification were 0.42 and 0.82 for reflections between 50 and 2.9 Å. Data reduction and phasing statistics are presented in Table 1.
|
View this table: [in a new window] |
TABLE 1. Data reduction and phasing statistics
|
15% (2.9-Å data) or 25% (5-Å data) of the origin height, consistent with noncrystallographic translational symmetry elements (0,0,1/3) and (0,0,2/3). The
90° ß angle further suggested that the monoclinic lattice could be derived by distortion of the orthorhombic lattice observed for the wild-type crystals. We therefore predicted that the asymmetric unit would comprise six noncrystallographic symmetry-related copies of a repeating molecular unit (a dimer, for a total of 12 proteins per asymmetric unit, assuming a solvent content of 51%). The clearest region of the initial electron density map was selected for manual building using O (14) of one protein subunit, allowing 85% of expected residues to be built as a polyalanine model. Eleven further copies of this model were manually placed into the map, and one round of energy minimization was performed with Refmac5 (17). The arrangement of the 12 polypeptide chains in three rows of four, each generated by noncrystallographic translation and containing two dimers related by a two-fold axis nearly coincident with the y axis, confirmed the prediction from the Patterson analysis.
The amino acid sequence was assigned to the polyalanine models by utilizing the six selenomethionine sites per chain and several clearly visible aromatic side chains as guides. Refinement was carried out using Refmac5 with phase restraints using medium and loose ncs restraints for molecules related by ncs translation or two-fold rotation, respectively, with further model building in O as necessary. Simulated annealing with tight ncs restraints in CNS was performed for the next two cycles of refinement (4). The free set of reflection for cross-validation was created from a pseudo-orthorhombic data set expanded to monoclinic (i.e., h,k,l and h,k,l).
The final model contains 12 protein chains, each comprising residues from 34 to 276 to 280, with up to 9 missing residues in the loop between strands ß20 and ß21 and 6 putative Cl ions, one at each dimer interface. The root mean square deviation (RMSD) for C
atoms when each subunit was superposed onto a reference molecule (chain A) varied between 0.27 and 1.16 Å. No water molecules were included as none could be found that obeyed the symmetry in more than four ncs-related copies (using ARP/wARP (22). The final R values are Rwork = 0.212 and Rfree = 0.262. The stereochemical quality of the model was excellent, as judged by PROCHECK (15) (Table 2). All but 2 of the 21 outliers in the Ramachandran plot are located in loops with strong but ambiguous electron density, suggesting the presence of alternative main-chain conformations. Surface area calculations were performed in CNS with a probe radius of 1.4 Å. Refinement statistics are summarized in Table 2.
|
View this table: [in a new window] |
TABLE 2. Refinement statistics
|
-helix (Fig. 1A).
![]() View larger version (60K): [in a new window] |
FIG. 1. YycI dimer in two orthogonal views. (A) Subunits are shown as red and blue in the ribbon representation, and the chloride ion is shown as a yellow sphere. N and C termini, domains 2 and 3 (D2 and D3), and structural elements mentioned in the text are labeled on one subunit. The ß20-ß21 hairpin is fully visible in the red subunit (see text). Two orthogonal views related by a horizontal axis are shown, from the side and from the bottom of the molecule along the pseudo-two-fold symmetry axis. Molecular graphics objects were created with Chimera (23). (B) Accessible surface of the YycI dimer in the same two views as panel A, colored according to electrostatic potential on a scale from 20 kT/e (red) to 20 kT/e (blue). The electrostatic potential map was calculated with Spock (5).
|
atoms (or 3.1 Å and 2.7 Å, respectively, when domains 2 and 3 are aligned separately) and with 10% sequence identity.
![]() View larger version (30K): [in a new window] |
FIG. 2. Comparison of a YycI subunit and a YycH monomer. (A) The two structures are shown in the same orientation after structural alignment colored in blue-green-yellow-orange-red so that corresponding structural elements are in similar colors. Only the fragment of YycH that aligns with YycI is shown for clarity. (B) A structure-based sequence alignment of YycI and YycH was generated using the DALI server. For accuracy, individual domains were aligned separately and then assembled into a single alignment. Residues conserved between YycH and YycI are shown between the two sequences. Secondary structure elements for YycI and YycH are shown above and below each sequence, respectively, and were labeled as previously defined for YycH (27). One strand in YycH located between ß18 and ß19 was not previously labeled and was therefore named ß18'. The N-terminal helix in YycI does not have a counterpart in YycH and was hence named 0. The putative transmembrane domain in YycI is highlighted in gray. Sequences in YycH that were not resolved are in gray.
|
7, has no counterpart in YycH and was therefore named
0. Dimerization is mediated by this helix as well as domain 3, including a contribution from an extended C terminus contacting helix
7 and strand ß21 in the apposing subunit, burying a total surface area of 3,000 to 3,500 Å2 per dimer. Electron density for the C terminus is visible to a varied extent in each of the 12 chains. The pair of helices made from
0 from each subunit forms the base of the dimer and is part of a central dimerization moiety, while domain 2 points away from the center. The Fold and Function Assignment System (FFAS) server predicts a common fold for YycH and YycI. A regular BLAST or iterative Psi-BLAST search was unable to predict any similarity between YycH and YycI; thus, the common protein fold revealed by crystallography was surprising at first. We subsequently used the FFAS server at the Burnham Institute for Medical Research (http://ffas.ljcrf.edu) (13) to further analyze this discrepancy. The algorithm uses profile-profile alignments to take advantage of information present in sequences of homologous proteins to amplify the sequence pattern defining the family (it does not directly use 3-dimensional structural information for the homology search). As a result, it can detect remote homologies beyond the limits of other methods. When the YycH structure was included in the database, the FFAS server correctly predicted the fold of YycI to be homologous to that of YycH, with a confidence level of >97% (Z score = 16.1; sequence identity, 10%).
Strong negative charge is present at the base of the dimer.
The periplasmic domain of YycI has a calculated isoelectric point of 5.5, with a net charge of 6 at neutral pH. Notable positive and negative electrostatic potentials occur on the outer surface at the top of the molecule and the "lower" surface of the N-terminal helix pair, respectively (Fig. 1B). The latter may be relevant to the signaling mechanism, as this surface is predicted to appose the membrane in vivo. A net negative charge is generally, though not universally, observed at this site in orthologs (not shown). In the crystal structure, there are only seven periplasmic residues unresolved between the transmembrane helices and the ordered N termini, which begin at Lys34. Given that the two ordered N termini within the dimer are
26 Å apart, it is unlikely that the transmembrane helices can interact within the homodimer without some geometric constraints. This suggests that the periplasmic domain is held rather rigidly vis-à-vis the membrane, either through electrostatic repulsion or through divalent cation-mediated interactions. In either case, we predict that the short periplasmic domain-transmembrane linker severely restricts the translational freedom of the transmembrane domains. That the latter play a role in the interaction between YycI, YycH, and YycG is suggested by the good sequence conservation between orthologs, in contrast to the low conservation of the periplasmic domains. We previously described several conserved hydrophilic residues in the transmembrane helix of YycH, including an SXXX(T/S) motif, that could confer hetero- or homotypic interactions (7, 27). Similarly, conserved hydrophilic residues are found in the YycI and YycG transmembrane helices, despite somewhat lower conservation. Therefore, one function of the periplasmic domain of YycI could be the regulation of the behavior of its transmembrane helix. We further note that the flat shape of the YycI dimer may allow juxtaposition of the transmembrane domains of YycI, YycH, and YycG by minimizing steric clashes between their periplasmic domains and therefore facilitate their interactions.
YycI displays a putative ligand binding site.
At the center of the dimer is a deep pocket lined by positively charged residues from domain 2. Strong residual electron density (5.5 to 6.0
) at the center of this pocket in the final 2Fo-Fc map (compared with 5.5 to 7.5
for the Se atoms) suggests the presence of a bound ion. We assigned the electron density to a chloride ion, based on electron count and the local chemistry, but we cannot be sure of its identity. The pocket brings basic residues from the two monomers into close apposition, suggesting that an anion is necessary at this site to promote dimer formation. From this pocket a narrow cavity extends all the way to the base of the dimer at helix
0, more than 20 Å away from the anion (Fig. 3). The presence of adjoining cavities suggests that the anion partially occupies a ligand-binding site that may serve the purpose of stabilizing or otherwise modulating the structure of the dimeric assembly in vivo. Our structure offers a rational approach through point mutagenesis to test this hypothesis.
![]() View larger version (86K): [in a new window] |
FIG. 3. Putative ligand binding site at the YycI dimeric interface. One subunit is shown as a white surface representation, and domain 3 of the other is shown as a blue ribbon. Side chains lining the dimerization interface are shown. The ion bound at the end of the putative ligand-binding cavity (positioned along the dashed line) is shown as a magenta sphere. N and C termini and 0 are labeled for reference in both subunits.
|
Protein structure accession number. Structure factors and coordinates for YycI have been deposited in the Protein Data Bank (accession no. 2O3O).
We thank Alexander Aleshin for synchrotron data collection and processing.
Published ahead of print on 16 February 2007. ![]()
Manuscript no. 18605 from The Scripps Research Institute. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»