Previous Article | Next Article ![]()
Journal of Bacteriology, July 2003, p. 4243-4247, Vol. 185, No. 14
0021-9193/03/$08.00+0 DOI: 10.1128/JB.185.14.4243-4247.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
Centro de Pesquisas, Fundação Antônio Prudente,1 Sao Paulo Branch, Ludwig Institute for Cancer Research, São Paulo, SP 01509-090,3 Departamento de Bioquímica, Universidade Federal de São Paulo, São Paulo, SP 04044-020, Brazil2
Received 24 February 2003/ Accepted 30 April 2003
|
|
|---|
-helical conformation, was shown to form the electrostatic interactions Glu552-Gln89, Asp555-Arg92, and Arg558-Glu93 with the predicted binding site of the flagellin of S. enterica serovar Typhimurium, formed by the 88-to-97 chain fragment (LQRVRELAVQ), which is likewise
helical. |
|
|---|
To accomplish such a study, we made use of a strategy that, in this laboratory (6, 21, 28, 34) and elsewhere (14), has proved its value for the prediction of sequences of putative peptide ligands and the location of receptor binding sites. A hypothesis (15) that enables prediction of the structure of a peptide ligand was also used.
The complementary-hydropathy principle (3, 4, 5, 25) postulates the existence of stabilizing interactions between two peptide chains in whose sequences there is a correspondence between hydrophobic (hydrophilic) residues in one chain and hydrophilic (hydrophobic) residues in the other chain. According to this principle, two complementary peptide chains may form a complex, even though a pairwise binding of all complementary amino acid residues is not necessary.
There are now 69 reports of the successful exploitation of complementary hydropathy for the identification of interacting proteins (14), as opposed to only 3 negative reports (1, 9, 12), 1 of which (5) has already been refuted. Furthermore, the definition through this strategy of the binding site of collagenase (28) was subsequently confirmed (19) by site-directed mutagenesis. Also, the predicted site in the cellular prion molecule to which a putative ligand receptor would bind (21) was confirmed after the complete characterization of this molecule (34) by peptide inhibition of binding between the two molecules, as well as by removal (34) of the cellular prion binding site by sire-directed mutagenesis.
In the following discussion, it is shown how a search for complementary hydropathy between the sequences of TLR-5 and flagellin enables the prediction of binding sites. There still remains the problem, however, that although the structure of the flagellin of S. enterica serovar Typhimurium (26) has been determined by X-ray diffraction analysis, the structure of TLR-5 has not been experimentally determined.
To determine the structure of the TLR-5 fragment predicted to bind flagellin, we made use of a hypothesis that had already been applied to similar problems, such as
-MSH (18) and substrates (17) of oligopeptidase EC 24.15. According to this hypothesis, the structures adopted by peptide chains belong to conformational families that remain the same in different chemical environments. A transition to a new chemical environment causes a change in the probabilities of occurrence of the various conformational families so that, for instance, a conformational family that has a very low probability in an aqueous solution may have a very high probability in a lipid solution and vice versa.
According to the same hypothesis, a peptide chain and a protein fragment with the same sequence also occupy the same conformational families, although with different probabilities. Thus, various conformational families occupied by the predicted TLR-5 binding site may be tested against flagellin.
|
|
|---|
= -57,
= -47], B [
= -139,
= 135], G [
= -60,
= -30], D [
= -90,
= 0], and E [
= 70,
= -60], where
and
[reference 18 and references quoted therein] are main chain dihedral angles) as described in the Appendix. For the side chain conformations, we made use of the gauche minus, trans, and gauche plus rotamers of the classification of Ponder and Richards (23) (numerals 1, 2, and 3 indicate, respectively, the gauche minus, trans, and gauche plus rotamers of
1). Internal energies were calculated with the ECEPP/2 (35) force field. A search for complementary features on the surfaces of the binding site of TLR-5 and the S. enterica serovar Typhimurium flagellin, whose structure (26) was downloaded from the Protein Data Bank (2) website, was then carried out with translations and rotations of the two structures until a maximum score of hydrogen bonds and salt bridges was found. This search was done by hand with the software WEBLAB (provided by Accelrys) and WHAT IF (32). After that, the complex was docked with the HEX (24) software by using the parameters listed on page 11 of the manual (http://www.biochem.abdn.ac.uk/hex). Structures of the TLR-5-flagellin complexes of L. monocytogenes and P. aeruginosa were obtained with WHAT IF by making point mutations and structural optimizations in the TLR-5-flagellin complex of S. enterica serovar Typhimurium. |
|
|---|
|
View this table: [in a new window] |
TABLE 1. Regions of the TLR-5 sequence for which a complementary hydropathy with the flagellins of three bacterial species was founda
|
![]() View larger version (16K): [in a new window] |
FIG. 1. Hydrophobicity profiles (H, hydrophobicity measured by the Kyte and Doolittle scale [20]; aa, amino acid) of the peptides listed in Table 1. Solid line, TLR-5 binding site; dotted line, peptide I; dashed line, peptide II. The hydrophobicity profiles of peptides I and III are almost indistinguishable.
|
However, an additional difficulty is posed by the nonavailability of an experimentally determined structure of TLR-5. Although the structures of TLR-1 and TLR-2 are known (33), there is only partial homology among the sequences of TLR-1, TLR-2, and TLR-5 and this sequence homology does not encompass the predicted binding site. Therefore, to carry out the intended study, we counted only on the sequences of flagellin binding sites, the structure of S. enterica serovar Typhimurium flagellin, and the sequence of the TLR-5 binding site.
To cope with the absence of a TLR-5 structure, we did an exhaustive search of the conformations of its binding site that generated 14,414 different structures. The assumption that this high number of structures may be classified in a limited (15, 18) number of conformational families facilitated the, otherwise almost impossible, task of determining the active structure.
A set of conformational families available to the binding regions of TLR-5 was thus generated as described in the Appendix. Structures belonging to some of these conformational families are represented in Fig. 2.
![]() View larger version (22K): [in a new window] |
FIG. 2. Density-of-states ( ) versus energy (E, in kilocalories per mole) plot obtained for the 552-to-561 (EILDISRNQL) fragment of TLR-5. The energy was rescaled so that the minimum energy corresponds to zero. Thirteen density-of-states maxima are labeled. Some of the conformations found within these maxima are GGGAAAAAAD3112122121 (I), GGGAAAAAAD3112122111, GGGAAAAAGD3112122111 (II), GGGAAAAAAA3112122111 (III), GGGAAAAABA3112122131 (IV), GDDAAAAAGD1311122121 (V), GDDAAAAAAD1111121111 (VI), DDDGGAAAAG3311121121 (VII), GDDDAAAAAG2113211111 (VIII), DDDAAAAABD1311122131 (IX), GDDDDGAAAD1213111111 (X), DDDAAAAAGD1111231221 (XI), EGABBBBGAA1112221111 (XII), and BBDBGBDABG1311123131 (XIII), where the letters and numerals indicate, respectively, main chain and side chain rotamers, as defined in Materials and Methods. Structures found within three maxima are depicted. Stick representations of the polypeptide chain and ribbon representations of the secondary structure are shown.
|
In the present study, however, the condition that TLR-5 forms a stable complex with flagellin considerably restricted the search. The most obvious choice, the ground level conformation (lists of rotamers that define chain conformations are shown in the legend to Fig. 2) that in Fig. 2 corresponds to the maximum labeled I in the density-of-states versus energy plot and to conformation I, was considered first.
Docking (see Materials and Methods) of this
-helical conformation to the, likewise
-helical, predicted binding site of flagellin showed that, as shown in Fig. 3, three electrostatic interactions may form. More specifically, these are the TLR-5-flagellin interactions Glu552-Gln89, Asp555-Arg92, and Arg558-Glu93. Model building (data not shown) showed that the Arg558-Glu (or other residue in position) 93 interaction is maintained even when Glu93 is replaced with glutamine in the L. monocytogenes flagellin and by aspartate in the P. aeruginosa flagellin.
![]() View larger version (21K): [in a new window] |
FIG. 3. Complex formed between the binding sites of flagellin and TLR-5 predicted by the complementary-hydropathy principle. (A) Flagellin complexed to the binding site of TLR-5. Stick representations of the 87-to-97 and 552-to-561 chain fragments, respectively, are shown. Ribbon representations of the flagellin structure and the TLR-5 binding site are also shown. (B) Stick representation of the binding sites of flagellin (gray) and TLR-5 (black). Hydrogen-bonded side chains are represented by dotted lines.
|
On the other hand, TLR-5 conformations distorted in relation to the above-shown conformation would not have the required positioning of side chains. Inspection of the conformations found within the density-of-states maxima labeled V to XIII in Fig. 2 showed that, in every case, the three described electrostatic interactions cannot form because the side chain positioning has been disrupted, as a comparison between conformations I and VIII and I and XII, depicted in Fig. 2, would show. These findings were confirmed by docking (see Materials and Methods) conformations VIII and XII (depicted in Fig. 2) to flagellin.
The binding site of flagellin found in the present study is located within constant domain D1, which is relatively conserved (8) among different species. Two regions required for TLR-5 activation were experimentally (8) identified in this domain. One region, on the N-terminal side, includes amino acids 57 to 190 and therefore includes a fragment sequentially homologous to the predicted binding site at positions 88 to 97. The other region is on the C-terminal side. It has been shown (8), with the aid of linker insertion mutagenesis experiments, that within this region, the short stretch from 485 to 489 (GAIQN) is necessary for interleukin-8 release. A search for complementary hydropathy with the parameters Cm ≥ 0.75 and L = 5 identified the 484-to-488 chain fragment (LGAIQ) in the sequence (29) of enteroaggregative Escherichia coli, which is hydropathically complementary to the 323-to-327 (KINKI) TLR-5 fragment. These fragments were not included in this molecular modeling study because the 484-to-489 flagellin fragment is not part of the experimentally determined (26) structure and because a search of short (L = 5), hydropathically complementary peptide fragments is too unrestricted and is likely to yield false positives.
The present study is part of the broader research field of protein-protein interactions. It has been shown (24) that surface complementarity, i.e., optimal positioning of interacting side chains, is an effective predictor of complexes formed by proteins. This criterion is in accordance with our calculations that showed that the side chains of Gln89, Arg92, and Glu93 on the surface of flagellin are optimally positioned to interact, respectively, with the side chains of Glu552, Asp555, and Arg558 on the surface of the predicted TLR-5 binding site.
Although the proposed TLR-5 and flagellin binding sites are hydropathically complementary, the predicted complex is stabilized by electrostatic interactions between hydrophilic side chains. In this study, we used an in silico search for complementary peptides in interacting proteins, combined with conformational analysis calculations, to predict the binding sites and the structure of the complex.
|
|
|---|
Amino acid residue hydrophobicities are evaluated with the Kyte and Doolittle (20) scale. The degree of complementary hydropathy between two peptide fragments of length L is calculated (10) with the equation
where H(i) and H'(i) are the Kyte and Doolittle indices of amino acid i in each sequence and Abs is absolute value. In the present study, the sequences of TLR-5 and flagellin were examined.
The receptor sequence is scanned against the ligand sequence in order to find regions corresponding to a high degree of complementary hydropathy, as calculated by the equation above. Important parameters in this search are the window length (L) and the minimum complementarity (Cm). Each i...i + L fragment in the receptor sequence is searched against fragments j...j + L in the ligand and those fragments for which C ≥ Cm are tabulated.
Determination of the conformational families of a peptide ligand. When there is substantial evidence that a certain peptide chain or a region of a longer chain is the binding site or peptide ligand that we are looking for but the corresponding structure is not known, we can generate a reasonably complete set of conformational families and test the binding of each one.
To carry out the search for conformational families, a plot of the density of states (
, which is the number of conformations per unit energy interval) versus energy is obtained. It has been argued (15, 16) that minima in this plot correspond to kinetic barriers in the potential energy landscape and maxima correspond to energy minima that, in turn, correspond to conformational families. Therefore, the number of conformational families available to a peptide chain is approached by the number of maxima in a density-of-states versus energy plot.
The density-of-states versus energy plot shown in Fig. 2 includes 14,414 conformations of the 552-to-561 region of TLR-5. These conformations were generated with a matrix algorithm (16) and the adoption of a set (see Materials and Methods) of five main chain and three side chain conformations. The internal energy of each conformation was calculated with the ECEPP/2 (35) force field.
|
|
|---|
-MSH. Int. J. Quant. Chem. 53:335-341.[CrossRef]
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»