Eerappa Rajakumara,2,
Pooja Anjali Mazumdar,1
Baisakhee Saha,1
Devrani Mitra,1
Harald G. Wiker,3
Rajan Sankaranarayanan,2* and
Amit Kumar Das1*
Department of Biotechnology, Indian Institute of Technology, Kharagpur,1 Centre for Cellular and Molecular Biology, Hyderabad, India,2 The Gade Institute, University of Bergen and Haukeland University Hospital, Bergen, Norway3
Received 21 July 2004/ Accepted 10 December 2004
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Covalent modification of proteins by reversible phosphorylation and dephosphorylation is an essential physiological mechanism for the control of cellular functions. A wide variety of cellular processes, such as cell division, differentiation, and development, are controlled by molecular signals obtained through cascades of phosphorylation and dephosphorylation of proteins (17, 40). Protein phosphorylation and dephosphorylation are controlled by the enzymatic actions of kinases and phosphatases, respectively (20, 28, 31), on serine, threonine, histidine, and tyrosine residues. Histidine kinases are present in prokaryotic signal transduction systems such as the two-component system (20). The dephosphorylation of phosphohistidine takes place with PP1, PP2A, and PP2C (26). PTPases reverse the effect of tyrosine kinases by dephosphorylating the protein tyrosine residue of cellular substrate proteins and are important in cellular signaling processes (17, 29, 36). PTPases in general are divided into the following three families, based on their molecular masses and substrate specificities: the high-molecular-mass PTPase family, whose members have a conserved 30-kDa catalytic domain; the dual-specificity PTPase family (11, 43), whose members are capable of dephosphorylating both serine-threonine and tyrosine residues; and the low-molecular-mass PTPase family, whose members contain a single 18-kDa catalytic domain (44). High-molecular-weight PTPases include the following: the nonreceptor type of phosphatase, such as PTP1B (3, 4, 24); Yersinia PTPase (37, 48); and the receptor-like PTPases, such as CD45, µ, and LAR (16). All these families of PTPases exhibit very low sequence identities, except in a common active-site signature motif, CXXXXXR(S/T), at the phosphate-binding site (44).
The low-molecular-weight PTPases (LMWPTPases) exhibit no apparent sequence homology with the high-molecular-weight phosphatases apart from the minimal PTPase conserved signature motif C(X)5R(S/T) situated near the N terminus (45). The wide distribution of the low-molecular-weight PTPases among organisms from bacteria to humans suggests their pivotal role in cellular functions and substrate specificity (23, 25, 27, 32).
The common catalytic mechanism of dephosphorylation of the substrates by PTPases involves the conserved active-site sequence motif C(X)5R(S/T), which is called the protein tyrosine phosphate-binding loop (PTP loop) (39). Structural studies of eukaryotic LMWPTPases have demonstrated that the phosphate ion is cradled between the catalytic cysteine and arginine side chains, stabilizes the phosphate by hydrogen bonding, and proceeds through a cysteinyl-phosphate intermediate (47). Cysteine acts as the nucleophile, and its thiolate form binds covalently to the phosphate group of the substrate protein along with the side chain of the arginine located 6 residues downstream of the catalytic cysteine residue. Also, the backbone nitrogens of the PTP loop form hydrogen bonds with the phosphate group of the substrate (47). PTPases are active over a broad range of pHs, i.e., from pHs 5.5 to 7.5, but are inhibited by low concentrations of vanadates. The substrate specificity of LMWPTPase might be determined by the charge distribution around the active site, and distinct charge alterations near the active sites of different LMWPTPases are expected to recognize amino acids with different charges as part of potential peptide or protein substrates (8, 12, 42, 46).
Sequence analysis of the M. tuberculosis genome revealed the presence of two mycobacterial PTPases, mycobacterial PtpA (MPtpA) and MPtpB. MPtpA belongs to the LMWPTPase family and is present in slow-growing mycobacterial species as well as fast-growing saprophytes, whereas MPtpB is a 30-kDa protein and is restricted to members of the M. tuberculosis complex. These two PTPases are released by M. tuberculosis into extracellular culture medium and are implicated in the interaction of mycobacteria with the host cell (21). Disruption of the MPtpB gene impairs the ability of M. tuberculosis to survive in guinea pigs and demonstrates the role of PTPases in the pathogenesis of disease caused by M. tuberculosis (34). MPtpA is highly specific for substrates containing phosphotyrosine residues and shows some activity on myelin basic protein and no activity on phosphoserine or phosphothreonine (9). The MPtpA promoter expresses the protein when cloned into the slow-growing Mycobacterium bovis BCG and is also expressed during infection of human macrophages (9).
Here we report the crystal structures of a low-molecular-weight protein tyrosine phosphatase of M. tuberculosis (MPtpA) at 1.9- and 2.5-Å resolutions and present a detailed structural and sequence comparison with other low-molecular-weight protein tyrosine phosphatases of humans (HCPTPA) (46), bovines (BPTP) (39, 45), and Saccharomyces cerevisiae (LTP1) (42) to understand its specific features and role in substrate recognition and binding.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Initial searches for crystallization conditions were made at 277 K using the commercially available sparse-matrix screening kits Crystal Screen and Crystal Screen 2 from Hampton Research. The hanging-drop vapor diffusion method was used, where each drop, containing equal volumes (2 µl) of protein and reservoir solution, was equilibrated against 750 µl of reservoir solution. Crystals were obtained under two different conditions and were classified as MPtpA (A) and MPtpA (B). Rod-shaped MPtpA (A) crystals were obtained from 22% (vol/vol) ethylene glycol with dimensions of 0.1 by 0.1 by 1.0 mm, while diamond-shaped MPtpA (B) crystals were grown in a solution containing 30% (wt/vol) polyethylene glycol 4000, 200 mM ammonium acetate, and 100 mM trisodium citrate dihydrate [pH 5.6] and had dimensions of 0.3 by 0.3 by 0.6 mm.
Data collection.
Data were collected using a MAR Research MAR-345dtb image plate detector with Cu K
X rays generated by a Rigaku RU-H3R rotating anode X-ray generator equipped with an osmic mirror system and operated at 50 kV and 100 mA. Prior to flash cooling in a liquid nitrogen stream at 100 K for data collection, MPtpA (A) crystals were soaked in 10% glycerol along with the mother liquor for 60 to 120 s. The data were processed using DENZO (33), and the subsequent scaling and merging of intensities were carried out using SCALEPACK (33). Both crystal forms belong to the orthorhombic (P212121) space group and have different cell dimensions. MPtpA (A) and MPtpA (B) crystals diffracted to 1.9- and 2.5-Å resolutions, respectively, and both contained one molecule per asymmetric unit (AU). The solvent contents of MPtpA (A) and MPtpA (B) crystals were 43.7 and 48.2%, respectively. A complete data set was collected from both crystal forms, and the statistical data are given in Table 1.
|
in the Fo-Fc map was tentatively fitted as a chloride ion based on temperature factor refinement. After a few cycles of refinement with CNS and model building with program O, the structure could be refined to an Rfree and Rcryst of 22.6 and 20.1%, respectively. The PROCHECK program (22) was used to generate the Ramachandran diagram. Assessment of the model quality showed that 92.6% of the residues were in the most favored region and that the rest were in the allowed region. The refinement statistics for the MPtpA (A) crystal are given in Table 1. The MPtpA (B) crystal structure was also determined by the molecular replacement method using MPtpA (A) as a search model. An electron density in the active-site region mimicking the substrate was modeled as glycerol based on its electron density and the temperature factor of surrounding protein atoms. The final structure was refined to an Rfree and Rcryst of 27.3 and 21.1%, respectively. The refinement statistics of the MPtpA (B) crystal form are shown in Table 1. Residues 4 to 159 and 4 to 158 in the MPtpA (A) and the MPtpA (B) structures were modeled in 2Fo-Fc and Fo-Fc maps, respectively. The temperature factor of the chloride ion was 14.5 Å2 in MPtpA (A), whereas in MPtpA (B), it was 39.8 Å2. The average temperature factor of glycerol in MPtpA (B) was 36.8 Å2.
Protein Data Bank accession numbers. The coordinates of MPtpA (A) and MPtpA (B) have been deposited in the Protein Data Bank with accession codes 1U2P and 1U2Q, respectively.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
positions of MPtpA (A) relative to the RMS of MPtpA (B) was 0.397 Å. Also, there were no differences in core packing or accessible surface area between the two structures. A comparison of the B factors of the protein atoms indicated that MPtpA (A) is more ordered than MPtpA (B), and the overall B factors of protein atoms of MPtpA (A) and MPtpA (B) were 14.96 and 28.20 Å2, respectively. The following discussion is based on MPtpA (A) unless otherwise indicated. The MPtpA structure was composed of a single
/ß domain characterized by a central, parallel, four-stranded, twisted ß sheet flanked by
helices on both sides. This overall structure has the character of a classic Rossman fold, with two right-handed ß
ß motifs contributing to the central four-stranded parallel ß-sheet (Fig. 1A). A typical phosphate-binding loop (PTP loop) lies at the C terminus of ß1 and the N terminus of
1. The structure-based multiple sequence alignment is shown in Fig. 2. The overall topology of MPtpA closely resembles that of eukaryotic low-molecular-weight PTPases, and the major changes found were in insertions or deletions. Four amino acids were deleted from and 3 amino acids were inserted in the loop regions, i.e., in residues 100 to 107 and 115 to 124, respectively, and the
5 helix contained an additional 6 residues compared to the HCPTPA structure (Fig. 1B). Proline at position 147 (Arg150 in HCPTPA) created a kink in the
5 helix, resulting in the loss of the main chain hydrogen bonds 147N143O and 148N144O.
|
|
positions of MPtpA from those of HCPTPA, BPTP, and LTP1 were 0.99, 1.07, and 1.02 Å, respectively, but the RMS deviation between two mammalian LMWPTPases, HCPTPA and BPTPA, was 0.496 Å. MPtpA showed 38, 37, and 33% sequence homologies with HCPTPA, BPTP, and LTP1, respectively, whereas the sequence homology between the mammalian LMWPTPases HCPTPA and BPTP was 82%. These noticeable differences in RMS deviations and sequence homologies indicated that the structure of MPtpA is significantly different from that of mammalian LMWPTPases. Major changes from the HCPTPA structure were observed in the regions between ß4 and
5 (insertion),
4 and ß4 (deletion), and
3 and ß3 (Fig. 1B). The variable loop (residues 44 to 56), attributed to unique catalytic properties of isoenzymes that vary in size (46), was conformationally unchanged (RMS deviation of 0.342 Å in C
with HCPTPA); however, its residues are not conserved, in contrast to those of mammalian LMWPTPases.
A secondary-structure-based multiple sequence alignment of the MPtpA was made with other prokaryotic LMWPTPases, such as S. enterica serovar Typhi and E. coli, and with eukaryotes, such as Saccharomyces cerevisiae, Homo sapiens, and Bos taurus (Fig. 2). The amino acids Glu22, His65, His71, and Glu136 are highly conserved in the LMWPTPase family. Differences in the variable-loop regions are revealed by structural and multiple sequence alignment. The two proline amino acids in mammalian LMWPTPases (residues 53 and 54 in HCPTPA and residues 54 and 56 in BPTP) are replaced by different amino acids in MPtpA (Fig. 2). The pyrolidine ring of proline restricts this residue to fewer conformations than are available to the other amino acids. As a consequence, the variable loops of mammalian LMWPTPases might be conformationally more stable than that of MPtpA. There is considerable evidence that the variable loop plays a key role in determining the nature of the substrate (42, 46), so the variable loop in MPtpA might help the enzyme to recognize different types of substrates. In addition to these changes in the variable loop specific to MPtpA, the charge distribution around the active site is significantly altered in comparison to that of the mammalian LMWPTPases, due to negatively charged Glu56. The structural analysis shows that Arg65 in HCPTPA (46) and Asn65 in BPTP (45) interact with Glu139 but that these residues, Arg65 and Asn65, are mutated to a hydrophobic residue, Ala64, in MPtpA (Fig. 2), resulting in a loss of interaction with Glu136. Hence, the negative charge on Glu136 is shared only with His65, and the pKa value of His65 in MPtpA might be different from that of HCPTPA and BPTP. Also, the pKa value of His71 of MPtpA differs from that of mammalian LMWPTPases, due to the presence of Thr41 instead of Asp42 in mammalian LMWPTPases (Fig. 2), whereas Glu23 and Asp42 play a crucial role in changing the pKa value of His72 in BPTP (45) and HCPTPA (46). His71 stabilizes the left-handed conformation of Asn14 by interacting with N
2 and, in turn, stabilizes the PTP loop. These differences in the variable loops indicate the different substrate preferences of MPtpA and mammalian LMWPTPases. The specific activity of MPtpA (0.5 µM/min/mg) (9) is nearly 200 times lower than that of the LMWPTPases (95 to 100 µM/min/mg) (46) when the substrate p-nitrophenyl phosphate is used; this difference can be attributed to different amino acids around the active site.
Structure of the active site.
Despite the overall similarities in the active-site architectures and, hence, in the enzymatic mechanisms, significant differences between prokaryotic and eukaryotic LMWPTPases, mammalian LMWPTPases in particular, can be noted. These differences, along with mycobacterium-specific changes, are described in detail below. The active site of MPtpA reveals the presence of the PTP loop (residues 11 to 18) with two active-site residues at positions 11 and 17, an aspartic acid residue at position 126, and a group of hydrophobic residues that form the wall of the active-site cavity (Fig. 1A and D). The favorable binding of the tyrosine ring of the substrate and the substrate selectivity of the enzyme are duly attributed to the facts that the active-site cavity is deep and the residues lining the walls of the cavity are mainly aromatic and hydrophobic in nature (Fig. 1D and E). The P loops are formed by the active-site consensus sequence motifs, CTGNICRS in MPtpA and CLGNICRS in mammalian low-molecular-weight PTPases (46). The Leu12Thr substitution (underlined in the MPtpA sequence) may play a major role in PTP loop stabilization. Leu12 is a hydrophobic residue present along with the charged residues at the surface of the molecule facing the solvent region in mammalian LMWPTPases, and a change to a polar residue may be stabilizing. Also, the side chain O
1 of Thr12 makes a hydrogen-bonding interaction with the side chain N
2 of His93 (Fig. 1C), present only in bacterial LMWPTPases, whereas it is replaced by Asn in all mammalian LMWPTPases (Fig. 2). The side chain N
1 of His93, in turn, interacts with the main chain nitrogen and side chain O
1 of Asp90. The PTP loop is a well-ordered region in both structures and has low temperature factors for both its main chain and side chain atoms. The temperature factors of the main chain and side chain atoms in the PTP loop were 10.4 and 10.65 Å2, respectively, in the MPtpA (A) structure. These temperature factors are lower than the overall temperature factors of the main chain and side chain atoms of the MPtpA (A) structure, which were 12.31 and 15.32 Å2, respectively. An extensive network of H bonds in the three-dimensional structure, in accordance with a lower temperature factor, stabilizes the PTP loop conformation. However, the backbone structure of the PTP loop in this enzyme is very similar to that of other protein tyrosine phosphatases whose structures are known (42, 45, 46).
The PTP loop contains the residue Asn14, which is in a left-handed helical conformation (
= 59°;
= 34°), as is evident from the Ramachandran plot. This residue forms a network of H bonds with conserved residues Ser18, Ser42, and His71, which help to stabilize the left-handed conformation and thus the overall structure of the PTP loop, allowing it to adopt the most favorable geometry for substrate binding and transition state stabilization. Asn14 is highly conserved in all LMWPTPases. Arg17 is a key residue for substrate binding in MPtpA and is equivalent to Arg18 in HCPTPA. The PTP loop residues are arranged in such a way that all the amide protons between Cys11 and Ser18 face the center of the loop, and they are well positioned for phosphate binding; Arg17 also contributes two hydrogen bonds for interaction with phosphate oxygen. Both of the structures exhibited a chloride ion at the phosphate-binding region of the active site that interacts with the main chain nitrogens of Thr12 and Gly13 and with the side chain NH2 of Arg17 in MPtpA (A) (Fig. 1C). In the case of the MPtpA (B) structure, the position of the chloride ion is shifted by 3.45 Å with respect to that of MPtpA (A), and the chloride ion interacts with the main chain nitrogens of Ile15 and Cys16 due to the additional binding of the glycerol moiety in the active site.
Comparison of the active site with those of the other eukaryotic LMWPTPases indicated that MPtpA also has a crevice leading from one side of the molecule into the active site. The variable loop forms one wall and floor of the crevice, and the loop between ß4 and
5 containing the catalytically active residue Asp 126 forms the other wall. The crevice formed by Trp48, Tyr128, and Tyr129 creates a hydrophobic surface for the incoming phosphotyrosine substrate (Fig. 1D and E). The side chains of these residues line the wall near the active site. The charge pattern observed in the variable-loop region of MPtpA is different than that of others. The presence in the variable loop of His49 and Ser52 in MPtpA compared to the presence of Glu50 and Asn53 in HCPTPA and that of Asn50 and Arg53 in BPTP indicates the change in pattern of charge distribution around the wall of the crevice leading to the aromatic region at the active site (46) (Fig. 1D and E). The fact that these two residues are in close proximity to the active site, where phosphotyrosine binds, indicates that these two residues at positions 49 and 52 may be crucial in determining the substrate specificity. In bacterial LMWPTPases, His49 is either conserved (except in Streptococcus spp. and Bifidobacterium spp., where a Glu is present) or deleted, in contrast to what occurs with mammalian LMWPTPases, where this position is occupied by different amino acids (Fig. 2). The glutamate at position 91 in mammalian LMWPTPases is replaced by a positively charged residue, arginine, in MPtpA. In addition to this substitution, the presence of charged or polar residues (for example, Asp in MPtpA, where the side chain carboxylate group of Asp faces the active site) at position 123 in most bacterial LMWPTPases (except Photorhabdus spp.), compared to the presence of a hydrophobic residue, Ile, around the active site in mammalian LMWPTPases, alters the charge distribution significantly (Fig. 2). Thus, this unique charge distribution contributed by Asp123, Arg91, and His49 around the active site along with aromatic residues Trp48, Tyr128, and Tyr129 might have a role in the recognition of amino acids with different charges as part of potential peptide or protein substrates, in contrast to what is observed in mammalian LMWPTPases. This distinction can also explain the difference in the specific activities of bacterial and mammalian LMWPTPases towards the substrate p-nitrophenyl phosphate (9, 46).
We have presented here the structure of a low-molecular-weight protein tyrosine phosphatase in two crystal forms. This is the first structural report of a bacterial representative of LMWPTPases. Even though the structural comparison of the MPtpA with the eukaryotic LMWPTPases revealed overall structural similarities, features specific to bacterial enzymes and to mycobacteria could be discerned. These features include additional stabilization of the PTP loop and significant differences in the residues surrounding the active-site region besides common hydrophobic residues lining the active site. Though the structure of MPtpA is similar to those of other LMWPTPases, it may have a different substrate specificity that may lead to different physiological functions. Thus, this structural work provides a solid platform for exploring the mechanisms of the substrate specificities of MPtpA and other bacterial LMWPTPases that may be identified by biochemical and/or genetic screens.
| ACKNOWLEDGMENTS |
|---|
Dominique Caugant at the Norwegian Institute of Public Health is acknowledged for supplying DNA from M. tuberculosis. We thank Debabrata Das for support and encouragement.
| FOOTNOTES |
|---|
C.M. and E.R. contributed equally to this work. ![]()
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Appl. Environ. Microbiol. | Infect. Immun. | Eukaryot. Cell |
|---|---|---|
| Mol. Cell. Biol. | J. Virol. | Microbiol. Mol. Biol. Rev. |
| ALL ASM JOURNALS |