Previous Article | Next Article ![]()
Journal of Bacteriology, June 2005, p. 3671-3677, Vol. 187, No. 11
0021-9193/05/$08.00+0 doi:10.1128/JB.187.11.3671-3677.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Bioinformatics Graduate Program, Boston University,1 Department of Biomedical Engineering, Boston, Massachusetts 02215,3 New England Biolabs, 32 Tozer Road, Beverly, Massachusetts 019152
Received 1 August 2004/ Accepted 1 March 2005
|
|
|---|
|
|
|---|
Through computer analysis of microbial genes, we suggested the existence of segmentally variable genes (SVGs), whose products have a modular architecture composed of highly variable domains and well-conserved domains (24). The variable domains are typically over 70 amino acids (aa) in length and lack conserved sequence features that might suggest their function. Among many, we noticed one SVG family, members of which all encode proteins with a conserved C-terminal domain and a variable N-terminal domain (Fig. 1). Since it includes gene ypdF from Escherichia coli, we call it the YpdF family. In the products of these genes, the conserved C-terminal domains show strong global similarity to the 264-amino-acid-long methionine aminopeptidase (Map) in E. coli (4). The variable N-terminal domains, with average lengths of about 100 aa, do not show any detectable similarity to any known domains. It is known that the protein product of E. coli map is a metalloaminopeptidase and is activated in vitro by cobalt ions (19). The presence of an extra N-terminal domain plus a characteristic C-terminal domain similar to that of Map in the YpdF family resembles the domain structure observed in one of the two methionine aminopeptidases in Saccharomyces cerevisiae (6, 16). Most genes in the YpdF family are unknown, with no experimental evidence on detailed substrate specificities. It seemed possible that SVG family members encode similar aminopeptidase activity.
![]() View larger version (20K): [in a new window] |
FIG. 1. Schematic alignment of YpdF and its homologs in several selected completely sequenced microbial species. Black boxes represent conserved blocks reported by BLOCKS (13) and are numbered I to VII according to their sequential order. Conserved residues involved in metal ion binding are shown at the bottom of the blocks. Lines connecting the boxes are variable segments. Sequences are identified on the left, with the GenBank accession number followed by the species of origin. The sequence on the first line is YpdF of E. coli. The triangles in the N terminus show the start points of N-terminal deletions. The last sequence is the original methionyl aminopeptidase of E. coli. Notice the absence of the extra N-terminal domain in this sequence.
|
25% identity) to a previously reported archaeal-type deblocking aminopeptidase (17). Like ypdF, most genes from the YpdE family remain uncharacterized. |
View larger version (6K): [in a new window] |
FIG. 2. Genomic region in E. coli that includes the two genes studied in this paper (ypdE and ypdF, shaded). The numbers shown between the genes are intergenic distances in base pairs. The arrowheads of the boxes indicate the direction of transcription. ypdE and ypdF overlap by 1 base with TAATG, in which TAA is the stop codon of ypdF and ATG is the start codon of ypdE. Gene names are from the ASAP database (10). ypdF and ypdE correspond to b2385 and b2384, respectively, in GenBank genome record U00096. Annotations for other genes are as follows: ypdD, PTS system component IIA and enzyme I; ypdG, PTS system component IIC; ypdH, PTS system component IIB; glk, glucokinase.
|
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Primers used in this study
|
Expression and purification of recombinant proteins. pET28a carrying an ORF encoding a six-His-tagged protein was transformed into E. coli strain ER2566 (New England Biolabs). Transformed cells were cultured in LB medium supplemented with 100 µg/ml kanamycin at 37°C to mid-log phase. Protein expression was induced with 1 mM isopropyl-ß-D-thiogalactopyranoside (IPTG), and the cells were incubated at 30°C overnight. Cells from 10 ml of culture were harvested by centrifugation at 4,000 x g for 20 min and stored at 20°C for 30 min. Frozen cells were resuspended in 0.7 ml of lysis buffer (300 mM NaCl, 50 mM NaH2PO4, 10 mM imidazole, pH 8.0) and then briefly sonicated in ice. The cell lysates were centrifuged at 14,000 x g for 20 min at 4°C, and the supernatant was loaded onto an Ni-nitrilotriacetic acid column. The column was then washed twice with 10 ml of washing buffer (300 mM NaCl, 50 mM NaH2PO4, 20 mM imidazole, pH 8.0). The column bound protein was eluted in 0.7 ml of elution buffer (300 mM NaCl, 50 mM NaH2PO4, 250 mM imidazole, pH 8.0). The final protein concentrations as determined from Bradford assays are approximately 0.4 µg/µl for YpdF and 0.3 µg/µl for YpdE.
For pMALc2x constructs, transformed cells were cultured in LB medium supplemented with 100 µg/ml ampicillin at 37°C to mid-log phase. The remaining purification steps follow the same procedure as described in reference 12.
TLC assay for aminopeptidase activity. Purified recombinant protein was assayed on a panel of peptides (BACHEM; New England Biolabs) to test their substrate specificity. All substrate peptides were dissolved in Tris buffer (10 mM, pH 7.6) to 1 mg/ml. Reactions were done in a total volume of 30 µl, with 25 µl of peptide substrate, 3 µl of purified enzyme, and 2 µl of metal ion (10 mM). The reaction mixture was incubated at 37°C for 30 min and then resolved by thin-layer chromatography (TLC) on a Silica Gel 60 plate (EMD Chemicals Inc.) using ethanol-isopropanol-water at a volume ratio of 1/2.1/0.9. The plate was then sprayed with ninhydrin (0.2% in ethanol) and heated. Images of the plate were taken under UV light at 366 nm.
Mutagenesis in the N-terminal domain of YpdF.
We used a two-step PCR procedure (12) to generate mutations in YpdF targeted at the conserved motif in the N-terminal domain. Briefly, the segment encompassing the sequence for this motif was first deleted from ypdF and then replaced with a synthetic double-stranded oligonucleotide with the desired mutations. In the first step, a pET28a plasmid with the inserted ypdF ORF was used as the template; oligonucleotides b2385f and b2385_mr were used as the forward and backward primers to obtain the 5' region of ypdF; oligonucleotides b2385_mf and b2385r were used as primers to obtain the 3' region of ypdF. In the second PCR step, a 1:1 mixture of purified 5' and 3' PCR products from the first step was used as the template; oligonucleotides b2385f and b2385r were used as the forward and backward primers. Compared with intact ypdF, the PCR product (ypdF
) from the second PCR step has a short internal segment deleted. Meanwhile, a new recognition sequence for PmlI (CAC
TG) was created at the deletion site. The purified PCR product was ligated into pET28a to generate pET28a-ypdF
. Next, we synthesized two complementary oligonucleotides with the desired amino acid changes and ligated it into the PmlI-digested plasmid pET28a-ypdF
as a replacement for the deleted segment. The resulting plasmids were then analyzed by DNA sequencing after purification.
|
|
|---|
Genes showing strong similarity to ypdF are widely distributed in other sequenced genomes. Part of the list is shown in Fig. 1. Interestingly, several genomes have multiple ypdF homologs (Fig. 1). The percent identity in the N-terminal-domain sequences of the multiple copies within the same genome is usually not high (<40%) and much less than the percent identity in the C-terminal-domain sequences, suggesting that these duplicated genes may have diverged.
YpdE is similar to an archaeal deblocking aminopeptidase, and the ypdEF structure is not conserved in other microbial genomes.
YpdE in E. coli shows moderate similarity (
24% identity) to the previously reported metal ion-dependent deblocking aminopeptidase (PH0519) in the archaeon Pyrococcus horikoshii (1, 17) and to an aminopeptidase in Haloarcula marismortui (
21% identity in an
150-aa region) (8). It is known that PH0519 shows broad aminopeptidase activity on nonblocked peptides and blocked peptides by acyl group (1). All three residues which are suggested to be involved in metal ion binding (17) are conserved (data not shown). However, computational analysis suggests that YpdE is not clearly orthologous to PH0519 because there exist multiple ypdE paralogs in both the P. horikoshii and E. coli genomes. Although the ypdE and ypdF ORFs overlap by 1 base in E. coli (including strains K-12, CFT073, and O157H7), this feature seems unique to E. coli and is not found in closely related species such as Salmonella enterica serovar Typhi or Shigella flexneri, etc. In the E. coli genome, the genes flanking ypdEF (Fig. 2) encode a sugar phosphotransferase (PTS) system that is responsible for carbohydrate transport (9).
Expression and purification of YpdF and YpdE in E. coli. We expressed the active recombinant YpdE and YpdF proteins in E. coli. The six-His-tagged proteins were purified by one-step chromatographic procedures using an Ni-nitrilotriacetic acid column. YpdF has 361 aa with a calculated molecular mass of 39.6 kDa, and YpdE has 345 aa with a molecular mass of 37.4 kDa. The purified recombinant YpdF and YpdE proteins exhibit single bands on sodium dodecyl sulfate-polyacrylamide gels with estimated molecular masses of 41 kDa and 38 kDa (data not shown).
YpdF has limited Xaa-Pro aminopeptidase and methionyl aminopeptidase activities. We found that YpdF has limited methionyl aminopeptidase activity when it was tested on a variety of peptides starting with L-methionine (Fig. 3a). A TLC assay shows that YpdF is capable of hydrolyzing the N-terminal methionine when the next amino acid is alanine, proline, or serine (Fig. 3a). It does not show detectable methionine-releasing activity when the second amino acid is glycine, lysine, or leucine, etc., among all tested peptides (Fig. 3a and Table 2) or when the first methionine residue is modified as formylmethionine or acetylmethionine (Table 2). Compared with the original methionyl aminopeptidase in E. coli (4), YpdF has a much narrower specificity in methionine cleavage. The map product in E. coli exhibits higher methionyl aminopeptidase activity when the second amino acid is glycine, alanine, proline, or serine, etc. (14). However, YpdF does not exhibit a similar property; for instance, it does not cleave methionine when the second amino acid is glycine (Fig. 3a).
![]() View larger version (133K): [in a new window] |
FIG. 3. TLC plate image showing substrate specificity of YpdF. (a) The lanes are as follows (from left to right): 1, Met; 2, Met-Ala-Ser; 3, Ala-Ser; 4, Met-Ala-Ser+YpdF; 5, Met-Ser-Gly; 6, Ser-Gly; 7, Met-Ser-Gly+YpdF; 8, Met-Pro-Gly; 9, Pro-Gly; 10, Met-Pro-Gly+YpdF; 11, Met-Gly-Gly; 12, Gly-Gly; 13, Met-Gly-Gly+YpdF; 14, Met-Gly-Met; 15, Gly-Met; 16, Met-Gly-Met+YpdF. (b) The lanes are as follows (from left to right): 1, Ala-Pro-Ala; 2, Ala; 3, Ala-Pro-Ala+YpdF; 4, Tyr-Pro-Ile-Ser-Leu; 5, Tyr; 6, Tyr-Pro-Ile-Ser-Leu+YpdF; 7, Asn-Pro-Thr-Asn-Leu-His; 8, Asn; 9, Asn-Pro-Thr-Asn-Leu-His+YpdF; 10, Leu-Asp-Leu-Leu-Phe-Leu; 11, Leu; 12, Leu-Asp-Leu-Leu-Phe-Leu+YpdF.
|
|
View this table: [in a new window] |
TABLE 2. Oligpeptides used in this study
|
YpdE is an aminopeptidase with broad specificity but has no detectable deblocking aminopeptidase activity. We tested if YpdE has deblocking activity against an array of acetyl or formyl group-blocked oligopeptides using TLC and matrix-assisted laser desorption ionization-time of flight (MALDI-TOF) mass spectrometry (data not shown). However, no detectable deblocking aminopeptidase activity was observed. Instead, we noticed that YpdE has a broad aminopeptidase activity on nonblocked peptides by progressively cleaving amino acids off the peptide substrate (Fig. 4). Its aminopeptidase activity stops at the residue before the first proline in the peptide, which was also observed for the related deblocking aminopeptidase homolog in archaea (1). This may be due to the unique imide bond in proline, which restricts the overall N-terminal conformation of the oligopeptides and disrupts the contacts between residues and active sites of the enzyme. From a panel of oligopeptides we examined by MALDI-TOF assay, we observed that YpdE can cleave most amino acids in the N terminus of the peptide (Table 2). Note that it does not cleave when proline is the first N-terminal residue (Table 2).
![]() View larger version (13K): [in a new window] |
FIG. 4. MALDI-TOF analysis of YpdE activity. The substrate used was the peptide Gln-Ala-Thr-Val-Gly-Asp-Val-Asn-Thr-Asp-Arg-Pro-Gly-Leu-Leu-Asp-Leu-Lys. The spectrum on the top is the reaction mixture of the substrate with enzyme supplemented with Co2+. The spectrum on the bottom is the control with just the substrate.
|
![]() View larger version (107K): [in a new window] |
FIG. 5. Divalent metal ion dependence for YpdF and YpdE. (a) Divalent metal ion requirement for YpdF. The substrate used was Met-Ala-Ser. Metal ions included Co2+ (lane 4), Cu2+ (lane 5), Zn2+ (lane 6), Ni2+ (lane 7), Mg2+ (lane 8), Ca2+ (lane 9), Mn2+ (lane 10), and Fe2+ (lane 11). Other lanes are as follows: lane 1, MAS only; lane 2, AS only; lane 3, methionine only; lane 12, MAS with no metal ions but YpdF; lane 13, MAS with no enzyme but Co2+. (b) Divalent metal ion requirement for YpdE. The substrate used was Met-Ala-Ser. Metal ions included Co2+ (lane 5), Mn2+ (lane 6), Cu2+ (lane 7), Fe2+ (lane 8), Ca2+ (lane 9), Ni2+ (lane 10), Zn2+ (lane 11), and Mg2+ (lane 12). Other lanes are as follows: lane 1, MAS only; lane 2, alanine only; lane 3, methionine only; lane 4, MAS with YpdE but no metal ion.
|
Partial deletion and mutagenesis in the N-terminal domain of YpdF. We asked whether the conserved C-terminal domain in YpdF is a stand-alone domain for the aminopeptidase activity. We then constructed two forms of YpdF with the N-terminal domain completely or partially deleted (Fig. 1a). We expressed and purified them as MBP (maltose-binding protein) fusion proteins (unpublished data). For control purposes, we also expressed the intact ypdF and purified the product as an MBP fusion protein. Purified MBP-YpdF shows the same activity as six-His-tagged YpdF (data not shown). However, YpdF with the N-terminal domain completely or partially deleted (Fig. 1a) loses its original aminopeptidase activity (data not shown). This suggests that the N-terminal domain is essential for the in vitro function of YpdF. However, we cannot rule out the possibility that the loss of the N terminus disrupts the overall folding of the protein.
Using the motif finding program MEME (2) and the N-terminal sequences extracted from YpdF and its homologs, we found a short motif conserved within a subgroup of the YpdF homologs from a number of distantly related species (Fig. 6a). In the motif, two sites that are completely conserved are a negatively charged aspartate (D) and a positively charged arginine (R) (Fig. 6b). Another site appears to be dominated by aromatic residues: tyrosine (Y) or phenylalanine (F) (Fig. 6b). The fact that this motif is conserved across a diverse group of species in YpdF homologs suggests that it might be functionally important. We did not find any match to motifs of known function in motif databases. To investigate the possible role that this motif might play, we expressed a mutated YpdF protein in which the D, R, and Y sites were changed to A (DRY2A) and compared the mutated YpdF protein with the wild type. We found that the substrate specificity profile does not change between the wild type and this mutant. However, the mutated YpdF protein appears to have lost its methionyl aminopeptidase activity against the substrate Met-Ala-Ser while the activity against Met-Pro-Gly remains at the same level (data not shown). More work is needed to examine the functional role of this motif within the N terminus.
![]() View larger version (49K): [in a new window] |
FIG. 6. Conserved motif in the N-terminal domain of YpdF. (a) Location of the conserved motif in a number of YpdF homologs. The black box is the conserved short motif, and the line represents the variable N-terminal domain. Gray blocks represent the conserved C-terminal domains. (b) Amino acid alignment of the conserved motif. Two completely conserved sites, aspartate (D) and arginine (R), and one position dominated by aromatic residues are indicated by dots at the bottom. The numbers in brackets are the residue numbers corresponding to this N-terminal domain. Motif finding was performed using MEME (2).
|
|
|
|---|
During previous screens for methionyl aminopeptidase activity, YpdF may be missed because of the limited nature of the peptides used (4, 22). Ben-Bassat et al. (4) used the peptide Met-Gly-Met-Met to screen an E. coli clone library, but this is not a substrate for YpdF. Similarly, during the screen for Xaa-Pro aminopeptidases, polyproline was used (22). Again, this is not a substrate for YpdF because YpdF cannot process peptides with an N-terminal proline.
The cellular roles of both YpdF and YpdE remain elusive. In a recent study (7), neither mRNA nor protein product of YpdF and YpdE were detected in the crude cell lysate of E. coli. However, the fact that both genes are evolutionarily conserved across distantly related species suggests they may be functionally important (15). Among all substrates tested, the maximum length of the peptides that YpdE can digest is 18 and the minimum is 2. It has a broad aminopeptidase activity and therefore could be involved in the ATP-independent downstream processing in cytosolic protein degradation pathways.
The segmentally variable feature of the YpdF family suggests that the attachment of an extra N-terminal domain may provide a convenient way of evolving new functions from existing protein scaffolds. If this is generally true, we may see it in other peptidase families in the database. The exact function of the N-terminal domains of YpdF and its homologs remains elusive. The N-terminal domains of methionyl aminopeptidases in S. cerevisiae harbor zinc finger motifs and short stretches of basic amino acids, which are suggested to be responsible for binding to the ribosome (20). However, in YpdF and its homologs, we did not observe a similar compositional bias in the N termini. E. coli PepP is similar to the type Ib methionyl aminopeptidase in domain structure but has a much longer N terminus than that in most type Ib methionyl aminopeptidases. Could the length of the N-terminal domains be the leading factor that distinguishes the two, or have all of the PepP proteins, which seem to have evolved from the methionyl aminopeptidase family, preserved some of the methionyl aminopeptidase activity? This should be tested experimentally. The crystal structure is available for aminopeptidase P, and it forms a tetramer (dimer of dimers) (21). Part of the N-terminal domain is responsible for contacts between subunits. Again, this may suggest another possible function of the N terminus of the YpdF family of proteins.
This work was supported by New England Biolabs Inc.
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»