Previous Article | Next Article ![]()
Journal of Bacteriology, May 2008, p. 3256-3263, Vol. 190, No. 9
0021-9193/08/$08.00+0 doi:10.1128/JB.01381-07
Copyright © 2008, American Society for Microbiology. All Rights Reserved.
,
Biotech Center and Department of Plant Biology and Pathology, Rutgers University, New Brunswick, New Jersey 08901,1 Department of Molecular Biology, Princeton University, Princeton, New Jersey 085442
Received 24 August 2007/ Accepted 14 February 2008
|
|
|---|
|
|
|---|
-amino adipic acid. The
-amino adipic acid pathway is found in most fungi (32), and a modification of it is found in selected eubacterial and archaeal species (21). Four different variants of the m-DAP-lysine pathway have been discerned, and they are depicted in Fig. 1. All share the initial and terminal steps but differ in the reactions at the center of the pathway. The common reactions include the first, in which aspartate β-semialdehyde is condensed with pyruvate to produce dihydrodipicolinate; the second, in which dihydrodipicolinate is reduced to L-2,3,4,5-tetrahydrodipicolinate (THDPA); and the final reaction, in which m-DAP is decarboxylated to form lysine. The enzymes catalyzing these reactions, dihydrodipicolinate synthase (DapA, EC 4.2.1.52), dihydrodipicolinate reductase (DapB, EC 1.3.1.26), and m-DAP decarboxylase (LysA, EC 4.1.1.20), are conserved in species carrying any of the variants of the m-DAP pathway (13, 19, 32).
![]() View larger version (15K): [in a new window] |
FIG. 1. The known variants of the DAP-lysine biosynthesis pathways. The chemical structures of intermediates are shown on the left. The name of the pathway is indicated at the top of the diagram, and the individual steps including enzyme symbol are shown below. In the DAP dehydrogenase and DAP aminotransferase diagrams, only the step(s) that differs from the succinyl and acetyl-DAP pathways is shown.
|
The most recent DAP pathway to have been discovered uses two enzymes to convert THDPA to m-DAP (14, 19). The distinguishing enzyme of this pathway catalyzes the glutamate-dependent transamination of THDPA to form LL-DAP (EC 2.6.1.83). m-DAP is then formed by an epimerase, as in the acyl-DAP-pathways. Therefore, the aminotransferase carries out in a single step a metabolite transformation that requires three enzymes, DapD, DapC, and DapE, in E. coli. The LL-DAP aminotransferase has been reported from plants, cyanobacteria, and Chlamydia (14, 19), where it appears to be the sole route for m-DAP-lysine biosynthesis. This assumption is based on the absence of orthologs for the acyl and Ddh pathway enzymes in the genome sequences of these species and the absence of acyl pathway enzyme activities demonstrated from several different plant species (4, 13, 19). The plant and cyanobacterial enzymes show exquisite substrate specificity in that they are able to distinguish LL-DAP from its isomer m-DAP. The chlamydial orthologs have some ability to use m-DAP, although less efficiently than LL-DAP. In recognition of the reaction that it catalyzes, LL-DAP aminotransferase has been named DapL, and this designation will be used throughout the present paper. Based upon its constrained substrate specificity, DapL does not appear to be closely related to the DapC aminotransferase that functions in the acyl-DAP pathways. In fact, at least three different and divergent aminotransferases are known to catalyze the DapC reaction (6, 8, 12, 17, 18). The crystal structure of the DapL protein from Arabidopsis thaliana has been solved, providing insight into the substrate specificity of the enzyme (35).
The presence of a DapL pathway in plants, cyanobacteria and Chlamydia raised the question of how widely this enzyme is distributed in prokaryotes and how it relates to the evolution of the DAP pathway. The present study made use of the extensive list of sequenced microbial genomes to identify and functionally verify additional DapL orthologs. The results indicate that DapL exists as two divergent groups showing a restricted phylogenetic distribution in both the archaea and eubacteria.
|
|
|---|
ORF cloning. For the open reading frames (ORFs) that were cloned for expression in E. coli along with the primers used for amplification by PCR, see Table S1 in the supplemental material. All ORFs were cloned initially into pET30a (Novagen Corp.) using the restriction sites introduced by PCR (see the italicized sequences of Table S1 in the supplemental material), with the exception of slr1666 (a Synechocystis sp. gene) which was cloned initially into pGEM T-Easy (Promega Corp). The sequences were vetted in the entry plasmid. pET30a ORFs were transformed into E. coli BL21-CodonPlus-RIPL for protein expression. For complementation analysis an expression cassette consisting of the entire ORF, the His tag coding sequence, and the ribosome binding site from pET30 was subcloned into pBAD33 (11) using XbaI and SalI or XbaI and HindIII for Morella thermoacetica Moth_0889. The slr1666 expression cassette was subcloned from pGEM-T-Easy to pQE30 using EcoRI and PstI (Qiagen Corp.). The pBAD33-derived or pQE-derived plasmid was used for complementation of E. coli mutant strains.
Functional complementation.
ORFs were tested for functional complementation of dapD (AT980), dapE (AT984) (obtained from the Coli Genetic Stock Center, Yale University), and a dapD dapE double mutant strain (AOH1). AOH1 was constructed by P1 transduction of a
dapD::kan allele from JC7623 (7) into AT984 as previously described (14). The strains were transformed with the pBAD33- or pQE-derived plasmids and were selected on LB medium supplemented with 50 µg ml–1 DAP (DL-
,
-diaminopimelic acid; Sigma-Aldrich product D-1377) and 34 µg ml–1 chloramphenicol (pBAD-ORF clones) or 100 µg ml–1 ampicillin (pQE-slr1666). Individual colonies were replica plated onto LB medium supplemented with 50 µg ml–1 DAP (pQE-slr1666) or onto LB medium without DAP; colonies were grown under inducing or repressing conditions with 0.2% (wt/vol) arabinose or 0.2% (wt/vol) glucose, respectively (pBAD-ORF clones), and without isopropyl-β-D-thiogalactopyranoside (IPTG) or with 1 mM IPTG (pQE-slr1666). The cultures were grown at 30°C for 24 h.
Enzyme assays. Recombinant protein was expressed in E. coli grown on LB medium at 37°C to an optical density at 600 nm of 0.5, followed by induction with 1 mM IPTG for 4 h at 25°C. Cells were lysed by sonication in a solution of 100 mM HEPES-KOH (pH 7.6). Metabolites in the extract were removed by using an Amicon Ultra 30,000 molecular-weight-cutoff ultrafilter; the concentrated soluble protein sample was used for measurement of enzyme activity as an initial assessment of enzyme function. When the recombinant protein was to be purified, a larger cell culture was grown and lysed in 50 mM sodium phosphate and 300 mM NaCl (pH 8.0). The soluble protein was incubated with Talon metal affinity agarose (Clontech 8901-2), which was then washed three times with sodium phosphate-NaCl buffer containing 10 mM imidazole, and finally the bound protein was eluted with sodium phosphate-NaCl buffer containing 300 mM imidazole. The pure protein was then concentrated in an Amicon Ultra 30,000 molecular-weight-cutoff ultrafilter, replacing the elution buffer with 100 mM HEPES-KOH (pH 7.6).
For enzyme assays LL-DAP and m-DAP were isolated from culture filtrates of a lysine auxotroph and purified as described by Gilvarg (10). C. glutamicum Ddh (DdhCg) was produced as a recombinant protein expressed from plasmid pET28-CgDdh obtained from D. I. Roper (University of Warwick) in E. coli BL21(DE3). DdhCg accumulated to approximately 90% of the soluble protein and was highly active, so it was not further purified. The preparation converted m-DAP to THDPA at a rate of 20 µmol min–1 mg–1 of protein at 30°C.
Three different enzyme assays were used to study DapL. Two assays were used to measure the physiologically reverse activity (LL-DAP
THDPA), and one was used for the physiologically forward activity (THDPA
LL-DAP). The first of the reverse assays measured THDPA formation by its reaction with O-aminobenzaldehyde (OAB). This assay is not strictly quantitative due to the instability and batch-to-batch variation of OAB. Also the extinction coefficient of the dihydroquinazolium adduct that is formed is unknown. However, this assay is useful for comparative measurement of DapL activity in crude cell extracts. The OAB assay contained in a 1-ml reaction mixture 100 µmol of HEPES-KOH (pH 7.6), 0.5 µmol of LL-DAP (or other amino donor), 2.0 µmol of 2-oxoglutarate (or other amino acceptor), 1.25 mg of OAB, and DapL. Reaction mixtures were incubated at 30°C, and the
A440 was measured using a spectrophotometer. A second assay of the reverse activity was quantitative when pure enzyme was being measured. A 1-ml reaction mixture containing 100 µmol of HEPES-KOH (pH 7.6), 0.3 µmol of NADPH, 50 µmol of NH4Cl, 0.5 µmol of LL-DAP, 5 µmol of 2-oxoglutarate, 16 µg of DdhCg, and DapL was incubated at 30°C, and the
A340 was measured continuously. The reaction sequence is as follows: (i) LL-DAP + 2-oxoglutarate
THDPA + glutamate + H2O; (ii) THDPA + NH4+ + NADPH
m-DAP + NADP+ (A340).
Assay of the physiologically forward reaction was measured in a two-step reaction. In the first step, THDPA was formed from m-DAP in a 1-ml reaction mixture containing 100 µmol of HEPES-KOH (pH 7.6), 0.5 µmol of NADP+, various concentrations of m-DAP, and 32 µg of DdhCg. The reaction was run to completion, determined by monitoring the A340 value. Aminotransferase activity was then measured in the second step. The reaction contained, in addition to the components of the first reaction, 0.3 µmol of thio-NAD+, 0.3 µmol of coenzyme A (CoA), and 0.5 µmol of glutamate (for convenience, these components were added in the first-step reaction mixture). Then the aminotransferase assay was started by adding 200 µg of 2-oxoglutarate dehydrogenase (0.625 µmol min–1 mg–1 of protein) and DapL. Activity was determined by measuring the
A398 at 30°C. Thio-NAD+ was used to avoid spectrophotometric interference from NADP+. The two compounds have different absorbance maxima. The reaction sequence is as follows: (i) m-DAP + NADP+
THDPA + NH4+ + NADPH (A340); (ii) THDPA + glutamate + H2O
LL-DAP + 2-oxoglutarate; (iii) 2-oxoglutarate + thio-NAD+ + CoA
succinyl CoA + CO2 + H+ + thio-NADH (A398).
Kinetic constants were determined by measuring the initial reaction rate at various concentrations of reactants. The data were analyzed by nonlinear regression analysis and the Michaelis-Menten equation using GraphPad Prism, version 3.03 for Windows, GraphPad Software, San Diego, CA (www.graphpad.com).
|
|
|---|
In an effort to discern authentic forms of DapL, a lower limit of homology was approximated by identifying aminotransferases that are unlikely to be DapL. The proteins showing the best homology with sll0480 from bacterial species known to be devoid of DapL activity including Agrobacterium tumefaciens, Bacillus subtilis, and E. coli (14) ranged from 23% to 29% identity. The best match was an aminotransferase from B. subtilis showing 29% identity with sll0480 (see Table S2 in the supplemental material).
The sll0480 homologs demonstrating between 29% and 42% identity were further analyzed by examining their genomic context. The chromosomal contiguity of genes in prokaryotic genomes has proven to be a useful tool for inferring function and identifying metabolic networks, based on the hypothesis that functionally coupled genes are often clustered (24, 27). An example is the dapL gene from Protochlamydia (locus tag pc0685) that is located immediately downstream of dapB and dapA in an orientation that suggests all three may exist in an operon (19). By using the SEED database (23), it was observed that some dapL orthologs do indeed exist in genomic contiguity with other DAP-lysine biosynthesis genes. For example, dapL is situated immediately downstream of dapF in 10 species belonging to the Bacteriodetes, Firmicutes, Chloroflexi, and Deltaproteobacteria phylogenetic groups (see Table S3 in the supplemental material). The likely significance of the contiguity of this gene arrangement is emphasized by the fact that in the microbial genomes only a single other example exists in which dapF is contiguous with another aminotransferase. That example is slr1666 in Synechocystis sp., which lies immediately downstream of dapF. slr1666 is annotated as a degT aminotransferase and shows only 15% identity with sll0480. In Bacillus stearothermophilus, mutation of the degT gene results in complex phenotypic changes including altered production of extracellular enzymes, altered control of sporulation, loss of flagella, and abnormal cell division. Hence, it was initially described as a pleiotropic regulatory gene (30). In other genomes, dapL was found to be contiguous with lysA or with multiple dap synthesis genes or with combinations of lysA and multiple dap synthesis genes (see Table S3 in the supplemental material). The most extreme example of gene clustering was uniquely found in Syntrophobacter fumaroxidans, in which the dapL ortholog (Sfum_0054) is contiguous with lysA, dapF, dapA, and dapB. This grouping is of particular interest since it would comprise the entire set of genes necessary to synthesize lysine from aspartate β-semialdehyde, assuming that Sfum_0054 encodes DapL. Thus, in several noteworthy examples genomic context analysis provided support for a role of DapL in DAP-lysine synthesis. Moreover, the significance of the associations discovered with dapL orthologs is underscored by the fact that DAP-lysine biosynthetic genes generally show weak genomic associations (27).
Genomic contiguity analysis produced two surprising results. In a number of the best examples of gene clustering, the aminotransferase is highly diverged from sll0480. For example, Syntrophobacter Sfum_0054 shows 30% identity with either sll0480 or pc0685, a level of homology that was earlier taken as the approximate cutoff for authentic DapL (see Table S2 in the supplemental material). This finding suggested the possibility that DapL may exist as two divergent types. In fact, many of the best DapL candidates showing marginal homology with sll0480 were found to be closely related to Sfum_0054 (see Table S2 in the supplemental material). Another surprising result was that nearly all of the cyanobacteria and several species from other phylogenetic groups contain two DapL orthologs, one that is related to sll0480 and the other to Sfum_0054 (three examples are listed in Table S2 in the supplemental material). That DapL orthologs form two divergent groups that are distinct from other class I/II aminotransferases is graphically shown in the phylogenetic tree presented in Fig. 2. The tree was constructed by comparing representatives from DapL1 (those more closely related to sll0480) and DapL2 (more closely related to Sfum_0054) with a selection of other aminotransferases. Several examples of DapC and ArgD were included because these enzymes catalyze transamination in the acyl-DAP pathways (8, 12, 17), the reaction that is analogous to that catalyzed by DapL. Several examples of AspC and TyrB were included because these represent the prototype of class I/II aminotransferases. Most of the DapL orthologs were initially annotated as aspartate or tyrosine aminotransferases. Finally, all the class I/II aminotransferases in B. subtilis were included to provide an overall context for aminotransferases that are unlikely to be DapL. In addition to the divergence of DapL into two major groups, the phylogenetic tree shows that both DapL types are more closely related to DapC than they are to ArgD, AspC, or TyrB.
![]() View larger version (24K): [in a new window] |
FIG. 2. Phylogenetic analysis of DapL orthologs. The diagram is of a neighbor-joining tree produced by alignment with Clustal W using a gap penalty of 10 and a gap length penalty of 0.2. The tree was constructed by bootstrap analysis using MEGA, version 3.1. Locus tags are indicated. The identities of enzyme clusters are indicated. Homologous proteins that cannot be DapL proteins are marked with an asterisk. The same proteins are indicated in Table S2 in the supplemental material.
|
![]() View larger version (12K): [in a new window] |
FIG. 3. Complementation assay. Complementation was tested in the indicated mutant under inducing and repressing conditions. The vector was pBAD33, and sll0480 was the dapL ortholog from Synechocystis cloned into pBAD33. The constructs were tested for complementation of the dapD mutant and dapD dapE double mutant. Each construct was serially diluted in 0.85% (wt/vol) saline (from the left, optical density at 600 nm of 0.1, 0.01, and 0.001), and 5 µl was plated onto the indicated medium. Gene expression from the constructs was induced on medium containing 0.2% (wt/vol) arabinose (Ara) or was repressed on medium containing 0.2% (wt/vol) glucose (Glc). DAP was added when indicated at 50 µg ml–1.
|
|
View this table: [in a new window] |
TABLE 1. Functional tests of DapL orthologs
|
To further characterize the collection of DapL proteins, selected examples were chosen for purification and analysis of kinetic properties. It was possible to isolate soluble, active protein in four instances, L. interrogans LIC12841, M. thermoautotrophicus MTH52, D. hafniense Dhaf_1761, and M. thermoacetica Moth_0889. The first three are members of the DapL1 group whereas Moth_0889 is a member of DapL2. The kinetic experiments revealed that each of the DapL orthologs shows kinetic properties that are very similar to previously characterized forms of the enzyme from Arabidopsis and Chlamydia (Table 2). In all cases, the activity was significantly greater in the reverse direction (THDPA synthesis) than in the forward direction (LL-DAP synthesis). In addition, all showed a much higher affinity for THDPA and LL-DAP than for glutamate or 2-oxoglutarate. Interestingly, the overexpressed LIC12841 and MTH52 showed much higher Vmax values than Dhaf_1761 and Moth_0889. However, the higher specific activity may be related to the quality of the protein since the amount of soluble LIC12841 and MTH52 produced in E. coli was much greater than Dhaf_1761 and Moth_0889, for which the yield of soluble protein was low.
|
View this table: [in a new window] |
TABLE 2. Kinetic constants of DapL proteins
|
Microbial diversity of DAP pathways. Hudson et al. (14) and McCoy et al. (19) argued that DapL must represent an alternative DAP-lysine biosynthesis pathway because plant and chlamydia genomes encode enzymes for THDPA synthesis and m-DAP or lysine synthesis from LL-DAP but lack the genes encoding enzymes necessary to bridge the anabolic gap between THDPA and LL-DAP. Moreover, experimental evidence indicated that DapL can catalyze the conversion of THDPA to LL-DAP. The effort to annotate genomes based on what genes exist and what genes cannot be identified is referred to as "metabolic reconstruction," and this technique has proven useful in uncovering new variants of many different metabolic pathways (5, 22). The SEED database provides a facile means to analyze the coincidence of metabolic pathway genes in the microbial genomes, and it was used for the present analysis. Of the species containing a DapL1 or DapL2 ortholog, all were found to contain DapA, DapB, DapF, and LysA but lacked DapD and DapE. A representative list extracted from the SEED database is given in Table S5 in the supplemental material. In only a few species in the Firmicutes phylogenetic group, the presence of DapL was coincident with a strong Ddh candidate gene. This observation is an interesting parallel to the existence of redundant DAP pathways in Corynebacterium as previously reported (26).
Analysis of all the available microbial genomes revealed that about 14.0% harbor a DapL pathway. Nearly all the rest harbor one of the two pathways that utilize acylated intermediates. A clear-cut phylogenetic lineage for the DapL pathway was not evident. Other than the Cyanobacteria and Chlamydiae, for which all the examples in the genomes database contained a DapL pathway, the other groups, including Deltaproteobacteria, Euryarchaeota, Firmicutes, Spirochaetes, Desulfuromonadales, and Bacteroidetes, showed only selected examples of DapL pathway-containing species.
Analysis of the DapL sequence. Aminotransferases are a major challenge for classification because the superfamily is so large and diverse. Classification of DapL proteins presents a further challenge because the enzymes have significantly diverged into two sequence types, DapL1 and DapL2. The recent report of the crystal structure of DapL from Arabidopsis (35) provides the structural basis to compare the divergent forms of DapL. The sequence alignment shown in Fig. S1 in the supplemental material illustrates that all the residues defined as being involved in pyridoxal phosphate binding in Arabidopsis DapL are conserved in both the DapL1 and DapL2. Most of these residues are conserved with related aminotransferases. By contrast, only one of the three residues that Watanabe et al. (35) suggested is involved in binding of LL-DAP are conserved. For example, the residues homologous with Lys129 are conserved in both DapL forms, but Glu97 and Asn309 are conserved only in DapL1, not DapL2. As a result of the catalytic mechanism of aminotransferases, both the amino donor and acceptor substrates are thought to bind to the same catalytic site but can involve contact with different residues. For DapL, Tyr152 was suggested to be involved in binding to glutamate. This residue is conserved in both the DapL1 and DapL2 forms. Examination of the alignments shown in Fig. S1 in the supplemental material indicates many conserved positions between DapL1 and DapL2 and also many positions that distinguish the two forms of DapL. Further investigation of the structure of DapL1 and DapL2 will be required to determine the functional role of these residues.
|
|
|---|
The phylogenetic distribution of DapL was studied with the aim of understanding the evolution and biochemical properties of DAP biosynthesis. The study revealed that there are two major classes of DapL that differ significantly in sequence. The two forms appear to be loosely associated with specific microbial lineages. For example, DapL1 occurs most often in bacteria, and DapL2 is found most often in archaea. There also appear to be signatures of lateral gene transfer events. For example, M. thermoautotrophicus and Methanosphaera stadtmanae stand out because they are Euryarchaeota of the Methanobacteria group that contain DapL1, whereas all the other available Methanobacteria contain a DapL2. There are also indications of interoperonic gene replacements. For example, the gene arrangement dapA-dapB-dapL found in the DapL-containing Deltaproteobacteria appears to be conserved, yet S. fumaroxidans is unusual in that a DapL2 replaces the DapL1 form found in the other Deltaproteobacteria.
The DapL orthologs are approximately 410-amino-acid proteins whose sequence conservation identifies them as members of the pyridoxal-5'-phosphate-dependent protein superfamily of class I/II aminotransferases (15, 29). Based on sequence homology, all were initially misannotated as aspartate or aromatic amino acid aminotransferases, but subsequent experimental analysis revealed their true catalytic function. The recent determination of the crystal structure of DapL from the plant species Arabidopsis (locus tag At4g33680) has significantly added to the understanding of the amino acid residues that make up the active sites of the enzyme. Interestingly, it appears that the DapL2 form lacks two of the three residues that are conserved in DapL1 (Glu97 and Asn309). It is important to point out that Watanabe et al. (35) modeled LL-DAP binding to At4g33680 based on the structure of the enzyme crystallized with malate, which mimics substrate binding. It is surprising that greater kinetic differences are not associated with the sequence divergence. On the other hand, the extreme divergence has allowed the identification of absolutely conserved residues, suggesting critical roles in function. Further study will be required to understand how LL-DAP binding to DapL2 can occur with the substitution of these residues.
Excellent candidates for DapL are always accompanied by the absence of acyl Dap pathway genes, meaning that pathway redundancy is not usual. The exception was in certain organisms where there seemed to be excellent candidates for Ddh, meaning that these organisms could possess dual lysine anabolic pathways. A precedent for pathway redundancy that includes Ddh exists for C. glutamicum (26). In this species, the Ddh pathway provides a greater flux to m-DAP under growth conditions at high ammonium concentrations.
The phylogenetic distribution of the DapL pathway shows that it is present in only 14.0% of the sequenced microbial genomes. It is important to point out that the sequenced microbial genomes are not representative of microbial diversity. Indeed, environmental species are not well represented compared with pathogenic and saprophytic species. As the genomes of more microbial species are sequenced and annotated, the presence of the DapL pathway will likely increase.
We are grateful to the following individuals for contributing genomic DNA or cell material: Sheila Patrick, James M. Tiedje, Takakazu Kaneko, Albert Ko, Caroline Plugge, John M. Reeve, Stephen W. Ragsdale, and Hideto Takami. We especially thank Valérie de Crécy-Lagard and Andrew D. Hanson for their help in using the SEED genome annotation tool.
Published ahead of print on 29 February 2008. ![]()
Supplemental material for this article may be found at http://jb.asm.org/. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»