Previous Article | Next Article ![]()
Journal of Bacteriology, January 2008, p. 251-263, Vol. 190, No. 1
0021-9193/08/$08.00+0 doi:10.1128/JB.00826-07
Copyright © 2008, American Society for Microbiology. All Rights Reserved.
,
State Key Laboratory of Bioorganic and Natural Product Chemistry, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 354 Fenglin Rd., Shanghai 200032, China
Received 26 May 2007/ Accepted 23 October 2007
|
|
|---|
|
|
|---|
-amino nitrile moiety (at the C-21 position), and the departure of the nitrile moiety from C-21 in the presence of reduced cofactors allows the formation of an electrophilic iminium ion that alkylates the guanine residues of double-stranded DNA (17, 18, 23, 34, 35). Although it has frequently been speculated that covalent modification of DNA is essential for the antitumor activity of SFM-A, recently the identification of glyceraldehyde-3-phosphate dehydrogenase (a putative key transcriptional coactivator necessary for entry into the S phase of cell proliferation) as a protein target of SFM-DNA adducts suggested that the action of SFM-A involves a protein-drug-DNA interaction; thus, a distinct pathway may be involved for SFM-A antiproliferative activity (31, 48).
![]() View larger version (30K): [in a new window] |
FIG. 1. Structures of saframycins, safracins, and ecteinascidin 743.
|
Previous feeding experiments using isotope-labeled substrates showed that the backbone of SFM-A is derived from one alanine (Ala), one glycine (Gly), and two tyrosine (Tyr) residues, suggesting that it is of tetrapeptide origin (28), which is also probably shared among other structurally related analogs, such as saframycin Mx1 (SFM-Mx1) and safracin B (SAC-B) as shown in Fig. 1. The partial biosynthetic gene cluster of SFM-Mx1 (with a hydroquinone form of the E ring, a hydroxy group at the C-21 position, and a reserved
-amino group of Ala in comparison to SFM-A) and the entire biosynthetic gene cluster of SAC-B (one of the structurally simplest members in the SFM family) were cloned from Myxococcus xanthus in 1995 (32, 33) and Pseudomonas fluorescens in 2005 (46), respectively, indeed revealing a nonribosomal peptide synthetase (NRPS) system for the formation of an identical tetrapeptide intermediate. In both cases, sequential incorporation of Ala, Gly, and Tyr derivatives into the backbone was speculated to be catalyzed by NRPSs in a colinear way according to the substrate specificity of the NRPS modules. This biosynthetic logic was formulated using bioinformatics analysis, but to date, no biochemical studies on SFM-Mx1 and SAC-B have been reported.
We hypothesized that SFM-A is biosynthesized in a manner similar to that of SFM-Mx1 and SAC-B according to their conserved structure. Here we report the cloning and sequencing of the SFM-A biosynthetic gene cluster from S. lavendulae NRRL 11002 and propose biochemical functions for the deduced gene products. Sequence analysis and genetic comparison revealed a common strategy of NRPS-directed tetrapeptide assembly during the biosynthesis of SFM-A, SFM-Mx1, and SAC-B. However, in contrast to speculations from prior reports regarding SFM-Mx1 and SAC-B, we predicted that the same tetrapeptide backbone is catalyzed by NRPSs using the last module in an iterative manner rather than following the typical colinear rule. To confirm this prediction, we heterologously produced and purified proteins containing each adenylation domain of the NRPS modules and determined their substrate specificity using an ATP-PPi exchange assay. Thus, these findings shed new insight into tetrahydroisoquinoline biosynthesis and afford the opportunity to study iterative events during nonribosomal peptide biosynthesis. Finally, production of SFM-Y3, an aminated analog of SFM-A, was achieved by heterologous expression of the hydroxylase SfmO4 in the SAC-B producer P. fluorescens FERM BP-14, demonstrating the feasibility of producing tetrahydroisoquinoline analogs by rationally engineering of an established biosynthetic pathway in microorganisms. The availability of the gene clusters and biosynthetic pathways of SFM-A, SFM-Mx1, and SAC-B has paved the way for future studies regarding the unusual biochemistry found in this pathway and subsequent attempts of applying this knowledge for combinatorial biosynthesis.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Bacterial strains and plasmids used in this study
|
Genomic library construction and screening. A genomic library of S. lavendulae NRRL 11002 was constructed in Super-Cos1 according to a previously published protocol (22). E. coli VCS257 and Gigapack III XL packaging extract (Stratagene, La Jolla, CA) were used for library construction according to the manufacturer's instructions. The NRPS gene probes for library screening were obtained by PCR amplification and confirmed by sequencing. For PCR products P1 and P2, a 1.2-kb fragment was obtained by using the primers 5'-TACACGTCCGGCACSACSGGCAARCCNAARGG-3' and 5'-AWCGAGKSGCCSGGGSMGAAGAA-3'. For PCR product P3, a 0.8-kb fragment was amplified by using the primers 5'-GACAACTTCTTCGAGCTGGGSGGSSAYTC-3' and 5'-GCGGACCAACTTCTCCGCSRCCCAYTTRCT-3'. The genomic library (6.0 x 103 colonies) was screened by colony hybridization with P2 as a probe, and resultant positive clones were further confirmed by Southern hybridization with P2 and P3 as probes.
Sequence analysis.
The open reading frames (ORFs) were deduced from the sequence by performing FramePlot 3.0beta program (http://watson.nih.go.jp/
jun/cgi-bin/frameplot-3.0b.pl). The corresponding deduced proteins were compared with other known proteins in the databases by available BLAST methods (http://www.ncbi.nlm.nih.gov/BLAST/). Amino acid sequence alignments were performed by the CLUSTALW method, and the DRAWTREE and DRAWGRAM methods, respectively, from BiologyWorkBench 3.2 software (http://workbench.sdsc.edu). Prediction of amino acid specificity of individual NRPS A domains was performed by using the BLAST server provided at the website http://bix.umbi.umd.edu/Projects/nrps/ (see Table S2 in the supplemental material).
Production, isolation, and analysis of SFM-A in S. lavendulae. S. lavendulae wild-type and recombinant strains were grown on ISP-2 (0.4% glucose, 0.4% yeast extract, and 1% malt extract [pH 7.2]) agar plates (with the appropriate antibiotic for recombinant strains) at 30°C for sporulation. For fermentation, 200 µl of spore suspension (1.0 x 106 to 1.0 x 107/ml) of each S. lavendulae strain was inoculated onto a YSA (0.1% yeast, 0.5% soluble starch, 1.5% agar, pH 7.5) plate and incubated at 27°C for 7 days. A piece of YSA with spores was then transferred into a 500-ml flask containing 50 ml of fermentation medium (0.1% glucose, 1% soluble starch, 0.5% NaCl, 0.1% K2HPO4, 0.5% casein acid hydrolysate, 0.5% meat extract, pH 7.0) and incubated at 27°C and 250 rpm for 30 to 36 h.
For SFM-A isolation, each 50 ml of the culture broth was filtered and adjusted to a pH of 6.8. After treatment with 1 mM KCN at 35°C for 30 min, the filtered broth was extracted thrice with 30 ml of ethyl acetate, and the combined extract was finally concentrated to 100 µl in a vacuum.
High-performance liquid chromatography (HPLC) analysis was carried out on a Microsorb-MV 100-5 C18 column (4.6 by 250 mm) (catalog no. SN 281505; Varian). The column was equilibrated with 50% solvent A (H2O, 0.05% trifluoroacetic acid) and solvent B (CH3CN, 0.05% trifluoroacetic acid) and developed with the following program: 0 to 5 min, 90% solvent A and 10% solvent B; 5 to 25 min, a linear gradient from 90% solvent A and 10% solvent B to 15% solvent A and 85% solvent B; 25 to 27 min, constant 15% solvent A and 85% solvent B; 27 to 29 min, a linear gradient from 15% solvent A and 85% solvent B to 90% solvent A and 10% solvent B; and 29 to 30 min, constant 90% solvent A and 10% solvent B. This was carried out at a flow rate of 1 ml/min and UV detection at 270 nm using a Agilent 1100 HPLC system (Agilent Technologies, Palo Alto, CA). The identity of a compound was confirmed by coinjection with standard SFM-A and liquid chromatography-mass spectrometry (LC-MS) analysis performed on an LCMS-2010 A liquid chromatograph mass spectrometer (Shimadzu, Japan) under the same conditions. SFM-A showed [M+H]+ ion at m/z 563.0, consistent with the molecular formula C29H30N4O8.
Production, isolation, and analysis of SACs and SFMs in P. fluorescens. For fermentation, 50 µl of frozen vegetative stock of each P. fluorescens strain was transferred into a 250-ml flask containing 50 ml of YMP3 medium (1% glucose, 0.25% beef extract, 0.5% Bacto peptone, and 0.8% CaCO3, pH 6.5). After incubation at 27°C for 30 h, 2% of this seed medium was then transferred into a 500-ml flask containing 100 ml of M-16B medium [15.2% D-mannitol, 3.5% dried brewer's yeast, 1.4% (NH4)2SO4, 0.001% FeCl3, and 2.6% CaCO3, pH 6.5]. Strains were incubated at 27°C for 40 h, and the expression of sfmO3 to sfmK or expression of sfmO4 was induced by the addition of isopropyl-β-D-thiogalactopyranoside (IPTG) to a final concentration of 0.2 mM, and cultures were incubated at 24°C for an additional 72 to 96 h. The same fermentation procedure was applied to the wild-type strain as a compatible control.
For the production of cyano-substituted analogs, the supernatants of fermentation cultures were treated with 1 mM KCN at 35°C for 30 min.
The analysis on each filtered culture was carried out by using the same set of HPLC and LC-MS conditions as those for SFM-A detection described above. The column was equilibrated with 50% solvent A (10 mM NH4Ac, 1% diethanolamine, pH 4.0) and solvent B (CH3CN), and was developed with the following program: 0 to 5 min, 93% solvent A and 7% solvent B; 5 to 25 min, a linear gradient from 93% solvent A and 7% solvent B to 15% solvent A and 85% solvent B; 25 to 27 min, constant 15% solvent A and 85% solvent B; 27 to 29 min, a linear gradient from 15% solvent A and 85% solvent B to 93% solvent A and 7% solvent B; and 29 to 30 min, constant 93% solvent A and 7% solvent B. This was carried out at a flow rate of 1 ml/min and UV detection at 268 nm. The identity of compound was confirmed by LC-MS analysis under the same conditions. SAC-B showed [M+H]+ ion at m/z 541.2, consistent with the molecular formula C28H36N4O7. Cyano-substituted SAC-B showed [M+H]+ ion at m/z 550.2, consistent with the molecular formula C29H35N5O6. Aminated SFM-S showed [M+H]+ ion at m/z 555.2, consistent with the molecular formula C28H34N4O8. SFM-Y3 showed [M+H]+ ion at m/z 564.0, consistent with the molecular formula C29H33N5O7.
Determination of substrate specificities of SfmA, SfmB, and SfmC. For amino acid-dependent ATP-PPi exchange assay, a typical reaction (100 µl) was carried out in 75 mM Tris-HCl (pH 8.0) buffer, containing 50 to 100 nM NRPS protein, 5 mM ATP, 1.0 mM PPi with 0.5 µCi of 32PPi (40.02 Ci/mmol; NEN Life Science Products, Boston, MA), 10 mM MgCl2, 5.0 mM dithiothreitol, and a 1.0 mM concentration of each of various amino acids. After incubation at 30°C for 30 min, each assay was stopped by the addition of 0.5 ml of 1% (wt/vol) activated charcoal in 4.5% (wt/vol) tetrasodium pyrophosphate and 3.5% (vol/vol) perchloric acid. The precipitate was collected on a glass-fiber filter (2.4 cm) (G-4; Fisher, Pittsburgh, PA), washed successively with 10 ml of 40 mM sodium pyrophosphate plus 1.4% perchloric acid, 10 ml of water, and 5 ml of 95% ethanol and briefly dried in air. The filter was mixed with 5 ml of scintillation fluid (ScintiSafe gel; Fisher) and counted on a Beckman LS-6800 scintillation counter to determine the radioactivity.
Supplemental material. See the supplemental material for supporting data including deduced function of open reading frames beyond the SFM-A biosynthetic gene cluster boundary, prediction of amino acid recognitions of NRPS A domains, structures of SFMs, inactivation and complementation in S. lavendulae, biotransformation in P. fluorescens, overexpression and purification of SFM NRPSs, and chemical synthesis of Tyr derivatives in this study.
Nucleotide sequence accession number. The sequence reported in this paper has been deposited into GenBank under the accession number DQ838002.
|
|
|---|
Previous studies on SFM-Mx1 and SAC-B biosynthesis revealed a relatively rare reductase (RE) domain that contains a NAD(P)H binding site at the C-terminal ends of SafA and SacC (33, 46), both of which are NRPSs involved in the tetrapeptidyl backbone formation. These RE domains may act on the PCP-tethered polypeptidyl intermediate and reductively release it from the PCP as a linear aldehyde (21), instead of the thioesterase (TE) functionality at the C-terminal ends of typical NRPSs. Since the RE domain is conserved and always resides next to the last PCP, we adopted an alternative strategy to specifically clone the putative RE gene fragment by PCR according to two motifs [DNFFEL(G/D)GHS in the PCP domain and RVLKEAVWKS in the RE domain]. A single product with the expected size of 0.8 kb was readily amplified and cloned from the genomic DNA of S. lavendulae. Sequence analysis of the randomly selected clones confirmed that 80% of them contain an identical product, P3, and the sequence of P3 exhibits significant similarity to the sequences of the RE domains of the safA and sacC genes. To identify the locus on the chromosome, Southern analysis of the previously identified cosmids obtained by P2 screening was performed using the 0.8-kb P3 fragment as a probe. Intriguingly, most cosmids showed a positive signal, and a single 7.4-kb BamHI fragment that harbors both P2 and P3 was detected. All together, these results provided strong support that we have cloned the SFM-A gene cluster in S. lavendulae.
The DNA region represented by cosmid pTL2101 (partial), pTL2102 (entire), and pTL2103 (partial) was selected for sequencing, yielding a 62,804-bp contiguous sequence with 71.86% of the overall GC content that is characteristic for Streptomyces DNA. Bioinformatic analysis of the sequenced region revealed 47 ORFs, and 30 of the ORFs from sfmR1 to sfmO6 were proposed to constitute the SFM-A gene cluster according to functional assignment of their deduced products and genetic comparison with the gene cluster for SAC-B biosynthesis (Fig. 2 and Table 2; see also Table S1 in the supplemental material). Mutant strains with inactivated orf(–1) and orf(+1) retained the ability to produce SFM-A as deduced from HPLC analysis (see Fig. S3 in the supplemental material), supporting the idea that they are outside the sfm gene cluster. Consistent with the structure of SFM-A, the ORFs within the sfm cluster presumably include eight genes encoding enzymes involved in the biosynthesis of the SFM-A tetrapeptidyl backbone, nine genes encoding the tailoring enzymes, as well as three regulatory genes, two resistance genes, five genes involved in S-adenosylmethionine (SAM) recycling, and three additional genes whose functions could not be predicted or assigned for SFM-A production.
![]() View larger version (19K): [in a new window] |
FIG. 2. Genetic organization and comparison of the saf (SFM-Mx1), sfm (SFM-A), and sac (SAC-B) biosynthetic gene clusters. The proposed functions of individual ORFs are shown and summarized in Table 1.
|
|
View this table: [in a new window] |
TABLE 2. Deduced functions of ORFs in the SFM-A biosynthetic gene cluster
|
![]() View larger version (28K): [in a new window] |
FIG. 3. Proposed biosynthetic pathways for 3-hydroxy-5-methy-O-methyltyrosine (A) and saframycin A (B).
|
![]() View larger version (19K): [in a new window] |
FIG. 6. Domain organizations and proposed enzymatic mechanisms of NRPS systems for SFM-A (A), SFM-Mx1 (B) (33), and SAC-B (C) (46) biosynthesis.
|
![]() View larger version (17K): [in a new window] |
FIG. 4. HPLC analysis of saframycin and safracins. (A) Saframycin A (a) isolated from an authentic standard (I), S. lavendulae wild-type strain (II), strain TL2003 ( sfmB) (III), and strain TL2004 (PermE*::sfmB) (IV). (B) Safracin B (b) isolated from a P. fluorescens wild-type strain (I), cyano-substituted safracin B (c) isolated from a P. fluorescens wild-type strain treated with 1 mM KCN (II), aminated saframycin S (d) isolated from strain TL2102 (Ptac::sfmO4) (III), and saframycin Y3 (e) isolated from strain TL2102 (Ptac::sfmO4) treated with 1 mM KCN (IV). Absorbance of UV at 270 nm (A) or 268 nm (B) in milliabsorbance units (mAU) is shown on the y axes, and time (in minutes) is shown on the x axes.
|
Genes encoding tailoring enzymes. Postmodifications on the tetrapeptidyl intermediate compound 1, including cyclization, methylation, oxidoreduction, and nitrile moiety substitution, are postulated to proceed with a set of tailoring enzymes in the SFM-A biosynthetic pathway as outlined in Fig. 3B. The gene products of sfmCy1 and sfmCy2 exhibit high sequence similarity to MitR (67% and 45% identity, respectively), which might be responsible for the C8a-C9 bond formation in mitomycin biosynthesis (24). In a mechanistic analogy, SfmCy1 and SfmCy2 may act on compound 1 as cyclases to close the B and D rings at C9-C1 and C19-C11, although their regiospecificities need to be determined. Noticeably, such homologous genes have not been identified within the SAC-B biosynthetic gene cluster in P. fluorescens. It would be interesting to further determine whether these homologs can be identified in the P. fluorescens chromosome or whether they have any function regarding the formation of the B and E rings.
sfmM1 and sfmO2 encode proteins that have high sequence similarity to SacI (43% identity) and SacJ (43% identity), respectively, both of which have been functionally assigned to catalyze the last two steps for SAC-B biosynthesis on the basis of identifying shunt metabolites resulted from sacI or sacJ inactivation (46). Although the catalytic order remains to be established, it is likely that SfmM1 acts as a N-methyltransferase to introduce a methyl group at the N-12 position, and SfmO2 serves as a monooxygenase responsible for hydroxylation at the C-5 position on the A ring, which then undergoes a dehydrogenation to form the quinone ring of SAC-B. Based on this hypothesis, interestingly, SAC-B might be a key intermediate in the SFM-A biosynthetic pathway (Fig. 3B).
SFM-A structurally differs from SAC-B with a heavily oxidized E ring. Heterologous expression of SfmO4 in the SAC-B producer resulted in aminated SFM-S production (described below), supporting the hypothesis that SfmO4 acts on SAC-B at the C-15 position for a hydroxyl substitution. sfmO1, encoding a putative NAD(P)+-dependent oxidoreductase, might catalyze oxidation of the resultant hydroxyl on the E ring in vivo, producing the characteristic bisquinone core scaffold (Fig. 3B). The further desamination of Ala and substitution of a nitrile moiety at the C-21 position are proposed to yield SFM-A. Alternatively, it could not be excluded that the oxidative desamination step, which is predicted to be catalyzed by a putative FAD-dependent monooxygenase SfmO6, occurs at an earlier stage during the tailoring process. Since previous studies showed that treatment of SFM-S (an SFM-A precursor with a hydroxyl group instead of a nitrile moiety) with sodium cyanide led to the formation of SFM-A (2), the substitution of a nitrile moiety may be spontaneous, consisting with no obvious gene candidate identified within the sfmA cluster.
The S. lavendulae wild-type strain also produces a series of SFM derivatives with additional oxidation or O methylation at the C-14 position (as shown as a hydroxyl, methoxy, or keto group in Fig. 1) (40), suggesting that the branched biosynthetic pathways may start with the intermediate compound 3 and aminated SFM-S (or their desamino derivatives). The putative cytochrome P450 enzyme SfmO3 (coupled with the ferredoxin-like protein SfmK) presumably initiates the oxidative bioconversion at this position.
Genes encoding regulation, resistance, and other functions. Three genes (Fig. 2B), sfmR1, sfmR2, and sfmR3, are presumed to encode pathway-specific regulatory proteins. While SfmR1 resembles the TetR family of transcriptional regulators widely found in many microorganisms, SfmR2 and SfmR3, with high similarity to MitQ (63% and 51% identity, respectively) in the mitomycin biosynthetic pathway (24), belong to the OmpR family of DNA binding regulators in the two-component system.
Two resistance genes (Fig. 2B), sfmG and sfmH, were found in the sfm cluster. In contrast to SfmG that belongs to a family of transmembrane efflux permeases that usually exhibits multiple drug resistance, such as AraJ (37% identity) in the aranciamycin biosynthetic pathway (41), SfmH shows high sequence similarity to a family of UV repair proteins, such as CmrX (57% identity) in the chromomycin biosynthetic pathway (27), representing a more specific resistance protein in agreement with the mechanism of action of SFM-A as a DNA alkylation agent.
Sequence analysis within the sfm cluster revealed five genes (Fig. 2B), sfmS1 to sfmS5 as a complete set for the recycling of SAM from S-adenosylhomocysteine (SAH, a by-product in the SAM-dependent methylation reaction). SfmS1, a putative S-adenosyl-L-homocysteine hydrolase, may cleave SAH to adenosine and homocysteine. The latter could be methylated and converted into methionine by SfmS2, a putative methionine synthase, with N5-methyl tetrahydrofolate (N5-methyl THF) as the cosubstrate. SfmS5, closely related to a family of SAM synthetases, might be responsible for the generation of SAM from methionine and ATP. N5-methyl THF as a methyl donor originates from N5,N10-methylene THF, requiring SfmS2 as a putative N5,N10-methylene THF reductase. SfmS4 shows high sequence similarity to a family of adenosine kinases, presumably in charge of ATP regeneration by converting adenosine to AMP. The pathway for recycling SAH to SAM has been well established in primary metabolism and recently was identified to be involved in a few biosynthetic pathways for secondary metabolites, such as the polyketide-isoprenoid compound furanonaphthoquinone I (16). Since multiple SAM-dependent methylations at C, O, and N positions occur in the SFM-A biosynthetic process, the advantage of involvement of this complete pathway might facilitate enhancement of the supply of the SAM precursor.
Finally, three genes within the sfm cluster could not be functionally assigned on the basis of sequence analysis alone (Fig. 2B). sfmE encodes a protein that resembles proteins in the peptidase M28 family. The deduced products of two coupled genes, sfmI and sfmJ, exhibit high sequence similarity to MflvDRAFT_0798 (35% identity) and MflvDRAFT_0799 (41% identity), respectively, and the genes encoding MflvDRAFT_0798 and MflvDRAFT_0799 are clustered within the genome of Mycobacterium flavescens PYR-GCK (under the NCBI accession number NZ_AAPA01000017). Although SfmJ contains a putative pyridoxamine 5'-phosphate oxidase domain, their roles in SFM-A biosynthesis could not be speculated.
Determination of substrate specificities of SfmA, SfmB, and SfmC by utilizing amino acid-dependent ATP-PPi exchange assay. Initial attempts to directly determine substrate specificities of individual A domains (i.e., SfmA-A1, SfmB-A2, and SfmC-A3) or intact SfmA from the SFM-A NRPS system were hampered by either poor solubility of the resultant proteins in E. coli or low enzymatic activities (data not shown). Thus, truncated SfmA (C1-A1-PCP1) and the intact SfmB (C2-A2-PCP2) and SfmC (C3-A3-PCP3-RE) were heterologously expressed in E. coli by using pET28a yielding soluble N-terminal His-tagged proteins with a yield around 5 to 10 mg/liter. Using nickel affinity chromatography, gel filtration, or anion-exchange chromatography in tandem, all proteins were purified to near homogeneity as shown in Fig. 5A, and sodium dodecyl sulfate-polyacrylamide gel electrophoresis revealed a dominant band consistent with their deduced molecular masses (124 kDa, 119 kDa, and 163 kDa, respectively). Among the substrates used in the ATP-PPi exchange assay (L-Ala, D-Ala, L-Ala-L-Gly, L-Gly, pyruvate, L-Cys, L-Tyr, L-3h5mOmTyr, L-3mOmTyr, L-OmTyr, L-3hTyr, and L-Phe), as shown in Fig. 5B, truncated SfmA (C1-A1-PCP1), SfmB (C2-A2-PCP2), and SfmC (C3-A3-PCP3-RE) specifically recognized and activated L-Ala, L-Gly, and L-3h5mOmTyr, respectively.
![]() View larger version (35K): [in a new window] |
FIG. 5. (A) Purified SfmA-C1-A1-PCP1 (lane 1), SfmB-C2-A2-PCP2 (lane 2), and SfmC-C3-A3-PCP3-RE (lane 3) as analyzed by electrophoresis on a 7.5% sodium dodecyl sulfate-polyacrylamide gel. The positions of molecular mass markers (lane 4) (in kilodaltons) are shown to the right of the gel. (B) Substrate specificities as determined by the ATP-PPi exchange reaction with the amino acids predicted to be incorporated into SFM-A (100% relative activity corresponds to 51,540 cpm for SfmA-C1-A1-PCP1, 44,300 cpm for SfmB-C2-A2-PCP2, and 51,500 cpm for SfmC-C3-A3-PCP3-RE).
|
|
|
|---|
SfmA, SfmB, and SfmC constitute an NRPS system that exhibits similarities in domain organization and amino sequence from head to tail to those for SFM-Mx1 and SAC-B biosynthesis (Fig. 6). With the exception of the first module that is not found in the SAC-B NRPS system, the eight specificity-conferring amino acids for each A domain in the remaining three modules are almost identical, suggesting a common logic for the assembly of the tetrapeptide intermediate during SFM-A, SFM-Mx1, and SAC-B biosynthesis. Based on the colinearity rule (25), wherein the NRPS module organization parallels the order of the amino acid residues in the resultant polypeptide, sequential incorporation of Ala, Gly, and two Tyr derivatives into the tetrapeptide in SAM-Mx1 biosynthesis was previously speculated to be directed by four successive modules (i.e., SafB-AL-PCP0, SafB-C1-A1-PCP1, SafA-C2-A2-PCP2, and SafA-C3-A3-PCP3-RE) (Fig. 6B) (33). Since SacA in SAC-B biosynthesis lacks the first module AL-PCP0, a bifunctional adenylation activation by SacA or direct incorporation of an Ala-Gly dipeptide into the tetrapeptide by SacA was hypothesized (Fig. 6C) (46). In both systems, the last two modules (SafA-C2-A2-PCP2 and SafA-C3-A3-PCP3-RE or SacB-C2-A2-PCP2 and SacC-C3-A3-PCP3-RE) were suggested to be responsible for activation and incorporation of each Tyr derivative, 3h5mOmTyr.
Our sequence analysis revealed that the N-terminal domain of SfmA in SFM-A biosynthesis, as well as that of SafB in SFM-Mx1 biosynthesis, lacks the expected conserved motifs of A domains and closely resembles the AL family, suggesting that SfmA-AL-PCP0 or SafB-AL-PCP0 might not be involved in amino acid incorporation like typical NRPS modules. Furthermore, the lack of this complete module in SAC-B NRPS system suggests that it may not be essential for tetrapeptide biosynthesis. Comparisons of the eight specificity-conferring amino acids of the A2 domain (DILQLGLI) and the A3 domain (DPWGLGLI) show they are significantly different, and thus, it is unlikely that the third module (SfmB-C2-A2-PCP2, SafA-C2-A2-PCP2, or SacB-C2-A2-PCP2) activates and incorporates the same 3h5mOmTyr residue as the fourth module does (SfmC-C3-A3-PCP3-RE, SafA-C3-A3-PCP3-RE, or SacC-C3-A3-PCP3-RE). In fact, question on the original assignments of substrate specificities for the SFM-Mx1 NRPS system was raised in bleomycin biosynthesis (10), in which reexamination of the SafAB sequence suggested that SafB-A1 serves as a candidate for Ala activation, while SafA-A2 recognizes and activates Gly. Consequently, these NRPS systems most likely catalyze the formation of a tetrapeptide intermediate using the last module in an iterative manner rather than following a typical colinearity rule, as shown in Fig. 6A. To confirm this hypothesis, we performed an amino acid-dependent ATP-PPi exchange assay to determine the substrate specificities of SfmA, SfmB, and SfmC. As we anticipated, truncated SfmA (C1-A1-PCP1), SfmB (C2-A2-PCP2), and SfmC (C3-A3-PCP3-RE) showed exclusive activities with L-Ala, L-Gly, and L-3h5mOmTyr, respectively, strongly supporting the hypothesis that SfmC acts twice to incorporate two L-3h5mOmTyr residues into the tetrapeptide (Fig. 6A). This finding, as well as the recent emergence of many unusual NRPS systems (30, 39, 47), indicates a rich variety of biochemistry and architecture of NRPSs beyond those previously appreciated.
NRPS systems that are distinct from the current paradigm fall into one of two categories: they either contain at least one module with an atypical arrangement of the core domains, exemplified by the syringomycin biosynthesis (15), or use all their modules iteratively for product assembly with the TE domain channeling the multimeric intermediates, exemplified by enterobactin biosynthesis (14). While the SFM-A NRPS system shares the classical A-PCP-(C-A-PCP)n domain organization found in linear NRPSs, the last module (SfmC), in contrast to the SfmA and SfmB modules, acts in an iterative fashion to produce a tetrapeptide intermediate. Furthermore, the TE domain that is typically found as the C-terminal domain of NRPS is substituted with a terminal RE domain. Since the C domain catalyzes transpeptidation between amino acyl thioesters without covalently binding the intermediate and since NRPSs are usually recognized to function as monomers (39), SfmC might channel the tripeptidyl or tetrapeptidyl intermediate in an unusual iterative manner that is distinct from any known NRPS system and mechanistically different from iterative events found in type I polyketide synthases (47). Knowledge of the structure of SfmC will be necessary to understand the selectivity and interaction of the domains involved and to eventually engineer novel NRPS enzymes, like SfmC, for combinatorial biosynthesis.
As shown in Fig. 3B, a RE domain that resides at the C terminus of SfmC may reductively release the tetrapeptide intermediate from the PCP3 as a linear aldehyde. A series of intramolecular cyclizations lead to the formation of the B, C, and D rings, characteristic of the tetrahydroisoquinoline family. Subsequently, regiospecific methylation, oxidation, desamination, and substitution of a cyano group successively occur to produce SFM-A. Genetic comparison revealed that the sfm gene cluster contains all the structural genes for SAC-B biosynthesis (Fig. 2), supporting our hypothesis that SAC-B, which was originally isolated from a Pseudomonas strain, might serve as a key intermediate for SFM-A biosynthesis. It is not surprising that the sfm cluster harbors additional genes, since their functions are required for further modifications of the shared intermediate, such as multiple oxidations of the E ring. Heterologous expression of SfmO4 (a hydroxylase responsible for the introduction of a hydroxyl group at C-15 position) in the SAC-B producer resulted in a bisquinone derivative, aminated SFM-S, which was then converted into an aminated SFM-A analog, SFM-Y3, by treatment of the fermentation culture with potassium cyanide (Fig. 4B). Thus, our results are consistent with parallel biosynthetic pathways for SFMA-A and SAC-B production.
In conclusion, the availability of the sfm biosynthetic gene cluster described here provides an excellent opportunity to access the unusual enzymatic mechanism of SFM-A biosynthesis. Sequence analysis and genetic comparison revealed a common strategy for tetrapeptidyl assembly among SFM-A and its analogs (SFM-Mx1 and SAC-B), and biochemical determination of substrate specificities for the A domains supported that the backbone formation is catalyzed by a multimodular NRPS system in a semi-iterative manner rather than following a previously proposed colinear rule. On the basis of the SAC-B biosynthetic machinery, we reconstructed an aminated SFM-A pathway by heterologously expressing a regiospecific hydroxylase SfmO4 in the SAC-B-producing Pseudomonas strain, thereby demonstrating a unified mechanism of biosynthesis among the tetrahydroisoquinoline compounds and the feasibility of engineering bacterial strains to generate new or otherwise scarce bisquinone alkaloid analogs.
This work was supported in part by grants from the National Natural Science Foundation of China (20621062, 20402021, 30425003, and 30525001), the Ministry of Science and Technology of China (2006AA02Z185), the Chinese Academy of Science (KJCX2-YW-H08), and the Science and Technology Commission of Shanghai Municipality (04DZ14901 and 05QMX1466).
Supplemental material for this article may be found at http://jb.asm.org/. ![]()
Published ahead of print on 2 November 2007. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»