Previous Article | Next Article ![]()
Journal of Bacteriology, July 2005, p. 4884-4889, Vol. 187, No. 14
0021-9193/05/$08.00+0 doi:10.1128/JB.187.14.4884-4889.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Hideaki Yukawa,2
Masayuki Inui,2 and
Roy H. Doi1*
Section of Molecular and Cellular Biology, University of California, Davis, Davis, California 95616,1 Research Institute of Innovative Technology for the Earth, Kyoto 619-0292, Japan2
Received 31 January 2005/ Accepted 6 April 2005
|
|
|---|
A consensus promoter sequences of gram-positive bacteria. EngO contains a typical N-terminal signal peptide of 28 amino acid residues, followed by a 149-amino-acid sequence which is homologous to the family 4-9 carbohydrate-binding domain. Downstream of this domain was an immunoglobulin-like domain of 89 amino acids. The C terminus contains a family 9 catalytic domain of glycosyl hydrolase. Mass spectrometry analysis of EngO was in agreement with that deduced from the nucleotide sequence. Expression of engO mRNA increased from early to middle exponential phase and decreased during the early stationary phase. EngO was highly active toward carboxymethyl cellulose but showed no activity towards xylan. It was optimally active at 40 to 50°C and pH 5 to 6. The analysis of the products from the cellulose hydrolysis through thin-layer chromatography indicated its endoglucanase activity. |
|
|---|
C. cellulovorans also produces noncellulosomal enzymes such as EngD (17), EngF (40), ArfA (28), and BgaA (28) that work synergistically with the cellulosomal enzymes (28). Thus far, 12 cellulosomal enzymatic subunits and 4 noncellulosomal enzymes from C. cellulovorans have been sequenced, including cellulases, xylanases, a mannanase, and a pectate lyase from eight different glycoside hydrolase families (9). Among the cellulosomal cellulase genes identified, five encode family 9 glycoside hydrolases, i.e., EngK (48), EngM (48), EngY (46), EngH (48), and EngL (45).
The nucleotide sequence, expression, and the characterization of the engO gene and its product EngO, a family 9 noncellulosomal endoglucanase from C. cellulovorans, are reported in this paper. A detailed transcriptional analysis of the engO promoter and its regulation during growth phase were carried out in order to gain some understanding of the expression pattern of this gene relative to the cellulosomal genes. The results indicate that EngO is a major noncellulosomal cellulase and that it is produced coordinately with the cellulosome. The analysis of its hydrolytic products indicates that it is an endoglucanase and produces primarily cellobiose. It is of interest that most of the cellulolytic cellulosomal and noncellulosomal enzymes produced by C. cellulovorans are members of glycosyl hydrolase family 9.
|
|
|---|
Fosmid library construction, screening, and DNA sequencing. A C. cellulovorans fosmid library was constructed as described previously (19). The procedure yielded 3,000 recombinant clones. The Luria-Bertani (LB) plates, on which the recombinant E. coli were grown, were overlaid with soft agar containing 0.3% carboxymethyl cellulose (CMC; Sigma) and 0.7% agar in 25 mM sodium acetate buffer (pH 6.0). After incubation at 37°C for 16 h, the plates were stained with 0.3% Congo red and destained with 1 M NaCl. The clones that formed the halos were selected as CMCase-positive colonies. The CMCase-positive clones were picked and restreaked to confirm formation of clearing zones around the colonies on LB-CMC, followed by applying colony hybridization (Roche) according to the manufacturers' instructions with previously identified gene-specific probes such as cbpA, engE, engB, and engD (20, 22). The knockout (CMC-negative) clones were then isolated by using the EZ::TN <oriV/KAN-2> insertion system (Epicentre) (19). The transposon insertion sites of mutant clones were mapped by DNA sequencing with the transposon-specific flanking primers (Epicentre).
5' RLM-RACE and Northern blot analysis. The RNA ligase-mediated rapid amplification of cDNA ends (RLM-RACE) and Northern blot technique were carried out with total RNA extracted from a C. cellulovorans culture grown on cellobiose and were used to determine the transcription start points and size of mRNA. Rapid amplification of 5' cDNA ends was carried out using a FirstChoice RLM-RACE kit (Ambion) according to the manufacturer's instructions with the following exceptions. The nested PCR conditions for 5' outer PCR were with 10 pmol gene-specific outer primer engO-5'-Outer (5'-TAAGCACGATAATCTCCACC), 1.25 units of Ambion's SuperTaq polymerase, 10 pmol 5' RACE outer primer (5'-GCTGATGGCGATGAATGAACACTG) (Ambion), 1x SuperTaq PCR buffer (Ambion), 100 µM deoxynucleoside triphosphates, 1 ng/µl first-strand cDNA reaction, and H2O to 50 µl. The PCR conditions were as follows: (1x) 94°C, 4 min; (35x) 94°C, 30 s; 60°C, 30 s; 72°C, 40 s; and (1x) 72°C, 7 min. The 5' inner PCR was carried out with 10 pmol gene-specific inner primer engO-5'-Inner (5'-CGTTGAAAAACTAGCATAACTTCCA) and 10 pmol 5' RACE inner primer (5'-CGCGGATCCGAACACTGCGTTTGCTGGCTTTGATG) (Ambion) using the same conditions as for the 5' outer PCR. PCR products were observed on 2% agarose gels and sequenced. For Northern blotting, RNA samples (5 µg) were separated by 1% gel electrophoresis and blotted onto a nylon membrane (Ambion). The membrane was treated with engO probe (5'-AAAAGGGGAATAGTGATATG/5'-TAAGCACGATAATCTCCACC) or previously identified gene-specific probes (22) and detected as described previously (19).
Preparation of noncellulosomal proteins. The noncellulosomal proteins were purified from culture supernatants of C. cellulovorans cells as described previously (41). The culture supernatants were obtained from the cultures at stationary phase (4 to 7 days) by centrifugation. The supernatants were precipitated by 80% (wt/vol) ammonium sulfate saturation and dialyzed. The extracellular material was then mixed with Avicel, which resulted in binding of the cellulosome complex to Avicel. After incubation for 1 h at 4°C, the suspension was poured into a column. The column was washed with 3 volumes of 50 mM Tris-HCl buffer (pH 7.5) to elute the unattached fractions as the noncellulosomal proteins. The unattached fraction was concentrated with Ultrafree Biomax (10-kDa cutoff; Millipore) before being subjected to gel filtration on a HiLoad 26/60 Superdex 200 prep grade column (320 ml; Amersham Biosciences) equilibrated with 50 mM Tris-HCl buffer (pH 7.5) by using the fast protein liquid chromatography system (Amersham Biosciences). The concentration of fractionated protein was measured by the method of Bradford (4) with a protein assay kit from Bio-Rad, using bovine serum albumin as the standard.
SDS-PAGE, zymogram, and mass spectrometry analysis. The recombinant enzyme was isolated from culture supernatants of a CMCase-positive clone as described previously (19). Sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) was performed with a 10% polyacrylamide gel by the method of Laemmli (30). Proteins were fixed in the gels by soaking in a solution containing 40% (vol/vol) methanol and 10% (vol/vol) acetic acid for approximately 1 h and subsequently visualized by Coomassie blue staining (Genomic Solutions). The zymograms for CMCase were performed by using a 0.1% CMC (wt/vol) incorporated into the polyacrylamide. After SDS-PAGE, the gels were renatured in a renaturation buffer (100 mM succinic acid, 10 mM CaCl2, 1 mM dithiothreitol, pH 6.3) for 2 h at 25°C with gentle shaking. The renatured gel was then incubated in a fresh renaturation buffer for 1 h at 37°C with gentle shaking. The clearing zones corresponding to enzyme activities were visualized with 0.3% (wt/vol) Congo red (stained for 10 min and destained with 1 M NaCl solution) (2). Mass spectrometric analysis was performed to identify the engO on cellulosome, and recombinant proteins were separated by SDS-PAGE as described previously (22).
TLC. Cellulose degradation products were determined by thin-layer chromatography (TLC) on precoated TLC sheets (silica gel; Whatman) with acetone-ethylacetate-acetic acid (2:1:1, vol/vol/vol) (32). The plates were visualized by spraying with a 1:1 (vol/vol) mixture of 0.2% methanolic orcinol and 20% sulfuric acid.
Nucleotide sequence accession number. The nucleotide sequence reported here has been submitted to GenBank and can be accessed under accession number AY646113.
|
|
|---|
In order to localize the promoter, the transcription start site of engO was determined by 5' RLM-RACE PCR analysis described previously (19). A 336-bp PCR fragment (engO-5'-Inner/5' RACE inner primer) was generated for DNA sequencing of the upstream region up to the transcription start site of engO. One clear ending sequence, corresponding to position 264 bp upstream from the A of the first ATG, was obtained by DNA sequencing of the 5' RLM-RACE PCR product. The engO mRNA start point suggested a putative promoter sequence, TTGCAA and TAATAT, with a 17-bp spacing between them (Fig. 1).
![]() View larger version (28K): [in a new window] |
FIG. 1. Alignment of putative engO promoter located at 10 and 35 bases upstream of the start point of transcription. Transcription start points are indicated by the bent arrows. The nucleotide numbering begins from the first codon shown on the right. The consensus (Cons) sequence derived from this alignment is given at the bottom. It consists of nucleotides that are present in any given position in more than 50% of the sequences. Promoter sequence nucleotides that match those of the consensus sequence are in black boxes. The coordinates refer to the published nucleotide sequences: cbpA (42), GenBank accession no. M73817; engE (47), GenBank accession no. AF105331; xynB (19), GenBank accession no. AY604045; and manA (45), GenBank accession no. AF132735.
|
A) and E. coli (
70) RNA polymerases (36, 50) (Fig. 1). The spacing between the 35 and 10 regions was 17 bp, which is the optimal spacing observed for B. subtilis and E. coli promoter sequences (6, 23). The close similarity of the promoter to the consensus
A sequence suggests that the engO promoter, if not subjected to any regulatory constraints, would act as a strong promoter in vivo (8, 24).
Northern hybridizations of the RNA with an engO probe showed a single transcript of 2.6 kb (Fig. 2B). This mRNA was in good agreement with the size of the engO gene (2,172 bp) and indicated that engO was a monocistronic gene. On the basis of the size of engO mRNA and the location of the transcription start site, a putative transcription terminator that consists of a 47-bp palindromic sequence, corresponding to an mRNA hairpin loop with a
G of 72.0 kcal/mol (5, 44), was found downstream of the TAA termination codon at nucleotide position 3665 (GenBank accession number AY646113). This structure is similar to the rho factor-independent terminator of E. coli (36).
![]() View larger version (56K): [in a new window] |
FIG. 2. Relative levels of transcripts of engO at different growth phases of cellobiose cultures. (A) C. cellulovorans growth curve. (B) Northern blot analyses were conducted with the same concentrations of RNA (5 µg) isolated from C. cellulovorans culture grown on 0.5% cellobiose as the sole carbon source. Ethidium bromide staining of rRNA is shown as a loading control. The numbers of the lanes in panel B represent the numbers of the growth curve points in panel A.
|
Amino acid sequence analysis of EngO. Comparison of the amino acid sequence of EngO with those registered in SWISS PROT and GenBank databases clearly revealed that the mature EngO consisted of three distinct functional domains. The deduced N-terminal sequence of 28 amino acids contains a sequence similar to the signal peptide sequences found in prokaryotic secretory proteins (33). The N-terminal domain of the mature form of EngO, 149 amino acid residues downstream of the signal peptide, is homologous with the family 4 (subfamily 9) carbohydrate-binding domain (CBM 4-9) of other endoglucanases, i.e., 27% sequence identity with CelN of Clostridium thermocellum (52), 27% identity with EgVII of Ruminococcus albus (34), 26% identity with LamA of Thermotoga neapolitana (51), and 26% identity with CenC of Cellulomonas fimi (7).
The N-terminal immunoglobulin (Ig)-like domain of EngO, extending from position 190 to 278, exhibited 40% sequence identity with CelE of Clostridium cellulolyticum (14), 38% identity with CelD of Cytophaga hutchinsonii (GenBank access number ZP_00307924), 32% identity with Ced1 of Butyrivibrio fibrisolvens (3), and 32% identity with CelD of Microbulbifer degradans (GenBank access number ZP_00315637). Some enzymes in family 9 cellulases show a tight association of an Ig domain with the catalytic domain. For example, EngK and EngM of C. cellulovorans contain an Ig-like domain at the N-terminal side of the catalytic domain. In the earlier classification of cellulases, the enzymes in family 9 of the present classification had been compiled into two subfamilies, on the basis of amino acid sequence similarities (35). Most enzymes in one subfamily show a tight association of an Ig-like domain with a catalytic domain; e.g., EngK and EngM of C. cellulovorans and CbhA, CelD, CelJ, and CelK of C. thermocellum (29) contain an Ig-like domain at the N-terminal side of the catalytic domain.
The catalytic domain in EngO is a family 9 glycosyl hydrolase domain (435 amino acid residues) located in the C terminus of the peptide, with 38% sequence identity with CelA of Alicyclobacillus acidocaldarius (11), 36% identity with Ced1 of B. fibrisolvens (3), 35% identity with CelD of C. thermocellum (25), and 35% identity with EGC of Fibrobacter succinogenes (GenBank access number AAC41523). The catalytic domains of these enzymes are highly conserved in nine regions and presumably serve to form the overall structure of these enzymes, suggesting that these endoglucanases have diverged from a common evolutionary ancestor. Family 9 cellulases catalyze the hydrolysis of ß-1,4 linkages with the inversion of anomeric carbon configuration. The structure of the catalytic domain in this family consists of an (
/
)6-helical barrel (26, 37). Two aspartic acids, a histidine, and a glutamic acid, which have been confirmed as the catalytic amino acids in C. thermocellum (49), were conserved in four identical regions as the nucleophile and the proton donor.
The domain organization of EngO (CBM4-Ig-GH9) compared with those of closely related cellulases such as C. cellulolyticum CelE (14), C. thermocellum CelK (27), and Streptomyces reticuli Cel1 (38) reveals that conservation among catalytic cores is greater than conservation among CBM4, indicating either a lower level of evolutionary pressure on these last domains or changes in function and specificities. In addition, EngO exhibits a common domain organization among family 9 endoglucanases such as EngK and EngM in C. cellulovorans, sharing the same modular structure except lacking the dockerin domain as a noncellulosomal enzyme.
Identification and characterization of EngO as a recombinant protein. The product of engO was recovered from the supernatant of E. coli EPI300 grown on LB plates by ammonium sulfate precipitation. The noncellulosomal protein EngO was purified from C. cellulovorans grown on Avicel-containing medium in a cellulose affinity column (HiLoad 26/60 Superdex 200 prep grade column) as described previously (22). We analyzed the recombinant rEngO protein by zymograms and compared it with the native noncellulosomal EngO. The rEngO gave one major band in the zymogram with CMC, and the molecular mass of the enzyme was estimated to be 80 kDa (Fig. 3B, lane 2), which is similar in size to that of the mature native EngO deduced from the nucleotide sequence (79,474 Da). A band with an apparent molecular mass of 80 kDa was detected in the noncellulosomal proteins purified from C. cellulovorans (Fig. 3A and B, lanes 1). The size of the protein was in good agreement with that of the full-length EngO produced by recombinant E. coli and the size calculated from the deduced amino acid sequence. The mass spectrometry technique (22) was also used for the identification of proteins separated by SDS-PAGE. The mass spectra of EngO showed the amino acid sequences of tryptic peptides AYVDGQINTTSAVQSISK, AAFGDDFNIPESGNGVSDVLDEVK, NSLGFSFVSGYGTDYLK, and VVSTATNEVVYTGNIEGCR, which perfectly matched the deduced sequence of EngO. The tryptic peptides were found on the deduced entire (N- to C-terminal) sequences of EngO. The profiles based on SDS-PAGE, mass spectrometry, and zymogram analysis suggest that EngO is the major noncellulosomal endoglucanase.
![]() View larger version (118K): [in a new window] |
FIG. 3. Expression of EngO in C. cellulovorans and E. coli. (A) Gels stained with Coomassie blue. (B) Gels stained for CMCase activity. Lanes: M, protein mass standards; 1, noncellulosomal proteins of C. cellulovorans; 2, supernatant of recombinant E. coli; 3, supernatant of knockout EngO recombinant E. coli. The oblong circles represent the putative EngO (79,474 Da).
|
![]() View larger version (41K): [in a new window] |
FIG. 4. Thin-layer chromatography of hydrolysis products from oligosaccharides and CMC. The oligosaccharides and CMC (1 mg) were incubated with enzyme (1 µg) for 16 h, and the hydrolysates were analyzed by thin-layer chromatography. S, authentic oligosaccharides; SC, CMC reaction mixture with knockout recombinant EngO; G1, glucose; G2, cellobiose; G3, cellotriose; G4, cellotetraose; G5, cellopentaose; C, CMC.
|
Present address: Research Institute of Innovative Technology for the Earth, 9-2 Kizugawadai, Kizu, Soraku, Kyoto 619-0292, Japan. ![]()
|
|
|---|
A) factor and RNA polymerase with promoters. Biochimie 74:601-612.[Medline]
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»