Previous Article | Next Article ![]()
Journal of Bacteriology, July 2006, p. 4601-4609, Vol. 188, No. 13
0021-9193/06/$08.00+0 doi:10.1128/JB.01787-05
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, Massachusetts 02543
Received 23 November 2005/ Accepted 14 April 2006
|
|
|---|
|
|
|---|
The taxonomic position of Shewanella bacteria has been refined over time. Early names applied to the organism were Pseudomonas putrefaciens and Alteromonas putrefaciens. Shewanella was recognized as a species in 1985 (15). Classification by 16S RNA sequences revealed its relationship to other aquatic organisms (25). Most recently, among Alberomonas-like bacteria, the genus Shewanella was placed within a new family, Shewanellaceae (10, 11).
Some of the interest in the organism stems from its ability to reduce metals and metal oxides in the environment. This has raised the possibility that shewanellae could serve as decontaminating agents in the environment (2, 18, 19, 24). Also, these bacteria can cause food spoilage and can act as opportunistic pathogens (3, 12) and thus are of some interest to the food industry and medicine.
Recently the S. oneidensis MR-1 genome was fully sequenced and its gene products were annotated (9). The early annotation has been extended since (6) and continues to be studied (13). In this work, we have placed predicted enzymes in pathways of intermediary metabolism for carbon and energy utilization in order to gain a picture from the genetic point of view of the metabolic capacities of the bacterium and to relate them to current experimental knowledge (22).
Information from the genome sequence was used to predict the presence of enzymes of carbon source metabolism. Before doing sequence comparisons, we identified fused genes in the S. oneidensis MR-1 genome and divided them so all gene sequences encode single proteins only (23). The list of all S. oneidensis MR-1 unimodular protein sequences has been compared to the protein sequences of 107 other organisms using the Darwin AllAllDb program (7). We have described previously the particular suitability of the Darwin analysis for sequence annotation (14).
Unlike the situation for most microorganisms whose genomes have been sequenced, there is a modest body of experimental information on some of the phenotypic characteristics of S. oneidensis MR-1. Thus, we have the opportunity to relate sequence-based predictions for Shewanella to existing experimental information about the organism. Do the sequence annotations for enzymes reflect experimentally known metabolic characteristics? In a few cases, experimentally derived information does not agree with the predictions. In other cases, information on expression would be required to relate phenotypic information to genomic results.
Some cautions and modest improvements of methods for pathway prediction have emerged in the process of genomic annotation of central metabolism of a Shewanella strain. Annotations of protein sequences are only predictions. We realize that there are limits to the accuracy of predicting metabolic properties by sequence comparisons. We describe steps taken to eliminate some sources of artifact from the analysis. We minimize predictions of pathways when evidence is weakened by the multiple uses of some enzymes in a cell.
Relating gene and protein sequences of any one organism to proteins of other organisms can only tell us about similarities or variations on metabolic themes already known to us through experimental work in other organisms. No completely new enzymatic functions or pathways will be revealed by current methods of gene annotation.
|
|
|---|
Genomic sequences for 107 microorganisms were obtained from the NCBI RefSeq web site (www.ncbi.nih.gov/RefSeq/) by ftp (ftp.ncbi.nih.gov/genomes/Bacteria/). Sequences of all proteins of the 107 genomes, predicted and known, were compared to sequences of all S. oneidensis MR-1 proteins. Data for S. oneidensis MR-1 genes were from the NCBI RefSeq database NC_004347.1 except that with further study some annotations have changed to greater specificity (unpublished data). A tab-delimited text table listing all genes for predicted enzymes of S. oneidensis MR-1 is available (see Table S1 in the supplemental material). For enzymes having sequence-similar alignments with homologs of at least 83 residues and occupying more than 45% of the sequence of both proteins, Pam values are reported. Table columns give the gene identification (ID), gene name, enzyme name or partial information, GenBank ID for the best sequence match, organism with best match, and Pam value of best match. EC numbers are included for the enzymes addressed in the paper as well as PubMed IDs for experimentally verified functions.
For the work reported here, sequence pairs were extracted from the totality of less stringent data collected: those pairs that had alignment lengths of at least 83 amino acids and distances of 125 Pam units or less. We chose the length requirement of 83 residues as it improves the significance of the sequence alignments for the more distantly related protein pairs (1). The requirement for at least 83 residues also avoids a class of commonly occurring protein domains smaller than 83 residues that appear widely in many otherwise unrelated proteins (such as small binding sites for a type of substrate, cofactor, or regulator). In addition, for this study we removed proteins directly involved in horizontal gene transfer (IS proteins, transposases, and known prophage components) from the data set.
Pathway reconstruction and vetting incomplete pathways. S. oneidensis MR-1 proteins annotated as enzymes were examined one by one and placed in known microbial pathways. Pathways that were experimentally determined in another organism were deemed to be present in S. oneidensis MR-1 by the presence of sequence similarity for all component enzymes. Pathways for which not all enzymes were found required careful inspection. It is a fact that some enzymes occur in more than one pathway. In the cases of pathways only partially filled with homologs, we noted which of the enzymes present were already being used in other complete pathways in the cell. These have little or no significance in establishing the presence of the second pathway. Evidence for the presence of enzymes unique to the second pathway is necessary before designating them as present. We did not lower thresholds of similarity in an extended attempt to find enzymes for the holes in such tenuous pathways. Following this rule eliminated many seemingly partial pathways that we believe are not in fact present in S. oneidensis MR-1.
Comparison with experimental information. We have compared experimental results on central metabolism of S. oneidensis MR-1 with the predictions based on sequence similarity to known enzymes. Besides published work, unpublished results have kindly been made available to us as a private communication from K. E. Nealson (noted in the text as K.N.) and extensive Biolog data have been shared by J. Klappenbach and J. Tiedje. Phenotypic results, published and unpublished, are not completely consistent for this organism from different laboratories at different times, but there is agreement on the major characteristics.
|
|
|---|
Most closely related organisms. To identify the organisms most closely related to S. oneidensis MR-1 among those tested, we counted the number of proteins having matches with a Pam value less than 75 (a stringent threshold). The results, listed in column 1 of Table 1, identify Yersinia and Vibrio spp. as most similar to S. oneidensis MR-1. Both Yersinia and Vibrio spp. are aquatic organisms like Shewanella. Both are major pathogens, the meaning of which is not clear at present as shewanellae are known only to be opportunistic pathogens. In this context, the proteins of the opportunistic pathogen Pseudomonas aeruginosa and the pathogen Pasteurella multocida have many similarities to those in S. oneidensis MR-1. Nevertheless, it seems clear that the aqueous habitats (marine, riverine, and estuararian) of the high-ranking Vibrio and Yersinia spp. may confer the most important physiological similarities that are reflected in the level of genome sequence similarity.
|
View this table: [in a new window] |
TABLE 1. Similarity of genes in other genomes to those of Shewanella oneidensis MR-1
|
Although E. coli strains ranked low in numbers of best matches to S. oneidensis MR-1 coding sequences, nevertheless the E. coli data were vital to interpretation of the results. E. coli among all organisms tested had the most information on gene products that have been experimentally characterized, not simply predicted by sequence similarity
Metabolism pathwaysoverview. The subset of orthologs with the best Pam scores from any of the genomes as well as all significant orthologs in E. coli K-12, whether the best or not, were collected for all the enzymes of central metabolism. We grouped enzymes by pathway, some enzymes appearing more than once if present in more than one pathway. We found, based on sequence similarity, that S. oneidensis MR-1 like E. coli has complete pathways for the biosynthesis of all amino acids, nucleotides, and cofactors. However, utilization of compounds as carbon and energy sources was more restricted than in E. coli, and in some instances the pathways were not those of E. coli but of other organisms such as P. aeruginosa.
Carbon source and energy metabolism by respiration. Homologous genes for enzymes of main carbon utilization and central metabolism are shown in Fig. 1. Convincing support for the major pathways of carbon utilization and multipurpose enzymes of central metabolism is present. Sequences are present for all enzymes of the pentose pathway, the pyruvate dehydrogenase complex, all enzymes of the tricarboxylic acid cycle, the glyoxylate bypass, the Entner-Douderoff pathway, major anaplerotic reactions, and other enzymes of central carbohydrate metabolism. Note that by sequence similarity an essential enzyme of glycolysis, 6-phosphofructokinase, is not found. In agreement, it is known experimentally that S. oneidensis MR-1 does not use glucose as a carbon and energy source (Biolog; K.N.). The critical glycolytic enzyme 6-phosphofructokinase was not found in cell extracts (22). Consistent with the inability to use glucose for growth, we find that the sequences for enzymes for the feeder reactions that convert six-carbon carbohydrates to glucose that are present in E. coli are not found in S. oneidensis MR-1 (see Table S2 in the supplemental material). Among E. coli enzymes used in the conversion of 15 compounds, we only detected S. oneidensis MR-1 homologs for galactokinase and ribokinase. The presence of these two kinases does not reflect utilization of galactose or ribose for carbon and energy; rather they are conversions in the course of intermediary metabolism. The predicted enzyme composition of S. oneidensis MR-1 in this respectshy on enzymes for utilization of five- and six-carbon carbohydratesis consistent with known phenotypic characterization (25).
![]() View larger version (15K): [in a new window] |
FIG. 1. Carbon source metabolism in S. oneidensis MR-1. The intermediary metabolic steps of glycolysis, tricarboxylic acid (TCA) cycle, glyoxalate bypass, the pentose phosphate, and the Entner-Doudoroff pathways are shown. Enzymes are shown by their EC number followed by the S. oneidensis locus tag (SO number) of the gene predicted to encode the respective activity. The function predictions were based on sequence similarity to proteins with experimentally verified functions. Predicted isozymes are shown as SO numbers separated by a comma. Enzyme complexes are indicated by SO numbers separated by a forward slash. Predictions were made for all steps, except for that of 6-phosphofructokinase, EC 2.7.1.11.
|
Instead of six-carbon sugars, S. oneidensis MR-1 prefers three-carbon carbohydrates for growth. Experimentally, L-lactate, pyruvate, and acetate are among compounds utilized as sources of carbon and energy (22, 25; Biolog data). Agreeing with the experiment, genes are present for the enzymes for metabolism of pyruvate and acetate and other two- and three-carbon molecules (Fig. 2). However, the genetic picture seemingly does not agree for L-lactate, which does in fact support growth, but no homolog for a currently sequenced bacterial L-lactate dehydrogenase is found in S. oneidensis MR-1. A homolog for an unrelated fermentative D-lactate dehydrogenase gene is present, although S. oneidensis MR-1 is not considered a fermentative organism and is unable to ferment glucose (22, 25). Since S. oneidensis MR-1 is usually grown with L-lactate as a carbon and energy source, the absence of a homolog for L-lactate dehydrogenase is puzzling. Perhaps there is a different kind of lactate dehydrogenase not represented in current sequence databases. We note that two genes, designated by locus tags SO1520 and SO1521, are located adjacent to the lactate permease gene. One codes for an iron-sulfur protein, and the product of the other contains a domain for binding flavin adenine dinucleotide and has similarity to a glycolate dehydrogenase. Could these CDS represent a different kind of prokaryotic L-lactate dehydrogenase, a dimer that contains iron sulfur center(s) and flavin adenine dinucleotide bound to separate subunits? Experimental exploration of the possibility could be fruitful.
![]() View larger version (17K): [in a new window] |
FIG. 2. Enzymes in S. oneidensis MR-1 for utilization of one-, two-, and three-carbon compounds. Enzymes involved in the degradation of compounds with one carbon (formate), two carbons (acetate, ethanol), and three carbons (propionate, glycerol, lactate, and pyruvate) are shown by their names and predicted S. oneidensis locus tags (SO number). The function predictions were based on sequence similarity to proteins with experimentally verified functions. Predicted isozymes are shown as SO numbers separated by a comma. Enzyme complexes are indicated by SO numbers separated by a forward slash. TCA, tricarboxylic acid.
|
S. oneidensis MR-1 uses a C-1 compound, formate, as an energy source (22). Standing out from a plethora of genes concerning formate are genes for three formate dehydrogenase operons and one hydrogenase operon found in the genome. The three clusters are similar to those known for three-subunit formate dehydrogenase enzymes that participate in respiration, using either oxygen as an electron acceptor or nitrate. The three clusters are SO0101 to -0103 plus the accessory protein product of SO0107; SO4509 to -4511, and SO4513 to -4515 plus the accessory protein product of SO4503. The hydrogenase SO2097, -2098, and -2099 genes code for the subunits of a quinone-reactive type of hydrogenase known to work together with respiratory formate dehydrogeases. Adjacent to the hydrogenase operon is a contiguous set of genes coding for hydrogenase accessory proteins (SO2089 to SO2096). Absence of the fermentative type of formate dehydrogenase in complex with hydrogenase to make the formate hydrogen lyase complex is consistent with the observation that S. oneidensis MR-1 is not a fermentative organism. The genes encoding for pyruvate formate-lyase and its activator were found (SO2912 and -2913). Although this enzyme usually is associated with fermentative metabolism, it may be supplying S. oneidensis with C-1 (formate) units.
The CO2 produced by formate oxidation cannot be fixed by S. oneidensis MR-1 as no homologs are found for any known enzymes for CO2 fixation. Consideration has been given to the possibility S. oneidensis MR-1 is a facultative methylotroph with the cyclic C-1 serine pathway. Some of the enzymes for the cyclic serine pathway are present but, by stringent sequence comparisons, not all. An alternative to the classical serine pathway that uses only enzymes of general metabolism has been proposed (22). However, this version may not be able to effect net incorporation of C-1 moieties into metabolism. Since all of the enzymes in this scheme are also required for other pathways, their presence in MR-1 is ambiguous.
As to the classical C-1 cyclic serine pathway, almost all needed genes are present in S. oneidensis MR-1. It appears to be able to activate both formate and formaldehyde substrates by enzymes of tetrahydrofolate metabolism, converting them into the derivative N-5,10-methylenetetrahydrofolate (methyleneTHF) (4, 16). The methyleneTHF enters the cyclic C-1 serine pathway by donating the C-1 moiety to combine with glycine producing serine, resulting ultimately in capture of carbon as acetyl-coenzyme A (CoA) and regeneration of the C-1 acceptor molecule glycine for the next cycle of capture. There are two unique enzymes that could identify the pathway: malate-CoA ligase and malyl-CoA lyase (Fig. 3). Searching for sequences for the two unique enzymes in this pathway did not reveal homologs in S. oneidensis MR-1 within the threshold of significance applied in this work. However, we note that enzymes are present for similar reactions such that broad substrate specificity could enable the enzymes to encompass the activities of the two missing reactions. The succinyl-CoA ligase (SO1932 and SO1933) might serve as the malyl-CoA ligase if it also had the ability to bind malate. As a possible substitute for the malyl-CoA lyase, the S. oneidensis MR-1 genome has a close homolog of the gene for 3-hydroxy-3-methylglutaryl-CoA lyase (SO1893). Could this enzyme, if it has broad substrate specificity, also serve as the malyl-CoA lyase? 3-Hydroxy-3-methylglutarate and malate are chemically related compounds. Experimental characterization of the proteins of SO1893 and SO1932 to -1933 could establish whether either possibility is the case. Otherwise Shewanella spp. could employ an as yet unknown pathway as a recycling mode for capturing carbon when growing on formate or formaldehyde as a carbon source (5). Biochemical investigation is needed.
![]() View larger version (13K): [in a new window] |
FIG. 3. Classical C-1 serine pathway enzymes in S. oneidensis MR-1. Prediction of S. oneidensis MR-1 enzymes involved in the growth and assimilation of one-carbon compounds via the serine pathway are shown. Function predictions were made based on sequence matches to proteins with experimentally verified functions. Predicted enzyme complexes are indicated by SO numbers separated by a forward slash. Significant sequence matches to the two key enzymes in the pathway, malyl-CoA lyase (EC 4.1.3.24) and malate-CoA ligase (EC 6.2.1.9), were not found. Tentative assignments for these reactions were made to S. oneidensis MR-1 gene products catalyzing similar reactions.
|
Ability to utilize amino acids as carbon sources is also widespread in S. oneidensis MR-1 (21). Sequence similarity was found to the genes coding for the enzymes of the pathways for utilization of amino acids as they are known in E. coli. In addition, in S. oneidensis MR-1, some of the amino acids are broken down by pathways found in other organisms. These are shown in Fig. 4. The ability to utilize histidine is not by the pathway found in E. coli K-12 but is like the transformations in Bacillus subtilis. The ability to utilize aromatic amino acids is not by the pathway in E. coli, but has similarities to part of the protocatechuate pathway as found in pseudomonads. Genes for the enzymes of the latter steps of the protocatechuate meta-pathway are present in S. oneidensis MR-1, but only in the steps after the intermediate
-carboxymuconolactone. Utilization of aromatic compounds may have unique early steps in S. oneidensis MR-1 that join the known pathway at the carboxymuconolactone step. S. oneidensis MR-1 also degrades leucine, isoleucine, and valine not simply by transamination as in E. coli but rather by pathways used in pseudomonads. Methionine is converted to 2-oxobutanoate as in pseudomonads.
![]() View larger version (14K): [in a new window] |
FIG. 4. Amino acid utilization in S. oneidensis MR-1 by other than E. coli pathways. S. oneidensis MR-1 enzymes predicted to be involved in the degradation of amino acids are shown by their locus tags (SO numbers) and the EC number of the reaction. Gene predictions are based on sequence similarities to known enzymes. Predicted isozymes are shown as SO numbers separated by a comma. Enzyme complexes are indicated by SO numbers separated by a forward slash. Only pathways not utilized by E. coli are included in the figure.
|
Fatty acids are also used as carbon and energy sources. Homologs for genes coding for enzymes for breakdown of fatty acids are found. There are multiple copies of some of the key enzymes in these pathways. Other compounds found by Biolog testing to be utilized as carbon sources are
-ketobutyric acid, methylpyruvate, and D-lactate methyl ester, each readily convertible to mainline metabolites. Also giving positive Biolog results is N-acetylglucosamine, for which all sequences for the enzymes of the degradative pathway are present in the genome.
Experimental information on nucleoside utilization is interesting. Biolog and laboratory (K.N.) results for utilization of the nucleosides uridine, adenosine, 2'-deoxyadenosine, inosine, and, to a lesser extent, thymidine are positive. In contrast, the Biolog results for the bases cytosine, thymine, adenine, and guanine are poor to negative. We see that sequences for nucleoside hydrolases are present in S. oneidensis MR-1 that could split the nucleosides to their bases and ribose-5-phosphate for further utilization. However, since Biolog data indicate that the bases are not utilized, evidently it is primarily the ribose-5-phosphate moiety that is supporting growth when nucleosides are provided. Again, further experimentation is required to establish this interpretation.
Aerobic and anaerobic respiration. With respect to respiratory electron transfer pathways, electron transfer elements seem to be emphasized in the S. oneidensis MR-1 genome. Pathways for synthesis of heme, siroheme, menaquinone, and unbiquinone are intact. S. oneidensis MR-1 is believed to contain 42 cytochrome c-type proteins (17), whereas by comparison there are only 5 in E. coli.
Shewanella spp. are well known for their use in anaerobic respiration of many electron acceptors other than oxygen, the most unique being metals and metallic compounds (2, 18, 25). In agreement, genes have been found by sequence similarity to code for reduction of nitrate, nitrite, sulfate, sulfur, trimethylamine N-oxide (TMAO), dimethyl sulfoxide, and fumarate reductase enzymes corresponding to many of the metabolites used as anaerobic respiration acceptors. Enzymes for reduction of Fe ions are known, but reductases for the numerous metallic compounds that serve as receptors have yet to be identified. However, there is no dearth of possibilities in the genome sequences. We found many candidate uncharacterized oxidoreductases.
Fermentation. S. oneidensis MR-1 does not ferment (22, 25), yet many enzymes of mixed acid fermentation seem to be present in the genome (Fig. 5). In fermentation, choices include multiple paths from lactate or pyruvate to ethanol or acetate and conversion of acetate to acetyl-CoA or acetylphosphate. Other pathways lead from formate to production of H2 and CO2. Although all enzymes of mixed acid fermentation are present, there is no evidence it does exist. Support for a pathway does not exist if all enzymes are known to be active in other pathways (Table 2). In some cases, such as with the cyclic C-1 serine pathway, there is at least one unique enzyme in the pathway. Designation of the pathway then hangs on the presence or absence of any unique member. Table 2 gives examples of pathways that use many or all of its enzymes that are present in other pathways. Note that all of the enzymes used in fermentation pathways are also used in other known pathways. Thus, there is no indication from genomic analysis whether fermentation is used or not by this organism.
![]() View larger version (15K): [in a new window] |
FIG. 5. Enzymes of mixed acid fermentation in S. oneidensis MR-1. Sequence matches for S. oneidensis MR-1 gene products (SO numbers) to the enzymes used by E. coli for mixed acid fermentation are shown. Significant similarities to all enzymes except pyruvate oxidase (EC 1.2.2.2) were found. Predicted isozymes are separated by commas. Reactions predicted to be carried out by enzyme complexes are shown by SO numbers separated by forward slashes.
|
|
View this table: [in a new window] |
TABLE 2. Some pathways that use enzymes in common with other pathways
|
Summary of general considerations of pathway prediction. We have emphasized that there are caveats for pathway prediction which we applied in this work. A common approach to connecting information about enzyme homologs with metabolic pathways is to list the enzymes that seem to be present for any and all pathways. By assessing pathways one by one, if all enzymes are present one asserts the pathway is present in the organism. If not all enzymes are present, there is a temptation to look for the missing enzymes, relaxing thresholds of sequence similarity in an effort to complete the pathway list. However, it is important first to ask whether any of the enzymes in a partial pathway are used in other pathways in the cell. If all enzymes of a pathway with "holes" are used in other pathways, then there is no evidence for existence of the partial pathway. For some pathways, there is at least one unique enzyme in the pathway that is not used in any other known pathway in the cell. If a homolog for the unique enzyme is found, then it does seem safe to assert that pathway is present, even if not all other enzymes have yet been found. Illustrations are given in Table 2. In the classical cyclic C-1 serine pathway, two enzymes are unique. However, for mixed acid fermentation, all enzymes are also used in general metabolism; none are unique to the fermentation. Therefore, one cannot know without experimentation whether any of the fermentations occur.
In asserting a metabolic capability of any kind by genomic analysis, one needs to keep in mind that not all pathways in the biological world are yet known, so failure to find standard ways to convert one substance to another need not mean the biological capability is absent in the organism. An example is evidence in S. oneidensis MR-1 for only part of the protocatechuate pathway of aromatic compound degradation.
In all predictions of enzyme activity, it goes without saying that sequence similarity does not guarantee biological activity. Mutations may be present that produce an inactive enzyme product. We have tried to demur with assertions like "appears to be present" since we cannot know whether a homolog that is not identical to an active enzyme is in fact capable of activity.
Concluding remarks. Identification of homologs to enzymes of metabolism has allowed us to assemble a picture of central carbon metabolism in S. oneidensis MR-1. Utilization of carbohydrates larger than three carbons is not well supported by sequence similarity to appropriate genes and enzymes. Utilization of three-carbon carbohydrates and smaller has genetic support. Capability for utilization of amino acids and fatty acids as carbon sources is also present. Basic biosynthetic enzymes are present. Most pathways were similar to those in E. coli even though the closest homologs by sequence analysis were not often found in E. coli. But these pathways presumably are highly similar in the most closely related gammaproteobacteria, Vibrio spp. and Yersinia spp. Exceptions were found for some pathways for utilization of amino acids that were similar to those of pseudomonads or bacilli.
There are opportunities for connecting study of the physiology and metabolism of S. oneidensis MR-1 in the laboratory with questions of the kind raised here about operation of particular pathways and the properties of particular enzymes. Each of the questions posed here provides an opportunity for investigation that has the potential of yielding new knowledge about microbial metabolism. This would seem to be a useful outcome of sequence similarity analysis of genomic DNA.
This work was supported by the Office of Science (BER), U.S. Department of Energy, grant no. DE-FG02-01ER63202.
Supplemental material for this article may be found at http://jb.asm.org/. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»