Previous Article | Next Article ![]()
Journal of Bacteriology, April 2008, p. 2790-2803, Vol. 190, No. 8
0021-9193/08/$08.00+0 doi:10.1128/JB.01583-07
Copyright © 2008, American Society for Microbiology. All Rights Reserved.
,
Jacek Pucha
ka,2,
Kimberly E. Fryer,1
Vítor A. P. Martins dos Santos,2,
* and
Jason A. Papin1,
*
Department of Biomedical Engineering, University of Virginia Health System, Box 800759, Charlottesville, Virginia 22908,1 Helmholtz Center for Infection Research (HZI), Inhoffenstrasse 7, D-38124 Braunschweig, Germany2
Received 30 September 2007/ Accepted 3 January 2008
|
|
|---|
|
|
|---|
Pseudomonas aeruginosa is a ubiquitous gram-negative bacterium that is capable of surviving in a broad range of natural environments, although it is mostly known for its role as an opportunistic pathogen (40, 60, 72). While P. aeruginosa is generally found in aerobic environments, it is able to thrive anoxically and, notably, to denitrify (58). It was also recently shown that P. aeruginosa can form biofilms under microaerobic (i.e., very low oxygen) conditions similar to those found in the lungs of cystic fibrosis (CF) patients (1). These features strongly contribute to the notable success of P. aeruginosa in chronically infecting the lungs of CF patients, nearly all of whom have lifelong P. aeruginosa infections starting at an early age (49, 70). P. aeruginosa is also a serious pathogen in nosocomial infections and various acute infections in immunocompromised patients, such as severe burns and urinary tract infections (39, 49, 68). Part of the reason for this remarkable ecological success is thought to be the considerable metabolic versatility and flexibility of P. aeruginosa (62), which renders the study of the metabolism of this life-threatening microbe crucial to the understanding of its pathogenicity and opportunistic nature.
We present a genome-scale metabolic reconstruction of P. aeruginosa PAO1 (hereafter referred to as PAO1), called in silico strain iMO1056 following an often-used naming convention (48). This model accounts for 1,056 genes encoding 1,030 proteins that catalyze 883 reactions (Fig. 1a). Gene-protein-reaction (GPR) associations are accounted for in the reconstruction, as well as the stoichiometry and thermodynamically derived directionality of all included reactions. This reconstruction process led to the reannotation of several open reading frames (ORFs). The model was tested against high-throughput substrate utilization experiments (90% matched the usage of common substrates) and two published sets of genome-wide knockout data (85% accuracy of essentiality predictions). Several additional predictions were made regarding PAO1 physiology and virulence, and these may be tested subsequently. This genome-scale reconstruction and subsequent constraint-based modeling enable integration of high-throughput data to generate novel, testable hypotheses that will assist in exploring the physiology of PAO1 and assessing its relevance for pathology.
![]() View larger version (37K): [in a new window] |
FIG. 1. Reconstruction statistics. (a) Statistics on the P. aeruginosa PAO1 genome and the iMO1056 reconstruction. *, see reference 62; **, reaction confidences were based on the scale for protein name confidences set out at www.pseudomonas.com. This scale is described in Materials and Methods. The confidence value for the PAO1 genome is from a version (19 June 2007) of the annotation at www.pseudomonas.com. (b) Numbers of genes participating in different metabolic processes in the iMO1056 metabolic reconstruction.
|
|
|
|---|
![]() View larger version (52K): [in a new window] |
FIG. 2. Schematic of the network reconstruction process for iMO1056. Resources used for the reconstruction are shown on the right, and the reconstruction process is shown on the left. (1) The P. aeruginosa annotation was used along with online resources to generate an initial reconstruction of metabolism. (2) Next, computational analysis was used in conjunction with extensive literature mining and online database searches to identify and fill gaps in metabolism. (3) The model was then validated with data sets from the literature and experimental data generated as a part of this study. (4) The current model version, iMO1056, is now ready for the next step, in which the model will continually be improved through cycles of computational hypothesis generation followed by experimental validation.
|
Confidence classes for reactions were assigned based on the PseudoCAP protein name confidence rating system (69), which rates the confidences of gene-protein associations in the PAO1 annotation. We extended the system to rating confidences for reaction associations. There are four levels in this rating system, as follows: class 1, which accounts for genes whose functions have been demonstrated experimentally with PAO1; class 2, which accounts for genes whose functions were inferred via homology with an experimentally characterized protein in another organism; class 3, which represents gene-protein associations based on the presence of conserved amino acid motifs; and class 4, which accounts for genes with similarity to other genes of unknown function. We adhered to this rating system for genes whose functions were newly annotated in this study (see Table 2, "proposed confidence" column), and for genes which were not newly annotated, we assigned reaction confidences equivalent to the protein name confidences in the annotation.
|
View this table: [in a new window] |
TABLE 2. Proposed annotation refinements
|
![]() View larger version (22K): [in a new window] |
FIG. 3. FBA. (a) The conversion of a metabolic system into an S matrix is shown for a simple prototype. Columns represent reactions, and rows represent compounds. Reaction stoichiometries are represented with negative coefficients for substrates and positive coefficients for products of a given reaction. (b) (Top) Standard form of an FBA problem. Flux through the objective reaction (v5) is maximized subject to thermodynamic, stoichiometric, and uptake constraints, as follows: reaction fluxes (vj) must lie between lower and upper bounds (lb and ub), metabolite concentrations are fixed over time (S · v = 0), and the flux of a limiting carbon source (v4) is set to some uptake value. (Bottom) Flux values calculated by FBA for the prototype network are represented by arrows.
|
In many FBA applications, a linear biomass reaction is defined and assumed to be the objective toward which the flux distribution of a network is optimized. The biomass reaction represents a weighted ratio of components forming the dry weight of a cell, as well as hydrolysis of ATP to account for the energy needs involved in growth and cellular maintenance (66). Optimization for biomass is justified by the assumption that bacteria have been optimized evolutionarily for growth, and experimental studies have verified the rationality of this assumption (16, 55). This optimization step is necessary due to the underdetermined nature of metabolic networks, as a range of flux profiles are possible for a given network. FBA optimization is constrained by the stoichiometry of the network and by upper and lower bounds on reactions, as set by thermodynamic and substrate uptake constraints. In addition, conditional constraints, such as gene knockouts and alternative gene regulation, can be placed on the system to simulate experimental conditions (22, 59).
To perform FBA, the flux of the biomass reaction is maximized given the steady-state condition, as follows: S · v = 0, where S is the S matrix representing the network (Fig. 3a) and v is the vector of all fluxes in the network (Fig. 3b, top panel). Ultimately, FBA yields a predicted optimal growth yield and a flux distribution through the system that will support this optimal growth (Fig. 3b, bottom panel). See the supplemental material for a discussion of constraints used in FBA simulations in this study.
Biolog experiments. PAO1 was tested for the ability to utilize various carbon sources, using Biolog GN2 microplates (Biolog Inc., Hayward, CA). All procedures were performed as indicated by the manufacturer. Bacteria were grown overnight at 28°C on a Biolog universal growth agar plate. Samples were swabbed from the surface of the plate and suspended in GN inoculating fluid. Each well of the microplate was inoculated with 150 µl of bacterial suspension, and the plate was incubated at 28°C for 24 h. Subsequently, the microplate was read by a microplate reader, and the results were analyzed with MicroLog3 4.20 software.
The assay performed with the Biolog plate is a measurement of the reduction of tetrazolium dye after 24 h, indicating whether P. aeruginosa has been respiring actively during that period. Active respiration in minimal medium indicates utilization of the sole carbon source provided.
Technical computing methods. FBA optimizations were performed on a Dell computer with 2 GB of RAM and a 1.8-GHz Intel Centrino processor linked to a dedicated database server with Simpheny software (Genomatica, Inc.). Transposon knockout data were handled in Excel (Microsoft Corp.), and statistical analysis of transposon data was performed in Matlab R2006a (Mathworks, Inc.).
|
|
|---|
![]() View larger version (47K): [in a new window] |
FIG. 4. Reannotation processes. (a) Full network map of iMO1056. Metabolic processes are grouped together, and an effort was made to link pathways to their major carbon donors and sinks in central metabolism, although many metabolites occupy two or more locations on the map. All internal boxes are meant to highlight metabolic processes rather than to denote cellular localizations. The double line surrounding the map represents the cell barrier. (b, c, and d) Several types of reannotation that went into iMO1056. Large gray arrows denote the reannotation process: initial annotation statuses are shown before the arrows, and reannotated gene functions are shown after the arrows. The location of each newly annotated gene is highlighted on the map. (b) PA3004 was functionally identified by literature mining. (c) A gap in the KDO synthesis pathway was identified, and PA4457 was found by a BLAST search to fulfill the missing function. (d) NAD initially could not be synthesized anaerobically in iMO1056, since O2 was required for NAD synthesis. However, literature mining revealed a putative alternate stoichiometry for the L-aspartate oxidase reaction (ASPO8) that does not require oxygen.
|
|
View this table: [in a new window] |
TABLE 1. Virulence factors in P. aeruginosaa
|
The first level of annotation refinement is the assignment of a specific function to a previously uncharacterized gene based on literature evidence or a BLAST search. An example of this in iMO1056 is the gene mtnP (PA3004), which was annotated in PseudoCAP as a "probable nucleoside phosphorylase" but was subsequently found in the literature to have the specific function of a 5'-methylthioadenosine phosphorylase (Fig. 4b) (57). The mtnP gene was first identified (57) through detailed sequence analysis, and then its function was confirmed (57) by knocking out mtnP in a
metZ strain of P. aeruginosa and observing a loss of growth on minimal medium supplemented with methionine, homocysteine, or methylthioadenosine (57). mtnP is necessary for the function of the methionine salvage pathway in P. aeruginosa and was previously characterized in the literature but not in the P. aeruginosa annotation, and its inclusion in iMO1056 is an example of how the manual reconstruction process can benefit annotation efforts in a way that would not be immediately obvious from automatic model building. In this case, the reconstruction process led us to investigate the methionine salvage cycle and to uncover a reaction that was necessary for the pathway to function but which would have been difficult to pinpoint without a rational model-building approach.
The next level of gene annotation refinement is the assignment of a new function based on gap analysis. This type of annotation refinement occurs when a gap in a pathway renders model growth infeasible, and the missing reaction is subsequently identified by literature mining or a homology search. For example, the previously uncharacterized conserved hypothetical protein PA4457 was identified as an arabinose-5-phosphate isomerase gene (kdsD), which was lacking in the original PAO1 annotation (Fig. 4c). Gap analysis revealed a need for the production of D-arabinose-5-phosphate in the network to enable synthesis of 3-deoxy-D-manno-2-octulosonate (KDO). KDO links the backbone of the lipid A moiety to the core oligosaccharide of lipopolysaccharide and is essential for growth of PAO1 (20). A series of BLAST searches in the nonredundant NCBI database revealed that PA4457 (thus far annotated as a gene coding for a hypothetical protein) had 79% identity over a length of 326 amino acids to the protein KdsD carried by Pseudomonas fluorescens Pf-5 (score, 501; E value, 3 x 10–130). KdsD is an arabinose-5-phosphate isomerase, the enzyme that catalyzes the interconversion of D-arabinose 5-phosphate and D-ribulose 5-phosphate (38). This reannotation and its subsequent incorporation into the model enabled iMO1056 to grow in silico.
Gap analysis also enables the reconciliation of conflicting knowledge in the literature. An earlier version of the PAO1 reconstruction was unable to produce biomass in the absence of oxygen. It was discovered through gap analysis that the inability of iMO1056 to grow without oxygen was due to the oxygen consumption of the enzyme L-aspartate oxidase (NadB), which catalyzes the first step in the production of NAD. Literature mining revealed that while NadB has a strict requirement for O2 when assayed in vitro, it has been suggested that another electron acceptor might suffice in the absence of elemental oxygen in vivo (17). In the network reconstruction, we included an alternate reaction stoichiometry for NadB that had been suggested previously (17), and we thus eliminated the oxygen requirement for growth (Fig. 4d). Despite this literature analysis, however, it is noteworthy that knocking out nadB is not lethal for P. aeruginosa in vivo (10). This result suggests that another pathway for NAD biosynthesis likely exists. More detailed biochemical study is necessary to determine if there is another anaerobic path for NAD synthesis in PAO1 and whether NadB can truly function in the absence of oxygen.
Gap analysis was also used to refine the stoichiometry and directionality of several reactions that were incorrect or vague in online databases. For instance, the reaction catalyzed by the electron transport enzyme NADPH-quinone oxidoreductase (PA4975; EC 1.6.5.5) was found upon analysis of free ATP production to be necessarily irreversible, despite its being listed as reversible in the EXPASY database (19). Such network refinements would be difficult to systematize without a genome-scale model that accounts for the interdependency of reactions and metabolites.
Growth predictions. The in silico biomass yield of iMO1056 on glucose minimal medium under steady-state (continuous culture) conditions was calculated using FBA. In silico constraints used for minimal medium conditions are outlined in the supplemental material. The maximum specific growth rate was determined to be 1.048 h–1, with a specific glucose uptake rate of 10 mmol Glc · g dry weight–1 · h–1, giving an in silico biomass yield of 0.1048 g dry weight · (mmol Glc)–1. This value for in silico biomass yield falls within the range of experimentally determined values for biomass yield for P. aeruginosa under similar conditions, which were reported previously (67) as 0.094, 0.085, and 0.118 g dry weight · (mmol Glc)–1 at temperatures of 30°C, 38°C, and 41°C, respectively. This value for in silico biomass yield is also within 20% of the maximum biomass yield determined experimentally for P. aeruginosa ATCC 9027 under aerobic growth conditions in another study [0.088 g dry weight · (mmol Glc)–1] (7).
P. aeruginosa is renowned for its ability to survive by denitrification in anaerobic environments (13, 58). The pathway for denitrification of nitrate was included in the reconstruction, and iMO1056 is able to grow anaerobically, with an in silico biomass yield of 0.0846 g dry weight · (mmol Glc)–1, when nitrate is provided under glucose limitation. This in silico biomass yield is within 25% of an experimental value for maximum biomass yield reported previously for P. aeruginosa ATCC 9027 under anaerobic growth conditions [0.067 g dry weight · (mmol Glc)–1] (7). The ratio of maximum anaerobic yield to maximum aerobic yield reported previously (7) is 0.76, which differs from the ratio obtained in silico (0.81) by only 6%, indicating a good match between in silico versus in vivo (experimental) ratios of anaerobic to aerobic yield.
Phenotyping data. A Biolog study was performed for P. aeruginosa by cultivating PAO1 in minimal medium supplemented with various carbon sources (as described in Materials and Methods). Of the 95 carbon sources tested, 30 could be compared directly to in silico growth of iMO1056 (by assuming that substrate utilization indicates growth, which is usually but not always true). Disagreements between the Biolog data and the in silico growth simulations were used as hypotheses for refining the network reconstruction. The remaining 65 carbon sources included 15 that were present as intracellular metabolites in iMO1056 but for which there were no transporters assigned and 50 that are not currently accounted for in the iMO1056 reconstruction. Of these, 14 showed a growth phenotype in the Biolog data, indicating that incorporation of these compounds into the model might be a future step in iMO1056 model expansion.
Computational experiments were performed to determine the growth of iMO1056 on the 30 Biolog carbon sources that were directly testable with iMO1056 (Fig. 5a). The in silico growth experiments were performed in Simpheny by using FBA, as described in Materials and Methods. Disagreements between Biolog substrate utilization data and in silico growth simulations were investigated through gap analysis and literature mining, and these discrepancies were rectified where possible. In one case (L-leucine), the Biolog data were unclear about the growth of PAO1, so that data point in Fig. 5a is shaded lighter than those for the other carbon sources but is still considered to show a growth phenotype. After completion of pathways for substrates which initially conflicted with the Biolog data set, PAO1 viability matched iMO1056 viability under 27 of 30 conditions (Fig. 5a). This 90% match indicates that the core metabolism and various catabolic pathways of PAO1 were sufficiently reconstructed in iMO1056 for the model to properly predict catabolism of a variety of common substrates, including many amino acids and several common sugars.
![]() View larger version (26K): [in a new window] |
FIG. 5. Biolog validation. The results of a Biolog study of P. aeruginosa viability on different carbon sources in minimal medium were compared with in silico viability predictions for iMO1056 obtained via FBA. (a) Thirty-one carbon sources tested in Biolog were directly comparable with in silico predictions. The results of this comparison are shown here. An "X" indicates no growth, while shaded boxes in the Biolog column indicate that growth occurred. Lightly shaded boxes in the Biolog column indicate weak results that are close to the sensor threshold for growth and were included as a growth phenotype for analyses. (b) Fifteen carbon sources tested in Biolog are intracellular compounds in iMO1056 but did not sustain in silico growth since they lacked appropriate transporters. The number of carbon sources from this group belonging to each permutation of Biolog growth (yes or no) and in silico growth (yes or no) is represented in the table, as well as a brief description of what each combination of Biolog growth and in silico growth indicates about iMO1056 regarding the set of carbon sources.
|
Gene essentiality prediction and validation. A list of genes predicted to be essential for in silico growth of iMO1056 was compiled and compared with a set of in vivo essential genes (i.e., genes that are reported to be required for growth) from the literature. The set of in vivo essential genes encompasses genes deemed essential in both of two independent genome-scale transposon mutant studies of PAO1 (25, 35). This overlapping data set was chosen because it has a higher confidence value for gene essentiality than the candidate essentials from either transposon study individually, since the inclusion of two transposon studies increases coverage of the PAO1 genome and thus reduces the number of genes erroneously labeled essential. The set of in vivo essentials includes all genes in the PAO1 genome that were not knocked out in either of the two transposon studies. In silico gene essentiality predictions for iMO1056 were calculated in the Simpheny platform via FBA as previously described (27).
Genes in the in vivo essential set but not in the iMO1056 reconstruction were assumed to be involved either in nonmetabolic functions or in metabolic functions peripheral to central metabolism and were thus not included in our essentiality analysis. The total accuracy of essentiality predictions was 85% (Fig. 6a). This accuracy is comparable to the 94% accuracy achieved in a recent reconstruction of Bacillus subtilis, which also used a genome-scale knockout set for model validation (42). Of genes in the iMO1056 reconstruction, in silico essentiality predictions matched in vivo essentiality for 41% of in vivo essential genes and 91% of in vivo nonessential genes (Fig. 6a). Although the total accuracy of predictions is significant, these discrepancies (particularly in matching in vivo essentiality) deserve further discussion (as follows) and can serve as a collection of hypotheses that can subsequently be tested.
![]() View larger version (28K): [in a new window] |
FIG. 6. Gene essentiality comparison. FBA-derived (in silico) iMO1056 essentiality predictions were compared with an experimentally generated (in vivo) candidate essential gene set for P. aeruginosa. (a) Genes in iMO1056 were broken into the following four groups: true-positive (essential in vivo and in silico), true-negative (nonessential in vivo and in silico), false-positive (nonessential in vivo but essential in silico), and false-negative (essential in vivo but nonessential in silico) groups. The specific pathways represented in true-negative (b), true-positive (c), false-positive (d), and false-negative (e) groups are indicated. Numbers on pie charts indicate numbers of genes belonging to each group.
|
Vitamin and cofactor synthesis genes make up the largest number of genes in both the false-positive and false-negative categories (Fig. 6d and e), and they make up the second largest number of genes in the true-positive category as well (Fig. 6c). The high discrepancy between in silico and in vivo predictions of essentiality for vitamin and cofactor synthesis genes might be due in part to uncertainty about the vitamin and cofactor content of the LB medium on which the transposon mutants were grown. Since only trace amounts of vitamins are required for cell survival, it is possible that some essential vitamins were present at low concentrations in the LB medium but were not accounted for in our in silico LB medium. This is especially relevant considering previous analyses of yeast extract composition, which showed a high variance in vitamin concentration between extracts from different manufacturers (42; see the supplemental material). Furthermore, since the iMO1056 biomass reaction was based on that of E. coli, it is possible that nutritional differences between the two bacteria contribute to errant predictions about PAO1 vitamin and cofactor essentiality. It is therefore likely that growth experiments on better-defined media will be particularly helpful in elucidating the true essentiality of genes in vitamin and cofactor synthesis pathways.
Second, it is important that the in vivo gene essentiality predictions from the transposon mutant sets are not completely accurate in assessing gene essentiality. An example of a gene that was predicted to be essential in vivo despite in actuality being a nonessential gene is PA0555, encoding the enzyme fructose-1,6-biphosphate aldolase (FBPA). This gene was classified as essential in vivo because transposon mutants with a disrupted FBPA gene were not identified in either PAO1 transposon study, but previous literature shows that the FBPA knockout is not lethal for PAO1 (2). FBPA was correctly predicted to be nonessential in the in silico data set.
The in vivo PAO1 transposon studies used in this analysis discuss the possibility of errors in their candidate essential gene sets, and it was suggested in one of the studies that the number of candidate essential genes in that study (678 total) may overestimate the true number of essential genes by 40 to 50% (25). This overestimate prediction was based on an analysis of the number of ORFs that would likely escape transposon mutagenesis, assuming that all base pairs have equal susceptibility to transposon insertion, including essential genes. Due to its higher coverage of the PAO1 genome, the set of overlapping essentials from the two PAO1 studies should have a higher confidence than that for candidate essentials from either study alone, but the combined transposon set yields an in vivo candidate essential set that is reduced in size by only 34 genes, which does not nearly approximate the 40 to 50% discrepancy suggested (25). Hence, some of the mismatches between the in vivo essentials and the in silico essentials are likely due to overestimation of gene essentiality in the in vivo studies, which would translate into false-negative results in the in silico predictions of essentiality.
Additionally, in one of the transposon studies used in our analysis (35), transposons were reported to likely disrupt genes downstream within an operon, but in the other study (25), an outward-facing neomycin phosphotransferase promoter was inserted as part of the transposon in an attempt to reduce the disruption of downstream genes. It is somewhat unclear from the study, however, how successful the neomycin phosphotransferase promoter actually was in preventing disruption of downstream genes. We therefore undertook an analysis of whether genes downstream of transposon insertions are generally disrupted in vivo, a phenomenon that would cause some nonessential genes to be classified as essential in vivo due to transposons in those genes disrupting downstream essential genes. In order to investigate this phenomenon, we performed a statistical analysis of the number of in vivo essential genes <1,000 bp downstream of genes classified as giving false-negative results. Consistent with the reports for the two transposon studies, we determined that the false-negative results were significantly more likely to have essential genes downstream of them than a set of random genes from the in vivo study that reported polar disruption of downstream genes (indicating possible disruption of downstream essential genes by transposons) (35) but that false-negative results were not more likely to have essential genes downstream of them than a set of random genes from the in vivo study with the neomycin phosphotransferase promoter (indicating a lack of disruption of downstream essential genes by transposons) (25). The result for the in vivo study reporting polar disruption of downstream genes (35) might have been due to the study's much lower coverage of the genome than that of the other study (23% versus 88% of ORFs disrupted). Regardless, this analysis suggests that in a genome-scale transposon study, an outward-facing neomycin phosphotransferase promoter is effective in preventing disruption of downstream genes (see the supplemental material for more details).
|
|
|---|
Building a genome-scale reconstruction of PAO1 offered insights that would have been difficult to obtain through any other means, including evidence for annotation refinements, the ability to quantitatively predict growth yields and secretion rates under various conditions, and a potential way to probe metabolic stresses incurred by the production of various virulence factors that may be explored in future studies. During the model-building process, we were forced to approach each metabolic pathway quantitatively and comprehensively, so previous gaps in knowledge were highlighted and investigated in a rational manner. This process of gap analysis involves coupling in silico growth simulations with bioinformatic searches and literature mining to fill holes in otherwise known pathways and offers a unique tool for identifying areas of metabolism that require further elucidation. Through gap analysis of P. aeruginosa metabolism, we were able to refine the annotation of a number of genes with respect to their function, directionality, or stoichiometry. Some of these changes are highlighted in Table 2. These annotation refinements represent many crucial metabolic functions of P. aeruginosa, and without a systematic network-level approach to guide our analysis, it would have been difficult to highlight these weak points in current knowledge.
As in any genome-scale annotation effort, the network reconstruction process for P. aeruginosa will require continued work in the future. Mismatches between in vivo and in silico essential genes have highlighted metabolic regions of uncertainty in current knowledge, and these discrepancies still require more work to be explained adequately. In addition, several pathways in the model remain incomplete due to a lack of available knowledge. These gaps include synthesis of cofactors (e.g., cobalamin and thiamine) and, notably, the entire fatty acid β-oxidation pathway, which exists in but has not been characterized extensively for pseudomonads, to our knowledge (52). The Biolog data also indicated some pathways that should be investigated further in the future, such as substrates that PAO1 utilized but which are not in iMO1056 or for which iMO1056 has no membrane transporter. Furthermore, some pathways were not included in the model despite having been studied in the literature, simply because they were peripheral to the main processes of interest in this study (growth and expression of virulence traits). Thus, for instance, pathways for degrading aromatic compounds and for handling xenobiotics were generally not included in the model, and these offer areas for model expansion in the future. Specific validation involving controlled growth experiments with P. aeruginosa will also be informative, and coupling wet lab and in silico experiments will lend greater insight into both specific functions and global properties of P. aeruginosa metabolism.
Additionally, a genome-scale reconstruction of P. aeruginosa metabolism enables the interrogation of several otherwise difficult research questions. For instance, it would be informative to compare the P. aeruginosa metabolic network with metabolic networks of other related but nonpathogenic species. This comparison would allow probing of properties such as pathway redundancy and growth burden of key virulence pathways and would offer insight into how these system-level properties might affect pathogenicity. Such a genome-scale network analysis between a pathogen and a nonpathogen has never been done, to our knowledge, and could provide significant insight into the mechanisms for disease and possible therapeutic targets. Another question of great interest involves the enumeration of selective pressures on P. aeruginosa in the CF lung environment. For example, P. aeruginosa samples obtained from sputum during the life of a CF patient have shown gene mutations and significant alterations in gene expression over time (4, 60). Notably, convergent evolution toward a loss of virulence factors by P. aeruginosa strains taken from multiple CF patients has been demonstrated, suggesting that virulence traits might be selected against once a stable infection has been achieved (60). The effect of the loss of the lasR gene, a master regulator of hundreds of virulence-related genes through the quorum-sensing system in P. aeruginosa, has also been evaluated (8, 56). It was shown that loss of the lasR gene conferred a growth advantage both to strains extracted from the lungs of CF patients and from strains that emerged by selection on rich medium, indicating that optimization of yield could act as a metabolic objective even in the CF lung (8). This result is counterintuitive, as much of the success of P. aeruginosa in chronic lung infections seems to lie in its ability to adopt a slow-growth phenotype, which would suggest that optimization of yield is perhaps not a strong selective pressure in that environment (6). Regardless of whether slow-growing mutants of P. aeruginosa in the CF lung environment are actually optimized for yield, these studies indicate that metabolic selective pressure might be a factor in the evolution of chronic CF strains. With the integration of simple regulatory rules into iMO1056, it will be possible to model different hypotheses about selective pressures in the lung and to analyze the causes for these selective processes.
In addition to the pathogenicity analysis and evolutionary studies outlined above, iMO1056 can serve as a valuable tool in interpreting and informing genome-scale transcriptomic studies of PAO1. Microarray analysis has attracted sizeable interest in the P. aeruginosa community and was recently used to determine genome-level expression changes under various stresses and conditions (13, 36, 71). Enabling a survey of system-level traits of an organism in a relatively unbiased way, microarrays are a crucial element in postgenomic analyses of cell phenotype. While gene expression data cannot be linked directly with metabolic fluxes, past studies have used gene expression data to indicate the regulation of whole pathways in metabolism, thus indicating global phenotypes (9, 12). Furthermore, metabolic reconstructions have been used to predict which genes in a network are likely to be coregulated, and overlaying expression data on a metabolic reconstruction can inform interpretation of microarrays within the context of a genome-scale model (9, 43).
P. aeruginosa is an organism of much interest for its various roles as an opportunistic pathogen (60). From chronic lifelong infections of the lungs of CF patients to acute, highly deadly infections of severe wounds in burn victims, the robustness and environmental diversity of P. aeruginosa are testament to its remarkable natural metabolic agility. We chose to focus our reconstruction effort on P. aeruginosa PAO1 since it is the most studied P. aeruginosa strain and is also the best-characterized strain, but with minor modifications to iMO1056 the model can be tailored to describe similar strains, such as the more virulent P. aeruginosa PA14 (62, 69). A genome-scale metabolic model represents a potentially enormous tool for rational drug design and prediction of cell phenotypes and, in conjunction with regulatory information, can serve in modeling disease processes and engineering therapeutic responses. For P. aeruginosa, a bacterium whose metabolic diversity is a major determinant of virulence, a metabolic network reconstruction will serve as an essential component in a multifaceted and effective response to disease.
We acknowledge funding from the Whitaker Foundation, the National Science Foundation (CAREER grant 0643548), the National Institutes of Health (NIH Biotechnology training grant [M.A.O.]), the Kluyver Centre for Genomics (The Netherlands), the European Union (Project PROBACTYS; no. 21904), and ERA-NET SysMO (BMBF).
Published ahead of print on 11 January 2008. ![]()
Supplemental material for this article may be found at http://jb.asm.org/. ![]()
M.A.O. and J.P. contributed equally to this study. ![]()
V.A.P.M.D.S. and J.A.P. contributed equally to this study. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»