Professional Documents
Culture Documents
points outside this space are denied by the constraints. Through optimization of an objective
The flux through all of the reactions in a function, FBA can identify a single optimal flux distribution that lies on the edge of the
network is represented by the vector v, allowable solution space.
which has a length of n. The concentrations
of all metabolites are represented by the analyze single points within the solution function. In practice, when only one
vector x, with length m. The system of mass space. For example, we may be interested reaction is desired for maximization or
balance equations at steady state (dx/dt = in identifying which point corresponds to minimization, c is a vector of zeros with a
0) is given in Fig. 2c26: the maximum growth rate or to maximum value of 1 at the position of the reaction
Sv = 0 ATP production of an organism, given its of interest (Fig. 2d).
Any v that satisfies this equation is particular set of constraints. FBA is one Optimization of such a system is
said to be in the null space of S. In any method for identifying such optimal points accomplished by linear programming
realistic large-scale metabolic model, within a constrained space (Fig. 1). (Fig. 2e). FBA can thus be defined as
there are more reactions than there are FBA seeks to maximize or minimize an the use of linear programming to solve
compounds (n > m). In other words, objective function Z = cTv, which can be the equation Sv = 0, given a set of upper
there are more unknown variables than any linear combination of fluxes, where and lower bounds on v and a linear
equations, so there is no unique solution c is a vector of weights indicating how combination of fluxes as an objective
to this system of equations. much each reaction (such as the biomass function. The output of FBA is a particular
Although constraints define a range of reaction when simulating maximum flux distribution, v, which maximizes or
solutions, it is still possible to identify and growth) contributes to the objective minimizes the objective function.
glucose gDW–1 h–1; DW, dry weight) and set- constrained to an uptake rate of 0 mmol growth of deleting every pairwise combina-
ting the maximum rate of oxygen uptake to gDW–1 h–1. Constraints can also be tailored tion of 136 E. coli genes to find double gene
an arbitrarily high level, so that it does not to the organism being studied, with lower knockouts that are essential for survival of
limit growth. Then, linear programming is bounds of 0 mmol gDW–1 h–1 used to simu- the bacteria.
used to determine the maximum possible late reactions that are irreversible in some FBA has limitations, however. Because it
flux through the biomass reaction, result- organisms. Nonzero lower bounds can also does not use kinetic parameters, it cannot
ing in a predicted exponential growth rate force a minimal flux through artificial reac- predict metabolite concentrations. It is also
of 1.65 h–1. Anerobic growth of E. coli can be tions (like the biomass reaction) such as the only suitable for determining fluxes at steady
calculated by constraining the maximum rate ‘ATP maintenance reaction’, which is a bal- state. Except in some modified forms, FBA
of uptake of oxygen to zero and solving the anced ATP hydrolysis reaction used to sim- does not account for regulatory effects such
system of equations, resulting in a predicted ulate energy demands not associated with as activation of enzymes by protein kinases
growth rate of 0.47 h–1 (see Supplementary growth13. Constraints can even be used to or regulation of gene expression. Therefore,
Tutorial for computer code). simulate gene knockouts by limiting reac- its predictions may not always be accurate.
As these two examples show, FBA can be tions to zero flux.
used to perform simulations under differ- FBA does not require kinetic parameters The many uses of flux balance analysis
ent conditions by altering the constraints and can be computed very quickly even for Because the fundamentals of flux balance analy-
on a model. To change the environmental large networks. This makes it well suited sis are simple, the method has found diverse
conditions (such as substrate availabil- to studies that characterize many different uses in physiological studies, gap-filling efforts
ity), we change the bounds on exchange perturbations such as different substrates or and genome-scale synthetic biology3. By alter-
reactions (that is, reactions representing genetic manipulations. An example of such a ing the bounds on certain reactions, growth on
metabolites flowing into and out of the sys- case is given in example 6 in Supplementary different media (example 1 in Supplementary
tem). Substrates that are not available are Tutorial, which explores the effects on Tutorial) or of bacteria with multiple gene
uc s
Reactions
Ox ose
Gl mas
en
yg
consumed during biomass production. Exchange 1 2 ... n
o
Bi
reactions (green columns) are used to represent the b Mathematically represent A
B
–1
1 –1
v1
v2
flow of metabolites, such as glucose and oxygen,
Met abolit es
metabolic reactions C 1 –2
* = 0
...
in and out of the cell. (c) At steady state, the flux and constraints D 1
vn
through each reaction is given by Sv = 0, which ... –1 vbiomass
defines a system of linear equations. As large –1 vglucose
m voxygen
models contain more reactions than metabolites,
Stoichiometric matrix, S Fluxes, v
there is more than one possible solution to these
equations. (d) Solving the equations to predict the
maximum growth rate requires defining an objective –v1 + ... = 0
function Z = cTv (c is a vector of weights indicating c Mass balance defines a v1 – v2 + ... = 0
how much each reaction (v) contributes to the system of linear equations
objective). In practice, when only one reaction, such v1 – 2v2 + ... = 0
© 2010 Nature America, Inc. All rights reserved.
knockouts (example 6 in Supplementary A more advanced form of robustness analysis that predict which reactions are missing by
Tutorial) can be simulated14. FBA can then be involves varying two fluxes simultaneously to comparing in silico growth simulations to
used to predict the yields of important cofactors form a phenotypic phase plane19 (example 5 experimental results20-22. Constraint-based
such as ATP, NADH, or NADPH15 (example 2 in Supplementary Tutorial). models can also be used for metabolic engi-
in Supplementary Tutorial). All genome-scale metabolic recon- neering where FBA-based algorithms, such
Whereas the example described here structions are incomplete, as they contain as OptKnock23, can predict gene knockouts
yielded a single optimal growth phenotype, ‘knowledge gaps’ where reactions are miss- that allow an organism to produce desirable
in large metabolic networks, it is often pos- ing. FBA is the basis for several algorithms compounds24,25.
sible for more than one solution to lead to
the same desired optimal growth rate. For
example, an organism may have two redun- Box 2 Tools for FBA
dant pathways that both generate the same
FBA computations, which fall into the category of constraint-based reconstruction and
amount of ATP, so either pathway could be
analysis (COBRA) methods, can be performed using several available tools 27-29. The
used when maximum ATP production is the COBRA Toolbox11 is a freely available Matlab toolbox (http://systemsbiology.ucsd.edu/
desired phenotype. Such alternate optimal Downloads/Cobra_Toolbox) that can be used to perform a variety of COBRA methods,
solutions can be identified through flux vari- including many FBA-based methods. Models for the COBRA Toolbox are saved in
ability analysis, a method that uses FBA to the Systems Biology Markup Language (SBML) 30 format and can be loaded with the
maximize and minimize every reaction in function ‘readCbModel’. The E. coli core model used in this Primer is available at
a network16 (example 3 in Supplementary http://systemsbiology.ucsd.edu/Downloads/E_coli_Core/.
Tutorial), or by using a mixed-integer lin- In Matlab, the models are structures with fields, such as ‘rxns’ (a list of all reaction
ear programming–based algorithm17. More names), ‘mets’ (a list of all metabolite names) and ‘S’ (the stoichiometric matrix).
detailed phenotypic studies can be performed The function ‘optimizeCbModel’ is used to perform FBA. To change the bounds on
such as robustness analysis18, in which the reactions, use the function ‘changeRxnBounds’. The Supplementary Tutorial contains
effect on the objective function of varying examples of COBRA toolbox code for performing FBA, as well as several additional
a particular reaction flux can be analyzed types of constraint-based analysis.
(example 4 in Supplementary Tutorial).
Ultimately, FBA produces predictions that 659–667 (2008). 17. Lee, S., Phalakornkule, C., Domach, M.M. & Grossmann,
must be verified. Experimental studies are 4. Oberhardt, M.A., Palsson, B.O. & Papin, J.A. Mol. I.E. Comput. Chem. Eng. 24, 711–716 (2000).
Syst. Biol. 5, 320 (2009). 18. Edwards, J.S. & Palsson, B.O. Biotechnol. Prog. 16,
used as part of the model reconstruction 5. Thiele, I. & Palsson, B.O. Nat. Protoc. 5, 93–121 927–939 (2000).
process and to validate model predictions. (2010). 19. Edwards, J.S., Ramakrishna, R. & Palsson, B.O.
6. Feist, A.M., Herrgard, M.J., Thiele, I., Reed, J.L. & Biotechnol. Bioeng. 77, 27–36 (2002).
Studies have shown that growth rates of E. Palsson, B.O. Nat. Rev. Microbiol. 7, 129–143 20. Reed, J.L. et al. Proc. Natl. Acad. Sci. USA 103,
coli on several different substrates predicted (2009). 17480–17484 (2006).
by FBA agree well with those obtained by 7. Durot, M., Bourguignon, P.Y. & Schachter, V. FEMS 21. Kumar, V.S. & Maranas, C.D. PLoS Comput. Biol. 5,
Microbiol. Rev. 33, 164–190 (2009). e1000308 (2009).
experimental measurements14. Model-based 8. Covert, M.W. et al. Trends Biochem. Sci. 26, 179– 22. Satish Kumar, V., Dasika, M.S. & Maranas, C.D. BMC
predictions of gene essentiality have also 186 (2001). Bioinformatics 8, 212 (2007).
been shown to be quite accurate2. 9. Edwards, J.S., Covert, M. & Palsson, B. Environ. 23. Burgard, A.P., Pharkya, P. & Maranas, C.D.
Microbiol. 4, 133–140 (2002). Biotechnol. Bioeng. 84, 647–657 (2003).
This primer and the accompanying tuto- 10. Price, N.D., Reed, J.L. & Palsson, B.O. Nat. Rev. 24. Feist, A.M. et al. Metab. Eng. published online,
rials based on the COBRA toolbox (Box 2) Microbiol. 2, 886–897 (2004). doi:10.1016/j.ymben.2009.10.003 (17 October
11. Becker, S.A. et al. Nat. Protoc. 2, 727–738 2009).
should help those interested in harnessing (2007). 25. Park, J.M., Kim, T.Y. & Lee, S.Y. Biotechnol. Adv. 27,
the growing cadre of genome-scale metabolic 12. Orth, J.D., Fleming, R.M. & Palsson, B.O. in EcoSal 979–988 (2009).
reconstructions that are becoming available. —Escherichia coli and Salmonella Cellular and 26. Palsson, B.O. Systems Biology: Properties of
Molecular Biology (ed. Karp, P.D.) (ASM Press, Reconstructed Networks (Cambridge University
Washington, DC, 2009). Press, New York, 2006).
ACKNOWLEDGMENTS
13. Varma, A. & Palsson, B.O. Biotechnol. Bioeng. 45, 27. Jung, T.S., Yeo, H.C., Reddy, S.G., Cho, W.S. & Lee,
This work was supported by National Institutes of 69–79 (1995). D.Y. Bioinformatics 25, 2850–2852 (2009).
Health grant no. R01 GM057089. 14. Edwards, J.S., Ibarra, R.U. & Palsson, B.O. Nat. 28. Klamt, S., Saez-Rodriguez, J. & Gilles, E.D. BMC
Biotechnol. 19, 125–130 (2001). Syst. Biol. 1, 2 (2007).
© 2010 Nature America, Inc. All rights reserved.
1. Duarte, N.C. et al. Proc. Natl. Acad. Sci. USA 104, 15. Varma, A. & Palsson, B.O. J. Theor. Biol. 165, 477– 29. Lee, D.Y., Yun, H., Park, S. & Lee, S.Y. Bioinformatics
1777–1782 (2007). 502 (1993). 19, 2144–2146 (2003).
2. Feist, A.M. et al. Mol. Syst. Biol. 3, 121 (2007). 16. Mahadevan, R. & Schilling, C.H. Metab. Eng. 5, 30. Hucka, M. et al. Bioinformatics 19, 524–531
3. Feist, A.M. & Palsson, B.O. Nat. Biotechnol. 26, 264–276 (2003). (2003).