lecture #19

46
Lecture #19 Alternative (equivalent) Optima

Upload: kaethe

Post on 15-Jan-2016

45 views

Category:

Documents


0 download

DESCRIPTION

Lecture #19. Alternative (equivalent) Optima. Summary. LP statement and the occurrence of many optimal solutions Methods to study degenerate solutions Flux variability analysis (FVA) Extreme pathways and multiple optima Enumerating all the alternative optima. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Lecture #19

Lecture #19

Alternative (equivalent) Optima

Page 2: Lecture #19

Summary

• LP statement and the occurrence of many optimal solutions

• Methods to study degenerate solutions– Flux variability analysis (FVA)– Extreme pathways and multiple

optima– Enumerating all the alternative

optima

Page 3: Lecture #19

FBA Optimization Problem Statement

• Objective Function: A function that is maximized or minimized to identify optimal solutions

• Constraints: Place limits on the allowable values the solutions can take on.

Maximize: cv

Such that S v = b =0LB v UB

v1

v3

v2

Page 4: Lecture #19

Equivalent Optimal Solutions Exist:How can we find & characterize them?

Gro

wth

Rat

e

Projected Solution Space

FBA

Some Flux

Page 5: Lecture #19

Example of two equivalent optima

•Two equivalent (same input/output state) flux distributions for optimally generating malate from succinate in the core E. coli model.•All non-negative combinations of them give the same optima

Page 6: Lecture #19

FLUX VARIABILITY ANALYSIS (FVA)

Method 1:

Page 7: Lecture #19

maximize 3w1 + 3w2

such that w1 + w2 ≤ 2 w1, w2 ≥ 0w2

w1

1

1

Alternate optimal solutions:3w1 + 3w2 = 6

min(w1) from FVA max(w1) from FVA

min(w2) from FVA

max(w2) from FVA

Alternate Optimal Solutions

maximize 3w1 + w2

such that w1 + w2 ≤ 2 w1, w2 ≥ 0w2

w1

1

1

Unique optimal solution

Unique Optimal Solution

Example: flux variability analysis

Page 8: Lecture #19

Flux Variability Analysis:

Flux Variabilityfor V1

Alternate orEquivalent Optima

Incr

ease

in

Sta

ted

Obj

ectiv

e F

unct

ion

V1

V2

• First, identify the maximum value of the objective function and constrain objective function to this value.

• Second, minimize and maximize each flux independently to identify flexibility in the fluxes across alternate optima.

If we have n fluxes, we solve ≈ 2n FBA problems

Page 9: Lecture #19

Flux Variability Analysis (FVA): the concept

• Determine what is possible based on external measurements and network stoichiometry

Solve series of 2 linear optimization for each reactions j.

Objective: maximize & minimize vj (for all

j) subject to (be in the space of optimal solutions):

i = metabolitesj = reactionsjv

+ ∞

max

“FeasibleRegion”

- ∞

jvmin

Metabolic Eng, 2003. 5(4): p. 264-76.

maxminjjj v

S ∙ v = 0

<c ∙ v> = Zopt

Page 10: Lecture #19

Reduced Cost

• Definition: – dZ/dvi =0; (and more specifications)

• In order for a flux to be variable, the reduced cost must be equal to zero. The converse is not necessarily true.

Page 11: Lecture #19

Core E. coli Metabolism

Page 12: Lecture #19

Metabolite YieldCarbon

Conversion

# of Variable

Fluxes3PG 2 100% 24PEP 2 100% 24Pyr 2 100% 24OA 2 133.33% 24G6P 0.8916 89.16% 16F6P 0.8916 89.16% 16R5P 1.0571 88.10% 44E4P 1.2982 86.55% 25G3P 1.6818 84.09% 13AcCoA 2 66.67% 36αKG 1 83.33% 36SuccCoA 1.64 109.33% 2

Yield on Glucose (Aerobic):E. coli core model (95 fluxes)

Page 13: Lecture #19

Yield on Glucose (Anaerobic):E. coli core model

Metabolite YieldCarbon

Conversion

# of Variable

Fluxes3PG 1 50% 38PEP 1 50% 38Pyr 1 50% 38OA 1 66.67% 38G6P 0.625 62.50% 2F6P 0.625 62.50% 2R5P 0.72 60.00% 26E4P 0.8491 56.60% 21G3P 1.0345 51.72% 21AcCoA 1 33.33% 38αKG 0.4 33.33% 38SuccCoA 1.434 95.60% 2

Page 14: Lecture #19

Flux Variability for optimal 3PG yield on Glucose

Page 15: Lecture #19

flux = 0 always

flux = 1 always

flux varies between 0 and 0.5

flux varies between 1 and 1.5

Page 16: Lecture #19

Studies using FVA• Mahadevan R, Schilling CH.

The effects of alternate optimal solutions in constraint-based genome-scale metabolic models. Metab Eng. 2003 Oct;5(4):264-76.

• Duarte, N.C., Palsson, B.Ø., and Fu, P., "Integrated Analysis of Metabolic Phenotypes in ''Saccharomyces cerevisiae'', BMC Genomics, 5:63 (2004).

• Reed, J.L. and Palsson, B.Ø., "Genome-scale in silico models of ''E. coli'' have multiple equivalent phenotypic states: assessment of correlated reaction subsets that comprise network states”, Genome Research, 14:1797-1805(2004).

• Vo, T.D., Greenberg, H.J., and Palsson, B.Ø., "Reconstruction and functional characterization of the human mitochondrial metabolic network based on proteomic and biochemical data", Journal of Biological Chemistry, 279(38):39532-40 (2004).

• Teusink B, Wiersma A, Molenaar D, Francke C, de Vos WM, Siezen RJ, Smid EJ. “Analysis of growth of Lactobacillus plantarum WCFS1 on a complex medium using a genome-scale metabolic model.” J Biol Chem. 281(52):40041-8 (2006).

Page 17: Lecture #19

Literature Example: Vo et. al. 2004Reconstruction and functional characterization of the human mitochondrial metabolic network based on

proteomic and biochemical data

• Study on human mitochondria under various optimality conditions:– Condition 1: max ATP synthesis– Condition 2: max Heme production– Condition 3: max Phospholipid production

Page 18: Lecture #19

FVA of the human mitochondria in the cardiomyocyte

Ordered set of reactions showing decrease in variability as additional optimization criteria are added.

Highly variable fluxes

Page 19: Lecture #19

FVA as a method for evaluating effect of constraints

• L. plantarum model.• Blue – range of fluxes

in unconstrained model.• Green – range of fluxes

in ATP-constrained model:– ATP production equals

ATP consumption

J Biol Chem. 281(52):40041-8 (2006).

Page 20: Lecture #19

FVA Final thoughts

Pseudo codeFVA(S, vmin, vmax, k)

% optimization 1: maximize reaction k

c_k = (0 … 0,1,0, 0) % a 1 at position k

f_opt = max(c_k∙v | S∙v = 0, vmin < v < vmax)

% the actual FVA. max/min every other rxn

for i = 1:nc = (0 … 0,1,0, 0) % a 1 at position i

FVAmini = min(c∙v | S∙v = 0, vmin < v < vmax, vk = f_opt)

FVAmaxi = max(c∙v | S∙v = 0, vmin < v < vmax, vk = f_opt)

end

return (FVAmin, FVAmax)

• Pros:– Easy to compute– Easy to interpret

• Cons:– FVA does not give any

information about correlated reactions

– Each reaction is treated independently

Page 21: Lecture #19

EXTREME PATHWAY ANALYSIS

Method 2:

Page 22: Lecture #19

Metabolic Genotype to PhenotypeDefined within the context of convex analysis

Convex AnalysisConvex Analysis Cellular BiologyCellular Biology

Unique GeneratingVectors

IndependentExtreme Pathways

Flux VectorPositive Combination of Extreme Pathways

me

tab

olic

flu

x (v

1)

metabolic flux (v

2 )

metabolic flux (v 3)

Convex HullCapabilities of a Metabolic Genotype

Particular SolutionMetabolic Phenotype

Page 23: Lecture #19

ExampleFrom the core E. coli model – pyruvate yields from glucose, aerobic

without regulation with regulation

1.48 1.5 1.52 1.54 1.56 1.58 1.6 1.62 1.64 1.66 1.68

x 104

0

0.2

0.4

0.6

0.8

1

1.2

1.4

1.6

1.8

2

Extreme Pathway Number

Pyr

uvat

e Y

ield

1450 1500 1550 1600 1650 1700 1750 1800 1850 1900

0.2

0.4

0.6

0.8

1

1.2

1.4

1.6

1.8

2

Extreme Pathway Number

Pyr

uvat

e Y

ield

Optimal equivalent pathways

Suboptimal equivalent pathways

Non-producing pathways

Page 24: Lecture #19

Equivalent sets exponentially increase the number of ExPas

X1

v1X1a

X1b

X2

X2a

X2b

v1a

v1b

v2a

v2b

X3 … XnVy

Xna

Xnb

vna

vnb

Xy

v1c

v1d

v2c

v2d

v3a

v3b

V(n-1)c

V(n-1)d

vnc

vnd

Pathway of n equivalent sets of size 2 forms 2n extreme pathways.

Set 1 Set 2 … Set n

Page 25: Lecture #19

Equivalent Sets

Two equivalent sets in E. coli. A – Two alternative and equivalent ways to import a proton using two succinate transporters or two transhydrogenases. B - two equivalent ways to produce pyruvate, CO2 and NADH from Malate and NAD.

Page 26: Lecture #19

Some Extreme Pathway Literature• Schilling,C.H., Edwards, J.S., Letscher, D.L., and Palsson, B.Ø.,

"Combining pathway analysis with flux balance analysis for the comprehensive study of metabolic systems" , Biotechnology and Bioengineering 71: 286-306 (2001).

• Price, N.D., Reed, J.L., Papin, J.A., Famili, I. and Palsson, B.Ø.; "Analysis of Metabolic Capabilities using Singular Value Decomposition of Extreme Pathway Matrices ", Biophysical Journal 84:794-804 (2003).

• Papin, J.A., Price, N.D., and Palsson, B.Ø.; "Extreme Pathway Lengths and Reaction Participation in Genome-Scale Metabolic Networks ," Genome Research, 12: pp. 1889-1900 (2002).

• Price, N.D., Famili, I., Beard, D.A., and Palsson, B.Ø., "Extreme Pathways and Kirchhoff's Second Law ", Biophysical Journal, 83: pp. 2879-2882. (2002).

• Schilling, C. H., Covert, M.W., Famili, I., Church, G.M., Edwards, J.S., and Palsson, B.Ø., "Genome-scale metabolic model of Helicobacter pylori 26695 ", Journal of Bacteriology, 184(16): pp. 4582-4593 (2002).

• Wiback, S.J., and Palsson, B.Ø., "Extreme Pathway Analysis of Human Red Blood Cell Metabolism ", Biophysical Journal, 83(2): pp. 808-818 (2002).

• Papin, J.A., Price, N.D., Edwards, J.S., and Palsson, B.Ø., "The Genome-Scale Metabolic Extreme Pathway Structure in Haemophilus influenzae Shows Significant Network Redundancy ", Journal of Theoretical Biology, 215(1): pp. 67-82 (2002).

• Price, N.D., Papin, J.A., Palsson, B.Ø., "Determination of Redundancy and Systems Properties of the Metabolic Network of Helicobacter pylori Using Genome-Scale Extreme Pathway Analysis",Genome Research, 12: 760-769 (2002).

Page 27: Lecture #19

Extreme Pathways Final thoughts

Implementation:Actual implementation is quite difficult. A

rough sketch:Begin with matrix T(0) ≈ ST

Through a series of transformations T(0) → T(1) → … → T(n) zero out certain elements in T.

Read off Extreme Pathways from T(n)

As this happens, size of T increases.

Several implementations available at:

http://systemsbiology.ucsd.edu/Downloads/Extreme_Pathway_Analysis

• Pros:– Provides biologically

meaningful pathways– Form a mathematically

relevant convex basis

• Cons:– Computation scales

very poorly with network size.

– Numbers grow with networks size

Page 28: Lecture #19

Enumerating all Optimal Solutions

Method 3:

Page 29: Lecture #19

Algorithm For Identifying Different “Corner” Points

• GOAL: given your past solutions, find a new one that uses a different set of non-zero fluxes in the solution.

• The result is that you will identify all the different corner point solutions that have the same objective function value.

• Any optimal solution, can be written as the weighted sum of the corner point optimal solutions (aka we have a convex basis) of the optimal solution space.

Page 30: Lecture #19

Metabolic Network Example

v1: A → Bv2: A → C

b1: → Ab2: B →b3: C →

Reaction List

AB

Cb3

b2

b1v1

v2

Metabolic Map

Maximize Z = c·v = b3

Such that S·v = 0 0 v1,v2,b1,b2,b3 10

-10 v3 10

v3: B Cv3

Page 31: Lecture #19

AB

C 10

10

10

Solution 1:

AB

C 10

10

10

10

Solution 2:

1. Find one valid flux distribution (Solution 1)

2. Repeatedly solve for optimal again:

1. At each Iteration, add additional constraints that new solution must be “different” from any previous solution.

In this Example there are only 2 optimal solutions so the algorithm must only be run for two iterations.

Intuitive Description of Algorithm

Page 32: Lecture #19

Metabolite YieldCarbon

Conversion

# of Variable

FluxesAlternate

optima3PG 2 100% 24 11PEP 2 100% 24 11Pyr 2 100% 24 81OA 2 133.33% 24 11G6P 0.8916 89.16% 16 2F6P 0.8916 89.16% 16 2R5P 1.0571 88.10% 44 2E4P 1.2982 86.55% 25 2G3P 1.6818 84.09% 13 2AcCoA 2 66.67% 36 162αKG 1 83.33% 36 >500*SuccCoA 1.64 109.33% 2 1

Number of Alternate Optima for various Metabolic Objectives (aerobic) in core E. coli

*Did not converge

≥ # of ExPas

Page 33: Lecture #19

Studies using enumeration of AO• Lee, S. , Palakornkule, C., Domach, M. M., and Grossmann, I.E.,

“Recursive MILP model for finding all the alternate optima in LP models for metabolic networks”, Computers & Chemical Engineering, 24(2-7): 711-716 (2000)

• Mahadevan R, Schilling CH. “The effects of alternate optimal solutions in constraint-based genome-scale metabolic models.” Metab Eng. 2003 Oct;5(4):264-76.

• Reed, J.L. and Palsson, B.Ø., "Genome-scale in silico models of ''E. coli'' have multiple equivalent phenotypic states: assessment of correlated reaction subsets that comprise network states”, Genome Research, 14:1797-1805(2004).

• Thiele, I., Fleming, R.M.T., Bordbar, A., Schellenberger, J., and Palsson, B.Ø., "Functional Characterization of Alternate Optimal Solutions of Escherichia coli's Transcriptional and Translational Machinery", Biophys J 98(10):2072-2081 (2010).

Page 34: Lecture #19

Literature Analysis:Genome-Scale In Silico Models of E. coli Have Multiple Equivalent Phenotypic States:

Assessment of Correlated Reaction Subsets That Comprise Network States

• Simulated E. coli iJR904 model under 136 growth conditions (88 aerobic, 48 anaerobic)

• Generated up to 500 alternate optima for each growth condition using MILP algorithm.

Page 35: Lecture #19

Result 1: Only few alternate optima needed to describe range of FVA solutions

• Comparisons of properties for sampled optima with all optima.

• The number of variable fluxes and the allowable ranges for these fluxes across all optima were calculated using FVA

• Each line is for one of the 88 carbon sources capable of supporting aerobic growth.

– (A) shows that as the number of calculated optima increases, the number of variable fluxes found in these sampled optimal solutions approaches the total number of variable fluxes.

– (B) shows how the magnitude of the flux variations is represented by the sampled optima relative to the actual flux variability across all optima.

Finding variable fluxes

Finding the range of variable fluxes

Page 36: Lecture #19

Result 2: Reactions usage in optimal flux distributions

• For each reaction in the metabolic network, what fraction of the optimal flux distributions utilize this reaction (fopt).

• The reactions are then rank-ordered by frequency of use in optimal flux distributions.

• Each reaction in the model was previously classified into one of 30 subsystems.

• Different subsystems are used with different frequency.

All optimal solutions

904 genes931 rxns

Page 37: Lecture #19

Result 3: Correlated sets and regulon structure

• All optimal distributions were combined and the reaction correlations determined.

• 66 correlated reaction sets emerged, most of size 2.

• Reactions in sets tend to be controlled by the same set of genes as defined by the EcoCyc regulon structures

Page 38: Lecture #19

Results 4: Comparing to expression data

• Used 20 expression data sets to see if it correlated with: – A) Genes in correlated

reaction sets.– B) Genes in the same

transcription units.

Significant clusters

Page 39: Lecture #19

Main Results

• Only a small subset of reactions in the network have variable fluxes across optima;

• Sets of reactions that are always used together in optimal solutions, correlated reaction sets, showed moderate agreement with the currently known transcriptional regulatory structure in E. coli and available expression data, and

• Reactions that are used under certain environmental conditions can provide clues about network regulatory needs.

Page 40: Lecture #19

Enumeration: Final thoughts

Implementation:Iterative MILP method where each

iteration produces one additional flux distribution.

Implemented in GAMS and Matlab.

For exact specifications see:Lee, S. , Palakornkule, C., Domach, M. M., and

Grossmann, I.E., “Recursive MILP model for finding all the alternate optima in LP models for metabolic networks”, Computers & Chemical Engineering, 24(2-7): 711-716 (2000)

Reed, J.L. and Palsson, B.Ø., "Genome-scale in silico models of ''E. coli'' have multiple equivalent phenotypic states: assessment of correlated reaction subsets that comprise network states”, Genome Research, 14:1797-1805(2004).

• Pros:– Easier to compute (MILP)

than ExPas– Can terminate early without

computing all alternatives (not possible for ExPas)

– Forms mathematically “nice” convex set of optimal flux distributions (tighter than FVA)

• Cons:– Number of equivalent

solutions may be quite large (> 500)

Page 41: Lecture #19

Summary• Biologically relevant alternate optima exist• Linear programming often does not define

unique optimum.• Several techniques exist for studying alternate

optima and sub-optima– Flux variability analysis– Extreme pathways– Enumeration of all equivalent optimal solutions

• These methods have their pros and cons and you need to choose one that suits your needs

Page 42: Lecture #19

Appendix

Details of enumeration algorithm

Page 43: Lecture #19

Algorithm Details

• GOAL: given your past solutions, find a new one that uses a different set of non-zero fluxes in the solution.

• The result is that you will identify all the different corner point solutions that have the same objective function value.

• Any optimal solution, can be written as the weighted sum of the corner point optimal solutions.

Page 44: Lecture #19

AB

C 10

10

10

Solution 1:

AB

C 10

10

10

10

Solution 2:

1. Find non-zero fluxes (NZJ-

1) at current solution, J-1.

2. Pick at least one NZJ-1 flux to become zero at next solution (yi=1).

3. Make sure that the set of non-zero fluxes haven’t been visited at previous k iterations.

4. Constrain those selected fluxes to have zero flux.

5. Find solution & repeat.

yi 1

i NZJ-1

wi |NZk|-1i NZk

yi+wi 1

wi·vmin vi wi·vmin

Page 45: Lecture #19

AB

C 10

10

10

Solution 1: 1. Find non-zero fluxes (NZJ-

1) at current solution, J-1.

2. Pick at least one NZJ-1 flux to become zero at next solution (yi=1).

3. Make sure that the set of non-zero fluxes haven’t been visited at previous k iterations.

4. Constrain those selected fluxes to have zero flux.

5. Find solution & repeat.

yi 1

i NZJ-1

wi |NZk|-1i NZk

yi+wi 1

wi·vmin vi wi·vmin

1. My NZ1 fluxes are v2,b1,b3.

2. I will pick v2 to become zero at next solution: yv2=1 & yb1=yb3=0;

3. wv2 = 0; lets assume that wb1 and wb3 = 1.

4. v2 = 0 and the other fluxes can be between their normal vmin and vmax values.

wi |NZ1|-1 (2 2)i NZ1

Page 46: Lecture #19

AB

C 10

10

10

Solution 1:

AB

C 10

10

10

10

Solution 2:

1. Find non-zero fluxes (NZJ-

1) at current solution, J-1.

2. Pick at least one NZJ-1 flux to become zero at next solution (yi=1).

3. Make sure that the set of non-zero fluxes haven’t been visited at previous k iterations.

4. Constrain those selected fluxes to have zero flux.

5. Find solution & repeat.

yi 1

i NZJ-1

wi |NZk|-1i NZk

yi+wi 1

wi·vmin vi wi·vmin

wi |NZk|-1i NZk

yi+wi 1