from genomes to pathway tools and from pathway tools to metabolic models

27
From genomes to Pathway tools and from Pathway Tools to Metabolic Models Jeremy Zucker Broad Institute of MIT and Harvard

Upload: minerva-hendricks

Post on 01-Jan-2016

36 views

Category:

Documents


2 download

DESCRIPTION

From genomes to Pathway tools and from Pathway Tools to Metabolic Models. Jeremy Zucker Broad Institute of MIT and Harvard. Outline. Enzyme prediction with EFICAz TBCyc curation FungiCyc curation Network debugging tools. From genome sequences to Metabolic flux models. Enzyme predictor. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

From genomes to Pathway tools and from Pathway Tools to

Metabolic ModelsJeremy Zucker

Broad Institute of MIT and Harvard

Page 2: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

Outline

• Enzyme prediction with EFICAz• TBCyc curation• FungiCyc curation• Network debugging tools

Page 3: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

Pathway predictor

Enzyme predictor

Model

Predictions

Omics data

From genome sequences to Metabolic flux models

Page 4: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

Enzyme Prediction

• EFICAz: – Predicts Enzyme Commision Numbers– Functionally Discriminating Residues

• Homofunctional MSA• Heterofunctional MSA

– Support Vector Machine– Sequence Identity Threshold– Integrates 9 independent methods to achieve

maximal accuracy• KEGG-BLAST

Page 5: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

Enzyme Prediction

• EficazTool incorporated into Calhoun web interface

• Runs on LSF cluster for all Transcripts in an AnalysisRun

• Automatically assigns EC number OntologyTerm and ToolEvidence to Transcript

Page 6: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

TBCyc curation

Page 7: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

TBCyc curation

• TBDB TBCyc integration• Palsson Flux models => curated pathways• McFadden flux model => curated pathways• Literature => curated pathways• Expression data => Omics viewer• Chip-Seq => curated gene regulatory network• Coming soon: – annotation jamboree for TB– Connect to Decode Open source drug discovery for TB

Page 8: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

NeurosporaCyc curation

Page 9: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

Neurospora curation

• Resources:– Model organism– Active Community Annotation Project– Recent genome assembly and annotation– Systems Biology grant• Chip-seq• KO library• Biofuels

Page 10: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

Biofuels from Neurospora?

• Growing interest for obtaining biofuels from fungi

• Neurospora crassa has more cellulytic enzymes than Trichoderma reesei

• N. crassa can degrade cellulose and hemicellulose to ethanol [Rao83]

• Simultaneous saccharification and fermentation means that N. crassa is a possible candidate for consolidated bioprocessing

Xylose

Ethanol

Page 11: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

Effects of Oxygen limitation on Xylose fermentation in Neurospora crassa

Zhang, Z., Qu, Y., Zhang, X., Lin, J., March 2008. Effects of oxygen limitation on xylose fermentation, intracellular metabolites, and key enzymes of Neurospora crassa as3.1602. Applied biochemistry and biotechnology 145 (1-3), 39-51.

Xylose

Pyruvate

TCA Ethanol

Respiration Fermentation

Glycolysis

0 2 4 6 8 10 12 140

10

20

30

40

50

60

70

Ethanol production vs Oxygen level

Oxygen level (mmol/L*g)

Etha

nol c

onve

rsio

n (%

)Low O2

Intermediate O2

High O2

Page 12: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

Glycolysis

Xylose degradationPentose phosphate

Aerobic respirationFermentation

TCA Cycle

Model of Xylose Fermentation

Xylose

Oxygen

Ethanol

ATP

Two paths from xylose to xylitol

Page 13: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

Glycolysis

Xylose degradationPentose phosphate

Aerobic respirationFermentation

TCA Cycle

Oxygen=5

ATP=16.3

NADPHRegeneration

NADPH &NAD+

Utilization

HighOxygen

NAD+ Regeneration

Page 14: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

Glycolysis

Xylose degradationPentose phosphate

Aerobic respirationFermentation

TCA CycleEthanol

LowOxygen

Oxygen=0

Page 15: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

Glycolysis

Xylose degradationPentose phosphate

Aerobic respirationFermentation

TCA CycleEthanol

IntermediateOxygen

OptimalEthanol

NADPH &NAD

Utilization

Oxygen=0.5

ATP=2.8

NAD Regeneration

NADPHRegeneration

All O2 used to regenerate

NAD used in first step

Page 16: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

Glycolysis

Xylose degradationPentose phosphate

Aerobic respirationFermentation

TCA CycleEthanol

IntermediateOxygen

OptimalEthanol

NADPH &NAD

Utilization

Oxygen=0.5

ATP=2.8

NAD Regeneration

NADPHRegeneration

All O2 used to regenerate

NAD used in first step

BottleneckPyruvate

decarboxylase

Improve NADHenzyme

Page 17: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

Xylose Lipids

• col-2 (Glucose-6-phosphate dehydrogenase) mutant produces lipids from xylose and glucose– 35x more TAGS than WT– 12% biomass is lipid

• Mapping out the metabolic pathways to explain this phenomenon.

• Lipids for NADPH regeneration?

Page 18: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

Algorithms for Debugging Metabolic Networks

• Metabolic network is too complicated.• The metabolic network is infeasible.• E-flux results in dead model.

Page 19: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

Minimal Organism

• Given a feasible model under the given nutrient conditions, find the fewest number of nonzero fluxes that still results in a viable organism.

minimize card(v)subject to: Sv = 0, l <= v <= u

Page 20: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

Minimal Reaction Adjustment

• Given an infeasible model, find a reaction with the smallest number of reactants and products that results in a feasible model.

minimize card( r )subject to Sv + r = 0l <= v <= u

Page 21: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

Minimal Limit Adjustment

• Given a set of (feasible) baseline limits, and an (infeasible) set of expression-constrained flux limits, find the smallest number of adjustments to the flux limits that results in feasible model (without exceeding the baseline constraints).

minimize card(dl) + card(du)subject to: Sv = 0,l – dl <= v <= u+dul-dl >= l_0, u+ du <= u_0dl >= 0, du >= 0.

Page 22: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

Minimum Cardinality

• Each of these problems is a special case of the minimum cardinality problem

Minimize card( x )Subject to Ax + By >= fCx + Dy >= g• Caveat: the minimum cardinality problem is

NP-Hard!

Page 23: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

Sparse Optimization to the rescue!

• Recent results from Compressed Sensing have shown that minimizing the L1-norm is a decent heuristic

• Iterative methods can improve the results• Instead of minimize card( x ), Minimize norm(diag(w) x, 1) = sum(w_i |x_i| )Update w_i = 1/(epsilon + |x_i|), i=1,…,k

Page 24: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

Implementations

• 3 software packages in matlab– Cvx (Sedumi and spdt3)– Glpkmex (GLPK)– Cplexmex (CPLEX)

• min cardinality– MILP and L1 heuristic version

• min limit adjustment– MILP and L1 heuristic version

• Min limit adjustment => min cardinality

Page 25: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

Acknowle

Page 26: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

AcknowledgementsBroad InstituteJames GalaganBruce Birren

Brian HaasAaron BrandesMatt HennLi Jun MaChristina Cuomo

Carsten Russ

Broad Genome Sequencing Platform

Program ProjectHeather Hood

SRIPeter KarpMarkus KrumenackerSuzanne PaleyMario LatendresseTomer Altman…

Finishing TeamMargaret PriestHarindra ArachchiLynne AftuckMike Fitzgerald

Genome AssemblySarah YoungSean Sykes

Annotation TeamBrian HaasMike KoehrsenQian ZengTom Walk

Page 27: From genomes to Pathway tools and from Pathway Tools to Metabolic Models

Outline

• e354f1709c• ICS Guest network