introduction to the pathway tools software david walsh and simon eng bigdata workshop—may 29, 2010

18
Introduction to the Pathway Tools Software David Walsh and Simon Eng bigDATA Workshop—May 29, 2010

Post on 20-Dec-2015

222 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: Introduction to the Pathway Tools Software David Walsh and Simon Eng bigDATA Workshop—May 29, 2010

Introduction to thePathway Tools Software

David Walsh and Simon EngbigDATA Workshop—May 29, 2010

Page 2: Introduction to the Pathway Tools Software David Walsh and Simon Eng bigDATA Workshop—May 29, 2010

Outline

1. Metabolic annotation from genomic data

2. Pathway Tools and MetaCyc facilitatedgenome/pathway annotation

3. Pathway/genome database (PGDB) exploration

4. Potential applications in environmental ‘omics

Page 3: Introduction to the Pathway Tools Software David Walsh and Simon Eng bigDATA Workshop—May 29, 2010

?!?!

Page 4: Introduction to the Pathway Tools Software David Walsh and Simon Eng bigDATA Workshop—May 29, 2010

Metabolic Annotation Workflow

Automated

Manual

Pathway Tools

Page 5: Introduction to the Pathway Tools Software David Walsh and Simon Eng bigDATA Workshop—May 29, 2010

Pathway Toolsis a software suite for working

with pathway/genome databaseshttp://bioinformatics.ai.sri.com/ptools

Page 6: Introduction to the Pathway Tools Software David Walsh and Simon Eng bigDATA Workshop—May 29, 2010

Pathway/Genome Databases(a.k.a. PGDBs)

integrate genomic data withdetailed functional annotations

(including metabolic pathways)

Page 7: Introduction to the Pathway Tools Software David Walsh and Simon Eng bigDATA Workshop—May 29, 2010

The Three Tiers of PGDBs

Page 8: Introduction to the Pathway Tools Software David Walsh and Simon Eng bigDATA Workshop—May 29, 2010
Page 9: Introduction to the Pathway Tools Software David Walsh and Simon Eng bigDATA Workshop—May 29, 2010

MetaCycis a continuously curated PGDB of

1400+ reference pathways from1800+ organisms

encompassing all domains of lifehttp://www.metacyc.org

Page 10: Introduction to the Pathway Tools Software David Walsh and Simon Eng bigDATA Workshop—May 29, 2010

MetaCyc Statistics by Year

Adapted from http://www.metacyc.org/release-notes.shtml

2001 2003 2005 2007 2009 2010

Metabolic pathways 445 491 692 1010 1436 1471

Reactions 4218 4858 5520 6576 8248 8409

Enzymes 1115 1618 3029 4582 6056 6198

Genes 0 1673 2931 4161 5709 5952

Chemical compounds 2335 3029 4817 6561 8363 8572

Organisms 158 222 601 1077 1834 1861

Citations 2381 3619 8599 15875 21713 22459

Page 11: Introduction to the Pathway Tools Software David Walsh and Simon Eng bigDATA Workshop—May 29, 2010

Novel metabolic pathwayscan be created at will in Pathway Tools

(PGDBs are therefore dynamic)

Page 12: Introduction to the Pathway Tools Software David Walsh and Simon Eng bigDATA Workshop—May 29, 2010

PathoLogicpredicts metabolic pathways in a PGDB

based on reference PGDBs

Page 13: Introduction to the Pathway Tools Software David Walsh and Simon Eng bigDATA Workshop—May 29, 2010

PathoLogicreads GenBank and nucleotide FASTA files

(but it also has a native file format)

Page 14: Introduction to the Pathway Tools Software David Walsh and Simon Eng bigDATA Workshop—May 29, 2010

PathoLogicinfers metabolic pathways

from functional annotationsProduct names

EC numbersGO terms

Page 15: Introduction to the Pathway Tools Software David Walsh and Simon Eng bigDATA Workshop—May 29, 2010

PathoLogicapplies an iterative algorithm

that keeps track ofcandidate and undecided pathways

Page 16: Introduction to the Pathway Tools Software David Walsh and Simon Eng bigDATA Workshop—May 29, 2010

How PathoLogic Works

Page 17: Introduction to the Pathway Tools Software David Walsh and Simon Eng bigDATA Workshop—May 29, 2010

Demonstrationon two rRNA-containing fosmids of

an uncultivated organism

Page 18: Introduction to the Pathway Tools Software David Walsh and Simon Eng bigDATA Workshop—May 29, 2010

Resources

Pathway Tools http://bioinformatics.ai.sri.com/ptools

MetaCyc http://www.metacyc.org