an overview of bioinformatics

47
An overview of Bioinformatics

Upload: marlie

Post on 11-Jan-2016

25 views

Category:

Documents


2 download

DESCRIPTION

An overview of Bioinformatics. Cell and Central Dogma. Source: “Post-genome Informatics” by M Kanehisa. Source: “Post-genome Informatics” by M Kanehisa. Deduction and Analogy. Biological System (Organism) Reductionistic Synthetic - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: An overview  of Bioinformatics

An overview of

Bioinformatics

Page 2: An overview  of Bioinformatics

Cell and Central Dogma

Page 3: An overview  of Bioinformatics

Source: “Post-genome Informatics” by M Kanehisa

Page 4: An overview  of Bioinformatics

Source: “Post-genome Informatics” by M Kanehisa

Page 5: An overview  of Bioinformatics

Deduction and Analogy

Page 6: An overview  of Bioinformatics

Biological System(Organism)

Reductionistic SyntheticApproach Approach

(Experiments) (Bioinformatics)

Building Blocks(Genes/Molecules)

Source: “Post-genome Informatics” by M Kanehisa

Page 7: An overview  of Bioinformatics

Principles Known

Physics Chemistry Biology

Matter Compound Organism

Elementary Elements GenesParticles

Yes Yes No

Source: “Post-genome Informatics” by M Kanehisa

Page 8: An overview  of Bioinformatics

Searching and learning problems in biologyMethods in Informatics

Pairwise sequence alignment Optimization algorithms

Database search for similar sequences -Dynamic programmingMultiple sequence alignment -Simulated annealing

Phylogenetic tree reconstruction -Genetic alogrithmProtein 3D structure alignment -Hopfield neural network

RNA secondary structure prediction -Gibbs samplingRNA 3D structure prediction -Monte Carlo

Protein 3D sturcture prediction

Motif extraction Pattern recognition and learning algorithmFunctional site prediction -Discriminant analysisCellular localization prediction -Heirarchical neural networkCoding region prediction -Hidden Markov Model

Transmembrane segment prediction -Formal Grammar

Protein secondary structure prediction

Protein 3D sturcture predictionSuperfamily classification Clustering algorithm

Ortholog/paralog grouping of genes -Heirarchical cluster analysis

3D fold classification -Kohonen neural network

Gene Expression Clustering -Self Organization Map3D fold classification -Kohonen neural networkNetwork comparison -Graph theoryPathway construction -Network theoryDynamic analysis of network -Control theoryControl and design of system -System theory

Interaction and Pathway

Problems in Biology

Similarity search

Molecular classificatoin

Structure/function

prediction

ab initioprediction

Knowledgebased

prediction

Source: “Post-genome Informatics” by M Kanehisa

Page 9: An overview  of Bioinformatics

Sequence Comparison: Sequence Comparison: Algorithms and ApproachesAlgorithms and Approaches

Page 10: An overview  of Bioinformatics

Homology Search

New sequence

Similar sequences

Expert knowledge

Sequence interpretation

Sequence database(Primary data)

retrieval

Source: “Post-genome Informatics” by M Kanehisa

Page 11: An overview  of Bioinformatics

Pairwise sequence alignment by dynamic programming

Needleman Wunsch alogrithmSource: “Post-genome Informatics” by M Kanehisa

Page 12: An overview  of Bioinformatics

Database Search

for Similar Sequencesfor Similar Sequences

Page 13: An overview  of Bioinformatics

Web LabWeb Lab

Page 14: An overview  of Bioinformatics

MotifMotif

Page 15: An overview  of Bioinformatics

Source: “Introduction to Protein Structure” by Branden & Tooze

Page 16: An overview  of Bioinformatics

Web LabWeb Lab

Page 17: An overview  of Bioinformatics

Motif Search

New sequence Expert knowledge

Sequence interpretation

Sequence database(Primary data)

Motif library(Empirical rules)

inference

Source: “Post-genome Informatics” by M Kanehisa

Page 18: An overview  of Bioinformatics

Introduction toIntroduction to

Structural BiologyStructural Biology

Page 19: An overview  of Bioinformatics

Source: “Introduction to Protein Structure” by Branden & Tooze

Page 20: An overview  of Bioinformatics

Source: “Introduction to Protein Structure” by Branden & Tooze

Page 21: An overview  of Bioinformatics

Web LabWeb Lab

Page 22: An overview  of Bioinformatics

Web LabWeb Lab

Page 23: An overview  of Bioinformatics

Web LabWeb Lab

Page 24: An overview  of Bioinformatics

Genome ProjectGenome Project

Page 25: An overview  of Bioinformatics
Page 26: An overview  of Bioinformatics

Web LabWeb Lab

Page 27: An overview  of Bioinformatics

Genome SequencingGenome Sequencing

and and

Genome AnnotationGenome Annotation

Page 28: An overview  of Bioinformatics

A general model of the structure of genomic sequences

Source

: “Bio

info

rmatics” b

y D

W M

ount

Page 29: An overview  of Bioinformatics

MicroarrayMicroarray

Page 30: An overview  of Bioinformatics

Joe Sutliff for Science 291 p1224 (2001)

What kind of solution Genomics can provide with ? High Throughput Gene Discovery

Page 31: An overview  of Bioinformatics

165 genes are up-regulated in 75% tumors(MAPK pathway, APC, promotion of mitosis; 69 unknown)

170 genes are down-regulated in 65% tumors (hepatocyte-specific gene products, retinoid metabolism; 75 unknown)

Hierarchical ClusteringK-meansSelf Organization MapSupport Vector Single Value Decomposition

Page 32: An overview  of Bioinformatics

Gene Expression Gene Expression

andand

TranscriptomeTranscriptome

Page 33: An overview  of Bioinformatics
Page 34: An overview  of Bioinformatics
Page 35: An overview  of Bioinformatics

Web LabWeb Lab

Page 36: An overview  of Bioinformatics

Proteomics Proteomics

and and

Functional GenomicsFunctional Genomics

Page 37: An overview  of Bioinformatics

Source: “Post-genome Informatics” by M Kanehisa

Page 38: An overview  of Bioinformatics

Web LabWeb Lab

Page 39: An overview  of Bioinformatics

Integrative GenomicsIntegrative Genomics

Page 40: An overview  of Bioinformatics
Page 41: An overview  of Bioinformatics

Network of physical interactions between nuclear proteins

Page 42: An overview  of Bioinformatics

Attributes of generic network structures

Page 43: An overview  of Bioinformatics
Page 44: An overview  of Bioinformatics

Virtual Cell

Living Cell

PerturbationEnvironmental changeGene disruptionGene overexpression

Dynamic ResponseChanges in:Gene expression profiles,Etc.

BiologicalKnowledgeMolecular and CellularBiology,Biochemistry,Genetics, etc

Basic PrinciplesPractical Applications

Complete Genome Sequences

Source: “Post-genome Informatics” by M Kanehisa

Page 45: An overview  of Bioinformatics

Take Home Message

Define the biological problem.

Why is bioinformatics important ?A synthesis approach.

Prediction is a dangerous game. Always try your best to validate in the bench side.

The devil is in the detail. Always try different bioinformatic tools and databases.

Your knowledge rests on your own practice.

Page 46: An overview  of Bioinformatics

Reference Books you will find useful:

Bioinformatics -sequence and genome analysis by D W Mount

Introduction to Bioinformatics by A M Lesk

Post-genome Informatics by M Kanehisa

Page 47: An overview  of Bioinformatics

Evolution of molecular biology databases

Database category Data content Examples

1. Literature database Bibliographic citations MEDLINE(1971)On-line journals

2. Factual Database Nucleic acid sequences GenBank(1982)Amino acid sequences EMBL(1982)3D molecular structures DDBJ(1984)

SWISS_PROT(1986)PDB(1971)

3. Knowledge base Motif libraries PROSITE(1988)Molecular classification SCOP(1994)Biochemical pathways KEGG(1995)