introduction to microarray analysis and tools · user access, roles, security - - + barcode...

72
Introduction to microarray analysis and tools Module B: Survey of Microarray Analysis Tools Commercial Tools Agnes Viale, Ph.D. Genomics Core lab MSKCC

Upload: others

Post on 21-Dec-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Introduction to microarrayanalysis and tools

Module B: Survey of Microarray Analysis ToolsCommercial Tools

Agnes Viale, Ph.D.Genomics Core lab

MSKCC

Page 2: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Microarray assay life cycle

BiologicalQuestion

SamplePreparation

MicroarrayHybridization

Microarray Detection

Data Analysis& modeling

M.Shena and R. Davis,MIcroarray biochip technology

Page 3: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Plan

I- GeneChip Operating System (GCOS)

II- Genespring

III-Submission to public repository

IV- NetAffx

Page 4: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Affymetrix GeneChip-Definitions

5’ 3’600bp

PMMM

Page 5: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Software name

– MicroArray Suite 4.0 (MAS 4.0) = Empirical algorithm

– Microarray Suite 5.0 (MAS 5.0) = Statistical algorithm

– Genechip Operating System (GCOS)

Page 6: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Data comparability

• Affy arrays

1995 1998 2001 20032.0 platform

•Redesign the oligos•Change probe set names

•Keep same name•Change the manufacturingprocess

•Redesign the oligos•Change probe set names

• Software

MAS 4.0 MAS 5.0 GCOS

•New Algorithm •Same algorithm•Different data management

GenechipSoftware

•New Algorithm

Page 7: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Signal intensity

1- Genechip software: use all pairs

∑Α∈

−Α

=j

jj MMPMAvDiff )(1 A: probe pairs selected by the

software

2- MAS 4.0: excluded outlier pairs: PM-MM values that were more than 3 SDfrom the mean PM-MM value

- not robust average- negative Average difference if MM>PM

3- MAS5.0: weighted mean of avg.(PM-MM)- Probe intensities preprocessed for global background.- PM-IM intensities are log transformed- Robust mean of probe set values taken using Tukey Biweight.

)}{log(BiweightTukey *jj MMPMsignal −=

Page 8: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

MAS4.0/MAS5.0

MAS 5.0 MAS 4.0

Detection p-valueChange p-value

Page 9: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

MAS 5.0/GCOS

+--Centralized Data Sharing

+--User Access, Roles, Security

+++-Barcode Support, Automation

++-MIAME Standard Template

++-Publishing to AADM Database

++-Manage & Associate Projects,Experiments, Samples & Data

+++Gene Expression Data Analysis(Statistical Algorithm (CHP)

-++Instrument Control / Data Acquisition

GCOSServer

GCOSClient

MAS

Page 10: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Raw files-comparison files

.Exp

.DAT

.CEL

.CHP

.CHP

Raw CHP file B Raw CHP file A

Page 11: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

GCOS

Software status window

Data window

Files window

Page 12: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

.Exp file

Page 13: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

.DAT file

DAT= scan

Page 14: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Using the .DAT file

1- To identify defective arrays

Page 15: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Using the .DAT file

1- To identify image problem

Page 16: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

.DAT file

Page 17: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

.CEL file

.CEL= Computerized version of the .DAT file CEL file is used to generate the .CHP file

Page 18: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Raw . CHP file

Page 19: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

comparison . CHP file

Page 20: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

.RPT file

• Data set QC

GeneChip built-in control 1: % present genes

GeneChip built-in control 2: 3’/5’ratio for “house keeping” genes

Right click a .CHP file

=> Report (.RPT) file

Page 21: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

.RPT file

Page 22: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

.RPT file

Page 23: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Access to probe cell information

Page 24: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Generate an comparison file

- Drag and drop the experimental CEL file- Choose the baseline file- Enter the output file name

Page 25: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Scatter plot

2X up

2X down

Page 26: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Scatter plot

“Background box”

Page 27: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Next stepsData export in ExcelData export to third party software

Applied Maths, GenExplore™ :BioDiscovery, GeneSight:GeneData AG -Expressionist. LION Bioscience AG'sMolecular Applications Group, Stingray™.MolecularWare, Inc.: ArrayAnalyzerDBPartek, Inc., Partek Pro 2000Rosetta Inpharmatics. Resolver™Scanalytics, Inc. , MicroArray SuiteSilicon Genetics' GeneSpringTMSpotfire, Inc., .Media Cybernetics, Array-Pro(R).Microarray Software developed by Stanford UniversityTIGR (The Institute for Genome Research) offers software tools (free foracademic institutions) for array analysis.

OmniViz, Inc., OmniViz Pro Xpogen Inc., PathlinX

Page 28: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Plan

I- GeneChip Operating System (GCOS)

II- Genespring

III- NetAffx

Page 29: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Plan

I- Introdution and potential applications of array platformII- Existing platformsIII- Experimental design

IV- Steps involved in data analysisData set QCNormalizationFeature (gene) filteringReplicate analysisClusteringStatistical testsPathway

Page 30: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Genespring interface

Page 31: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Choice of “genome”

Page 32: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Data import

TXT files from a Genechiparray or from a spottedarray or from any othertype of array as long asyou have a “signal”associated with a identifier(gene, transcript, protein,other)

Page 33: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Samples information

• Sample-centric system (not experiment centric)

• Sample attributes format is MIAME compliant

Minimum information about a microarray experiment (MIAME)-toward standards for microarray data.Nat Genet. 2001 Dec;29(4):365-71.

MIAME goal : to specify the minimum information that must bereported about a microarray experiment in order to ensure itsinterpretability, as well as potential verification of the results

• MIAME format required for microarray data publication

Page 34: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

1. Experimental design: the set of the hybridisation experiments as a whole2. Array design: each array used and each element (spot) on the array3. Samples: samples used, the extract preparation and labeling4. Hybridizations: procedures and parameters5. Measurements: images, quantitation, specifications6. Controls: types, values, specifications

Hybridisation ArraySample

Analysis

Experiment Normalisation

6 parts in MIAME

MIAME

Page 35: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Samples information

• Sample-centric system (not experiment centric)

• MIAME compliant sample attributes format

Page 36: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Experiment parameters

Parameters can be used for gene filtering with a statistical test

Page 37: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Gene filtering

Page 38: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Gene filtering

Page 39: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Gene lists

Page 40: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Gene lists

Page 41: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Union/Intersection of gene lists

Venn Diagram

Page 42: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Union/Intersection of gene lists

Venn Diagram

Page 43: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Statistical analysis

Page 44: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Statistical analysis output

•Venn diagram•Clustering•Pathway analysis•…

Page 45: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Clustering tools in GS

Hierarchical clusteringExperiments and samples

Page 46: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Projection methods

•Principal component analysis (PCA)•Multi-Dimensional Scaling (MDS)•Not clustering methods but can beused to determine or visualize clusterstructure if present

Page 47: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Microarray assay life cycle

BiologicalQuestion

SamplePreparation

MicroarrayHybridization

Microarray Detection

Data Analysis& modeling

M.Shena and R. Davis,MIcroarray biochip technology

Page 48: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

PLAN

I- GeneChip Operating System (GCOS)

II- Genespring

III-Submission to public repository

IV- NetAffx

Page 49: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Data submission to public repository

Do you submit your data to MIAME compliant microarray public database?Response % Response Total

Always 9.70% 6sometimes 19.40% 12Only if requested by publisher 38.70% 24never 33.90% 21

Total Respondents 62

Which database are you submitting your data to?Response % Response Total

GeneExpression Omnibus- (GEO-NIH) 43.50% 27Array Express (EMBL) 29% 18Other (please specify) 33.90% 21

Total Respondents 62

Page 50: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Data submission to GEO

3 steps process:

1- Submission of theplatform (Array type)

Page 51: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Data submission to GEO

3 steps process:

1- Submission of theplatform (Array type)

2- Submission of thesamples (MIAME)

ID_REF VALUE DETECTION Detection p-valueAFFX-MurIL2_at 13.4 A 0.953518AFFX-MurIL10_at 17.3 A 0.843268AFFX-MurIL4_at 18.1 A 0.749204AFFX-MurFAS_at 15.8 A 0.425962AFFX-BioB-5_at 730.6 P 0.001593AFFX-BioB-M_at 1952.8 P 0.000044AFFX-BioB-3_at 1267.6 P 0.000147AFFX-BioC-5_at 3155.5 P 0.00007AFFX-BioC-3_at 2296.3 P 0.000052AFFX-BioDn-5_at 2987.8 P 0.000044AFFX-BioDn-3_at 16968.8 P 0.00006AFFX-CreX-5_at 31299.5 P 0.000044AFFX-CreX-3_at 47550 P 0.000044AFFX-BioB-5_st 117.8 A 0.165861AFFX-BioB-M_st 155 A 0.108979AFFX-BioB-3_st 179.6 A 0.327079

Page 52: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Data submission to GEO

3 steps process:

1- Submission of theplatform (Array type)

2- Submission of thesamples (MIAME)

3- Submission of a“serie”( experiment)

Page 53: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Plan

I- GeneChip Operating System (GCOS)

II- Genespring

III-Submission to public repository

IV- NetAffx

Page 54: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

NetAffx

Def: comprehensive resource of functional annotations and public database

Page 55: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

NetAffx

Def: comprehensive resource of functional annotations and public database

Accession number

Page 56: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Access to NetAffx

Free registrationUpdated every quarter

Page 57: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Quick query input

Key wordGene symbolPublic DB numberProbe set name

Page 58: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Quick query output

GO: Gene Ontology Pathway information

Page 59: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Pathway Diagram

Page 60: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Quick query output

Page 61: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Detailed information

•Genechip Array Information•Probe design information•Genomic Alignment of target sequence•Public domain and Genome references•Functional annotations•Sequence

Page 62: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Genechip Array Information

Page 63: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Probe design/ Genomic Alignment

Link to UC Santa Cruz Genome Browser

Page 64: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Public domain

Page 65: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Functional Annotations

Page 66: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Sequence information

Page 67: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Batch query

Page 68: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Batch query

Page 69: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Batch query output

Export data to ExcelGene ontology brower

Page 70: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Gene ontology browser

Page 71: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

•DNA microarrayBowtell, Sambrook, CSHL

•A Biologist's Guide to Analysis of DNA Microarray Data Steen Knudsen

•http://ihome.cuhk.edu.hk/%7Eb400559/array.html

•DNA Microarray (genome chip) Leming Shihttp://www.gene-chips.com/

Useful links and lectures

Page 72: Introduction to microarray analysis and tools · User Access, Roles, Security - - + Barcode Support, Automation - + ++ MIAME Standard Template - + + Publishing to AADM Database -

Conclusion

Proteomics

Human Genetics(Genotyping)

ClinicalDatabase

Genomics

Basic ResearchAnimal Models of Human Cancer

Pathway Analysis

GLOBAL UNDERSTANDING OF MOLECULAR BASIS OF CANCER