Functional enrichment analysis

Post on 22-Jan-2021

Page 1: Functional enrichment analysis · Contingency table - example - Chi square test Observed Expected O = Observed / E = Expected. in list not in list totals with annotation 12 35 47

Functional enrichment analysis

Enrichment analysis

Does my gene list (e.g. genes up-regulated between two conditions) contain more genes involved in a particular pathway or biological process (e.g. cell cycle) than expected by chance?

• Hypergeometric test
• Fisher's exact test
• Binomial test (Bernoulli)
• Chi-square test

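Enrichment analysis - For lists of genes

[Figure: Venn diagram of the tested genes, with two overlapping sets, "in list" and "with annotation". A = in list and with annotation; B = with annotation, not in list; C = in list, without annotation; D = tested genes in neither set.]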

Contingency table

Observed

                      in list    not in list    totals
with annotation       A          B              A+B
without annotation    C          D              C+D
totals                A+C        B+D            A+B+C+D=N

Expected

                      in list         not in list     totals
with annotation       (A+B)(A+C)/N    (A+B)(B+D)/N    A+B
without annotation    (C+D)(A+C)/N    (C+D)(B+D)/N    C+D
totals                A+C             B+D             N

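Enrichment analysis - For lists of genes

[Figure: Venn diagram of the 183 tested genes: 12 in list and with annotation, 35 with annotation but not in list, 6 in list without annotation, 130 in neither set.]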

Contingency table - example

Observed

                      in list    not in list    totals
with annotation       12         35             A+B
without annotation    6          130            C+D
totals                A+C        B+D            A+B+C+D=N

Expected

                      in list         not in list     totals
with annotation       (A+B)(A+C)/N    (A+B)(B+D)/N    A+B
without annotation    (C+D)(A+C)/N    (C+D)(B+D)/N    C+D
totals                A+C             B+D             N
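As a minimal sketch of how the expected counts follow from the margins, the Python function below applies the (row total × column total) / N rule to the four observed cells. The function name `expected_counts` and its argument order are illustrative assumptions, not part of the slides.

```python
def expected_counts(a, b, c, d):
    """Expected 2x2 counts under independence, computed from the observed margins.

    a = in list, with annotation        b = not in list, with annotation
    c = in list, without annotation     d = not in list, without annotation
    """
    n = a + b + c + d
    row_totals = (a + b, c + d)   # with / without annotation
    col_totals = (a + c, b + d)   # in list / not in list
    # Each expected cell is (row total * column total) / N.
    return [[r * col / n for col in col_totals] for r in row_totals]


# Observed counts from the example: A=12, B=35, C=6, D=130.
for row in expected_counts(12, 35, 6, 130):
    print([round(x, 1) for x in row])
```

With the example counts this prints `[4.6, 42.4]` and `[13.4, 122.6]`, the values used in the expected table of the chi-square example.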

Contingency table - example - Chi square test

Observed

                      in list    not in list    totals
with annotation       12         35             47
without annotation    6          130            136
totals                18         165            183

Expected

                      in list    not in list    totals
with annotation       4.6        42.4           47
without annotation    13.4       122.6          136
totals                18         165            183

O = Observed / E = Expected
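The test can be sketched in a few lines of Python, assuming the usual Pearson statistic (the sum over the four cells of (O − E)² / E) with 1 degree of freedom and no continuity correction. The function name `chi_square_2x2` is hypothetical; only the standard library is used, and the p-value relies on the identity that, for 1 degree of freedom, the upper tail equals `erfc(sqrt(chi2 / 2))`.

```python
import math


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square test on a 2x2 table (no Yates continuity correction)."""
    n = a + b + c + d
    observed = [[a, b], [c, d]]
    row_totals = [a + b, c + d]
    col_totals = [a + c, b + d]
    chi2 = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed[i][j] - expected) ** 2 / expected
    # With 1 degree of freedom, P(X > chi2) = erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


chi2, p_value = chi_square_2x2(12, 35, 6, 130)
print(f"chi2 = {chi2:.2f}, p = {p_value:.2g}")
```

For the example table the statistic comes out at about 17.6, which corresponds to a p-value well below 0.001. The approximation behind the chi-square test is usually considered less reliable when expected counts are small, which is one reason to prefer an exact test.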

Contingency table - example - Fisher’s exact test

Observed

                      in list    not in list    totals
with annotation       12         35             47
without annotation    6          130            136
totals                18         165            183

Expected

                      in list    not in list    totals
with annotation       4.6        42.4           47
without annotation    13.4       122.6          136
totals                18         165            183
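A one-sided Fisher's exact test for enrichment is equivalent to the upper tail of the hypergeometric distribution, so it can be sketched with exact integer arithmetic from the standard library. The function name `fisher_exact_enrichment` is an assumption for illustration, and the sketch covers only the one-sided "more than expected" alternative, not the two-sided test.

```python
from math import comb


def fisher_exact_enrichment(a, b, c, d):
    """One-sided Fisher's exact test (hypergeometric upper tail).

    Returns P(X >= a), where X is the number of annotated genes in a list of
    size a+c drawn without replacement from a+b+c+d tested genes, of which
    a+b carry the annotation.
    """
    n_total = a + b + c + d
    n_annotated = a + b
    n_list = a + c
    k_max = min(n_annotated, n_list)
    # Sum the numerators exactly as integers, then divide once at the end.
    numerator = sum(
        comb(n_annotated, k) * comb(n_total - n_annotated, n_list - k)
        for k in range(a, k_max + 1)
    )
    return numerator / comb(n_total, n_list)


p_value = fisher_exact_enrichment(12, 35, 6, 130)
print(f"one-sided p = {p_value:.2g}")
```

For the example table the one-sided p-value is on the order of 1e-4, so observing 12 annotated genes in a list of 18 is very unlikely when about 4.6 are expected by chance.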

Enrichment analysis - For continuous values

Blue ticks = genes with an annotation

Are the values (e.g. expression values) of the genes involved in a particular pathway or biological process (e.g. cell cycle) biased towards high or low values?
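One simple way to ask this question is a rank-based comparison of annotated versus non-annotated genes. The sketch below uses a Wilcoxon rank-sum test with a normal approximation; this is a stand-in chosen for brevity, not the Gene Set Enrichment Analysis procedure, and the function names and the toy expression values are made up for illustration. The variance formula also ignores any tie correction.

```python
import math


def average_ranks(values):
    """1-based ranks in ascending order; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def rank_sum_test(values, annotated):
    """Return (z, two-sided p); z > 0 means annotated genes tend to rank high."""
    n = len(values)
    n1 = sum(annotated)
    n2 = n - n1
    ranks = average_ranks(values)
    w = sum(r for r, flag in zip(ranks, annotated) if flag)
    mean_w = n1 * (n + 1) / 2
    var_w = n1 * n2 * (n + 1) / 12  # no tie correction in this sketch
    z = (w - mean_w) / math.sqrt(var_w)
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Hypothetical values for 10 genes; the first four carry the annotation.
values = [5.1, 4.8, 4.2, 3.9, 1.0, 0.7, 0.2, -0.3, -1.1, -2.0]
annotated = [True] * 4 + [False] * 6
z, p_value = rank_sum_test(values, annotated)
print(f"z = {z:.2f}, p = {p_value:.3f}")
```

A positive z indicates that the annotated genes sit towards the high end of the ranking, a negative z that they sit towards the low end. The normal approximation is rough for a set this small and is used here only to keep the example short.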

Enrichment analysis - Gene Set Enrichment Analysis

Annotation sources (modules)

• Gene Ontology

– biological process, molecular function, cellular component
– Terms may have >1 “parent” (more general term)
– GO Slim: includes only general categories

• KEGG; REACTOME pathways

• Genes sharing a motif or regulated by the same protein/miRNA (experimental or predicted)

• Genes found on the same chromosome

• Broad’s Molecular Signatures Database (MSigDB)

• [any grouping that is biologically sensible]

Gene Ontology

www.geneontology.org

KEGG pathways

http://www.genome.jp/kegg/

Multiple Test Problem!

Enrichment analysis on multiple annotations (e.g. all Gene Ontology terms / all KEGG pathways) involves multiple statistical tests, one per annotation

The resulting p-values need multiple testing correction
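The slides do not say which correction to use. As a sketch, the Python below implements two common choices, Bonferroni and Benjamini-Hochberg (false discovery rate), on a list of p-values with one entry per tested annotation. The function names and the example p-values are hypothetical.

```python
def bonferroni(pvalues):
    """Bonferroni adjustment: multiply each p-value by the number of tests."""
    m = len(pvalues)
    return [min(1.0, p * m) for p in pvalues]


def benjamini_hochberg(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotone adjusted values.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical p-values, one per annotation tested.
pvalues = [0.01, 0.04, 0.03, 0.005]
print([round(p, 3) for p in bonferroni(pvalues)])
print([round(p, 3) for p in benjamini_hochberg(pvalues)])
```

Bonferroni controls the chance of any false positive and becomes very strict when thousands of terms are tested. Benjamini-Hochberg controls the expected proportion of false positives among the reported terms and is usually less conservative.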

Limitations of functional enrichment analysis

• Annotation databases are incomplete

• Annotation bias in databases: some genes are more studied, and consequently more annotated, than others

• Inconsistent terminology across resources poses a challenge to data integration

• Some pieces of information may be imprecise or incorrect: some annotations in GO are inferred electronically, without any expert human involvement (mainly annotations at very high levels of the ontology).
