enrichnet : network-based gene set enrichment analysis

22
EnrichNet: network- based gene set enrichment analysis Presenter: Lu Liu

Upload: ura

Post on 24-Feb-2016

59 views

Category:

Documents


0 download

DESCRIPTION

EnrichNet : network-based gene set enrichment analysis. Presenter: Lu Liu. The problem: Functional Interpretation. Identify and assess functional associations between an experimentally derived gene/protein set and well-known gene/protein sets. Agenda. Related research The method - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: EnrichNet : network-based gene set enrichment analysis

EnrichNet: network-based gene set enrichment analysis

Presenter: Lu Liu

Page 2: EnrichNet : network-based gene set enrichment analysis

The problem: Functional Interpretation

• Identify and assess functional associations between an experimentally derived gene/protein set and well-known gene/protein sets

Page 3: EnrichNet : network-based gene set enrichment analysis

Agenda

• Related research

• The method

• The Evaluation

• The results

• The conclusion

Page 4: EnrichNet : network-based gene set enrichment analysis

Related Research

• Over-representation analysis (ORA)

• Gene set enrichment analysis (GSEA)

• Modular enrichment analysis (MEA)

Page 5: EnrichNet : network-based gene set enrichment analysis

Limitations

• ORA tend to have low discriminative power

• Functional information from interaction network disregarded

• Missing annotation gene/protein ignored

• Tissue-specific gene/protein set association often infeasible

Page 6: EnrichNet : network-based gene set enrichment analysis

Agenda

• Related research

• The method

• The Evaluation

• The results

• The conclusion

Page 7: EnrichNet : network-based gene set enrichment analysis

General workflow

• Input gene/protein list(>=10), a database of interest (KEGG etc.)

• Processinggene mapping, score the distance with RWR, compare scores with background model

• Output A pathways/processes ranking table, visualization

of sub-networks

Page 8: EnrichNet : network-based gene set enrichment analysis

Input

Page 9: EnrichNet : network-based gene set enrichment analysis

Output

Page 10: EnrichNet : network-based gene set enrichment analysis

The method

10

0 0

0

0

0

0

0

00

0

0

1

0

0

0

0

0

1

1

1

0

0

.6

0.9.9

.6

.6

.9

.9

.8

.6

.6

.6

.3

.4

.4

.6

.6

.5

.3

.3

.3

.2

.3

.1

Input GeneSet

Pathway 1

Pathway N

……

.

RWR

.6 .6 .6 .6 .5

.4 .3 .2 .1

……

.Pathway 1

Pathway N

Page 11: EnrichNet : network-based gene set enrichment analysis

Algorithm for distance score

Page 12: EnrichNet : network-based gene set enrichment analysis

Relate scores to a background model

• Discretized into equal-sized bins• Quatify each pathway’s deviation from

average

Page 13: EnrichNet : network-based gene set enrichment analysis

Agenda

• Related research

• The method

• The Evaluation

• The results

• The conclusion

Page 14: EnrichNet : network-based gene set enrichment analysis

Evaluation method

• Compare with ORA5 datasets and 2 reference gene sets from literature1. select 100 most DEGs2. get association scores of EnrichNet and ORA3. compute a running-sum statistic for all gene sets

• The consensus of GSEA-derived(SAM-GS, GAGE) pathway ranking as external benchmark pathway ranking

Page 15: EnrichNet : network-based gene set enrichment analysis

Agenda

• Related research

• The method

• The Evaluation

• The results

• The conclusion

Page 16: EnrichNet : network-based gene set enrichment analysis

The results-EnrichNet vs ORA

Page 17: EnrichNet : network-based gene set enrichment analysis

The results-Xd-score vs Q-value

Page 18: EnrichNet : network-based gene set enrichment analysis

The results-comparative validation

Page 19: EnrichNet : network-based gene set enrichment analysis

Protein–protein interaction sub-networks (largest connected components) for target and reference set pairs with small overlap, predicted to be functionally associated by EnrichNet: (a) gastric cancer mutated genes (blue) and genes/proteins from the BioCarta pathway ‘Role

of Erk5 in Neuronal Survival’ (magenta, the shared genes are shown in green); (b) bladder cancer mutated genes (blue) and genes/proteins from Gene Ontology term ‘Tyrosine

phosphorylation of Stat3’ (GO:0042503, magenta; the only shared gene NF2 is shown in green).

Page 20: EnrichNet : network-based gene set enrichment analysis

Protein–protein interaction sub-network (largest connected component) for the PD gene set (blue) and genes/proteins from GO term ‘Regulation of interleukin-6 biosynthetic process’

(magenta, GO:0045408; the only shared gene IL1B is shown in green).

Page 21: EnrichNet : network-based gene set enrichment analysis

The results-tissue specificity

• EnrichSet don’t require additional gene expression measurement data

• Brain tissue: Xd-scores over-representated• Non-Brain tissue: center of Xd-score

distribution significant lower

Page 22: EnrichNet : network-based gene set enrichment analysis

The conclusion

• EnrichNet sometimes has more discriminative power when target sets and pathway set has large overlaps

• EnrichNet can identifies novel function associations through direct and indirect molecular interactions when target sets and pathway set has little overlaps