comparative genomics ii: functional comparisons caterino and hayes, 2007

36
Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Upload: eric-hunt

Post on 24-Dec-2015

216 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Comparative Genomics II:Functional comparisons

Caterino and Hayes, 2007

Page 2: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Overview

I. Comparing genome sequences• Concepts and terminology• Methods

- Whole-genome alignments

- Quantifying evolutionary conservation (PhastCons, PhyloP, GERP)

- Identifying conserved elements

• Utility and limitations of conservation• Available datasets at UCSC

II. Comparative analyses of function• Evolutionary dynamics of gene regulation• Case studies• Insights into regulatory variation within and across species

Page 3: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Functional variation within and among species

Human

Chimp

Rhesus

Mouse

Page 4: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Modularity of developmental gene expression

forebrain

gene A

Brain TFs

neural tube

gene A

Neural TFs

limb

Limb TFs

gene A

Regulatory changes introduce variance without disrupting protein function

Regulatory variation contributes to human phenotypic variation

overall

Page 5: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Lettice et al. Hum Mol Genet 12:1725 (2003) Sagai et al. Development 132:797 (2005)

Regulatory mutations affecting pleiotropic genes cause discrete developmental changes

Page 6: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Neutral Constrained Directional

Patterns of selection on gene expression and regulation

Romero et al., Nat Rev Genet. 13:505 (2012)

Page 7: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Comparative approaches to identify conserved andvariant regulatory functions

Visel and Pennacchio, Nat Genet 42:557 (2010)

Regulatory conservation

Regulatory rewiring

Page 8: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Furey and Sethupathy, Science 2013

Genetic drivers of gene regulatory variation

Page 9: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

• H3K4me2• H3K27ac

• H3K4me2• H3K27ac

Comparative analysis of ChIP-seq datasets

Human

Mouse

Compare TF binding, histone modifications,DNase hypersensitivity in equivalent tissues

Requires a statistical framework to reliably quantify changes inChIP-seq signals

Page 10: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

•Input data are noisy: ChIP-seq, RNA-seq data are signal based, subjectto considerable experimental variation

•Using comparable biological states within and across species(e.g., human liver vs. mouse liver) = variation across tissues?

•How do epigenetic states and gene expression diverge among individuals and across species (Neutral? Constrained?)

•Can we identify variants or substitutions that drive regulatory changes?

Issues in comparative functional genomics

Page 11: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

•10 human lymphoblastoid cell lines3 major population groups: European, East Asian, Nigerian

9 females, 1 male

9 analyzed by HapMap and 1000 Genomes

Science 328: 232 (2010)

•Targets:

RNA Polymerase II

NFkB

Page 12: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

NFkB

PolI

IPair

wis

e d

iffere

nce

in

bin

din

g

Fra

cti

on

of

reg

ion

sb

ou

nd

# individuals

Variation in TF binding is common

Page 13: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Science 342: 747 (2013)

•10 human lymphoblastoid cell lines1 population group (Nigerian)

All analyzed by HapMap and 1000 Genomes

•Targets:

RNA Polymerase II

H3K4me1, H3K4me3, H3K27ac, H3K27me3

DNase hypersensitivity

Page 14: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Measuring allelic imbalance in histone modification profiles

G allele

T allele

Need to map reads reliably to individual alleles

ChIP-seq reads

Allelicimbalance

Page 15: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Cis-quantitative trait loci

~1200 identified

Page 16: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Science 328: 1036 (2010)

•Targets:

CCAAT/enhancer binding protein a (CEBPA)

Hepatocyte nuclear factor 4a (HNF4A)

Essential for normal liver development and function

•Tissue:

Adult liver from 4 mammal species plus chicken

Page 17: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Lineage-specific gain and loss of CEBPA binding in liver

Lineage-specific: 0 bp overlap in multiple species alignment

Page 18: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Widespread variation in CEBPA binding in mammals

Page 19: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Widespread variation in CEBPA binding in mammals

Page 20: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Cell 154: 530 (2013)

Page 21: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Enhancer-associatedhistone modification

Single TF binding events may not indicate regulatory function

• Many TFs are present at high concentrationsin the nucleus

• TF motifs are abundant in the genome

• Single TF binding events may be incidental

Page 22: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Combinatorial TF binding events are more conserved

Page 23: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Many TF binding changes do not have obvious genetic causes

In mammalian liver:

Page 24: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Many TF binding changes do not have obvious genetic causes

In mouse liver:

Page 25: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Human

Rhesus

Mouse

Bud stage; digitspecification Digit separation

Cell 154: 185 (2013)

Page 26: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Identifying human-lineage changes in promoter and enhancer function

• Compare H3K27ac signal at orthologous sites

• ‘Stable marking’: 1.5-fold or less change in H3K27ac among human, rhesus and mouse

• Human gain: require significant, reproducible gain in human versus all 12 datasets in rhesus and mouse

Page 27: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Mapping active promoters and enhancers in human limb

ENCODE cell lines

H3K27ac

Page 28: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Gains in promoter and enhancer activity

• Bone morphogenesis• Chondrogenesis• Digit malformations in mouse

Page 29: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Human-specific H3K27ac marking correlates with changes inenhancer function

Page 30: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Epigenetic signatures reflect tissue identity and species relationships

H3K27ac signal in human and mouse

Primate

Mouse

H3K27ac in human, rhesus, mouse

Page 31: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

• Human• Chimpanzee• Bonobo• Gorilla• Orangutan• Macaque• Mouse• Opossum• Platypus• Chicken

• Custom gene models based on Ensembl + RNA-seq• 5,636 1:1 orthologs in amniotes• 13,277 1:1 orthologs in primates• Only constitutive exons

Nature 478: 343 (2011)

Page 32: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Global patterns of gene expression differences

Page 33: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Gene expression recapitulates species phylogenies

Page 34: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Gene expression divergence rates are tissue-specific

liver

testis

brain

Page 35: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

Gene expression divergence increases with evolutionary time

Conservation of core organ functions restricts divergence

Page 36: Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007

•Comparative functional genomics identifies regulatory differenceswithin and among species

•TF binding is variable within species and highly variable among species

•Epigenetic comparisons provide more insight into biologicallyrelevant regulatory diversity and divergence

•Gene regulation and expression diverges with increasingphylogenetic distance – they mirror neutral expectation

Summary