joseph powell - using the transcriptome to determine the genetic mechanisms of disease...
DESCRIPTION
Epistasis is the phenomenon whereby one polymorphism’s effect on a trait depends on other polymorphisms present in the genome. The extent to which epistasis influences complex traits and contributes to their variation is a fundamental question in evolution and human genetics. Although often demonstrated in artificial gene manipulation studies in model organisms, and some examples have been reported in other species, few examples exist for epistasis among natural polymorphisms in human traits. Its absence from empirical findings may simply be due to low incidence in the genetic control of complex traits, but an alternative view is that it has previously been too technically challenging to detect owing to statistical and computational issues. Here we show, using advanced computation and a gene expression study design, that many instances of epistasis are found between common single nucleotide polymorphisms (SNPs). In a cohort of 846 individuals with 7,339 gene expression levels measured in peripheral blood, we found 501 significant pairwise interactions between common SNPs influencing the expression of 238 genes (P < 2.91 × 10−16). Replicaon of these interacons in two independent data sets showed both concordance of direction of epistatic effects (P = 5.56 × 10−31) and enrichment of interacon P values, with 30 being significant at a conservative threshold of P < 9.98 × 10−5. Forty‐four of the genetic interactions are located within 5 megabases of regions of known physical chromosome interactions (P = 1.8 × 10−10). Epistac networks of three SNPs or more influence the expression levels of 129 genes, whereby one cis‐acting SNP is modulated by several trans‐acting SNPs. For example, MBNL1 is influenced by an additive effect at rs13069559, which itself is masked by trans‐SNPs on 14 different chromosomes, with nearly identical enotype–phenotype maps for each cis–trans interaction. This study presents the first evidence, to our knowledge, for many instances of segregating common polymorphisms interacting to influence human traits. First presented at the 2014 Winter School in Mathematical and Computational Biology http://bioinformatics.org.au/ws14/program/TRANSCRIPT
Common Diseases Epistasis Variance Heterogeneity
Using the transcriptome to determine the geneticmechanisms of disease susceptibility
Joseph Powell
Centre for Neurogenetics and Systems Genomics
Winter School in Mathematical and Computational Biology
Brisbane - 7th July 2014
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Causes of Death - Common Diseases
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Causes of Common Diseases - Genetics + Environment
Environmental factors
Genetics
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
How Do Mutations Influence Disease Susceptibility?
Most genetic variation underlying disease susceptibility is locatedin regulatory regions
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Move To Molecular Phenotypes - Systems Genetics
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
By what means do most DNA variants alter cellular behaviour andultimately contribute to differences in disease susceptibility?
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Genome-wide epistasis Variance heterogeneity
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
What is epistasis?
Definition
The effect on the phenotype caused by locus A depends on thegenotype at locus B....
Epistasis has been reported in model organisms through artificialgene knockouts, artificial line crosses and hybridization.
Our aim was to systematically search for instances of epistasisamongst common variants for genetic variation that has arisen innatural populations.
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Gene expression
Transcription measured forthousands of genes
Typically heritable
Loci (eQTLs) commonlyhave very large effect sizes
Good candidates to searchfor epistasis
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Discovery population
846 healthy individuals
528,509 autosomal SNPs
RNA measured forperipheral blood (IlluminaHT-12v4.0)
Expression levels for 7,339probes
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Computational analysis
Exhaustive SNP x SNPtesting for each probe
epiGPU software and GPUclusters
8 d.f. F-test
Over quadrillion tests
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Design Of The Analysis
SNP x SNP 2 dimension analysis Epistatic genotype by phenotype
Example of genotype by phenotype epistasis map
AA
Aa
aa aa
Aa
AA
0.0
0.2
0.4
0.6
0.8
1.0
SNP1 SNP2
RNA levels
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Correction for multiple testing
Apply a family-‐wise error rate of 5%
Permutation and Bonferroniused to give 5% FWER
Permutation: Single probeFWER; T∗ = 2.13x10−12
Correct for 7,339 probes
Experiment wide FWER;T ∗ /7, 339 = 2.91x10−16
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Removing artifacts
For SNP pairs that passedthe 2.91x10−16 threshold
All 9 genotypes classespresent
Minimum class size of 5
No LD between SNP pairs(r2 < 0.1 and D ′2 < 0.1)
No single loci additive ordominance effects
11,155 pairs carried forward
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Testing for non-additive effects
Nested ANOVA for 11,155pairs
Contrasts full genetic (8d.f.) vs marginal effects (4d.f)
Thus, testing for thecontribution of epistaticvariance
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
More correction for multiple testing
Epistatic effects significantat p < 0.05/11, 155 =4.48x10−6
501 SNP pairs withsignificant interaction terms
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Replication population
Fehrmann
1,240 individuals(Netherlands)
EGCUT
891 individuals (Estonian)
RNA measured forperipheral blood (IlluminaHT-12v3.0)
Genotyped with Illuminaarrays
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Replication tests
Replication of 501 epistatichits using two independentcohorts
Same filtering criteria
434 pairs which could betested in both cohorts
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Functional work
Analyses to elucidate possiblefunctional mechanisms
Non-synonymous mutations
GWAS and known eQTLoverlap
Genome segmentation
Chromosome interactions
Tissue specific transcriptionregions
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Q-Q plots of interaction p-values from replication
The top panel shows all 434discovery SNPs that were testedfor interactions.
The bottom panel shows thesame data as the top panel butexcluding the 30 most significantinteractions
Matched direction of epistaticeffects p = 5.56x10−31
●
●
●●●
●●●
●●
●●
●
●●
●●●●
●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●
●
●
●
●
●
●
●●●
●
●●
●●●
●●●●
●●●●●●●●●
●●●●●
●●●●●●●
●●●●●●●●●●●●
●●●●●●●●●●●
●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●
0
50
100
150
0
1
2
3
4
All
Below
Bonferroni threshold
0 1 2 3Expected
Obs
erve
d
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Interactive Figure: http://kn3in.github.io/detecting_epi/
Yellow dots: A Transcript Grey dots: Epistatic SNPs
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
● ●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Example: Epistatic Control Of CAST Regulation
CAST: Associated with numerousrheumatic diseases
It’s regulation controlled by manyepistatic SNPs throughout thegenome
Novel genetic control of regulation
potentially underlying disease
susceptibility
Chromosomes
Position of CAST Gene Epistatic
interactions
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Functional Mechanisms - Chromosomal Interactions
Chromosome interactionsidentified in K562 cell linesusing Hi-C and ChIP-seq
Red lines = N SNPs within20kb, 500kb, 2Mb and 10Mbof known interacting regions
Histograms = null distributions
0
250
500
750
0
250
500
750
0
250
500
750
0
250
500
750
5Mb
1Mb
250kb10kb
0 10 20 30 40Number of chromosome interactions
Fre
quen
cy
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Conclusions
The first evidence for multiple instances of segregating commonpolymorphisms interacting to influence human traits
Novel computational and statistical frameworks can identify andreplicate epistasis
New knowledge of the control of transcription of many genesimplicated in common disease
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Variance Heterogeneity
Searching for veQTL
There is evidence across several species for genetic control ofphenotypic variation of complex traits
This means that the variance of a phenotype is genotype dependent
Understanding genetic control of variability is important inevolutionary biology, and medicine
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Variance eQTL
What causes them?
Epistatic interactions
Gene by Environment Interactions
Other ..... ?
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
846 healthy individuals
6,011,184 Imputed(autosomal) SNPs
RNA measured forperipheral blood (IlluminaHT-12v4.0)
Expression levels for 17,994probes
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Double Generalized Linear Model
y = µ+ xb + e
e ∼ N(0, σ2e )
Model to detect variance and
mean effects
e ∼ N(0, σ2ei ); log(σ2
e ) = c + xv
y ∼ N(µ+ xb, exp(c + vx))
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
For statistical analysis, theinformational content is the same ifdata is transformed by a monotonicfunction
Provided the three means aredifferent, there is always amonotonic transformation that willtend to equalize the three variances
However, it’s relevance depends on
the biological scale
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Original and normalised scales
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Scale
Initial scale = Z-score frominverse-normal transformation
Adjust original scale using transformationfunctions provided in Sun et al. (2013)
σ2(x) = f (m) = am2 + bm + c (1)
Express variance of x as a function of its meanmx , f (mx )
y = g(x)∞∫
dx√f (x)
(2)
Re-test using dglm
The aim is to remove main driven variance effects
Pictorial representation of four different non-linear
changes to the scale. They represent four classes of
monotonic transformation (Sun et al AJHG 2013).
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Discovery results
Cis - 2.55x10−7
747 veQTL (Potentially havesome main effects)
Only 88 have Pmain > 0.01
Transformations remove theeffects of 655/659 (with maineffects)
Transformations remove theeffects of 1/87 (without maineffects)
Trans - 3.39x10−12
1412 veQTL (Potentially havesome main effects)
619 have Pmain > 0.01
Transformations remove theeffects of 732/793 (with maineffects)
Transformations remove theeffects of 21/619 (withoutmain effects)
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Fehrmann
1,240 individuals(Netherlands)
EGCUT
891 individuals (Estonian)
RNA measured forperipheral blood (IlluminaHT-12v3.0)
Genotyped with Illuminaarrays
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Replication of 7,768 veQTLhits using two independentcohorts
DGLM
Also Bartlett’s and Levene’stests
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Replication Meta-analysis
Just interested in the variance only veQTL
At FDR = 0.05
89 % of cis-veQTL
78 % of trans-veQTL
Same effect direction;
85 % cis and 77 % trans
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Ripple effect across a network
●●
●
●
●●
●
●
−5.0
−2.5
0.0
2.5
0 1 2factor(rs3821224)
ILM
N_1
7983
08
Predicted TF binding sites tagged by expression probes. Blue lines havevariance heterogeneity with rs3821224 (p < 0.05)
Expression levels for a probe in AHSA2 by genotype class of rs3821224
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
GWAS - veSNP overlap
Disease / Trait N veSNPs in GWAS catalogueAsthma 1Autism spectrum disorder 2Bipolar disorder 2Cholesterol, total 3Endometriosis 1HDL cholesterol 2Height 4Inflammatory bowel disease 2LDL cholesterol 6N-glycan levels 4Obesity-related traits 5Parkinson’s disease 6Rheumatoid arthritis 3Schizophrenia 8Sudden cardiac arrest 2Systemic lupus erythematosus 3Triglycerides 7Type 1 diabetes 3Urate levels 2
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Conclusions
Computational and statistical frameworks can identify andreplicate variance heterogeneity for RNA transcripts.
Variance (only) eQTL represent another form of genetic control forphenotypes.
Functional characterization of variance loci suggests a number ofpossible mechanisms that underlie variance heterogeneity.
Joseph Powell Mechanisms of disease susceptibility
Common Diseases Epistasis Variance Heterogeneity
Acknowledgements
Centre for Neurogenetics and Systems Genomics
Gib Hemani
Kostya Shakhbazov
Jian Yang
Allan Mcrae
Peter Visscher
University of Groningen
Harm-Jan Westra
Lude Franke
Dalama University
Lars Ronnegard
QIMR Berghofer
Nick Martin
Anjali Henders
Grant Montgomery
Georgia Institute of Technology
Greg Gibson
University of Tartu
Andres Metspalu
Tonu Esko
Visit us at www.complextraitgenomics.com
Joseph Powell Mechanisms of disease susceptibility