maximum likelihood (cont.)

21
Maximum likelihood (cont.)

Upload: nero

Post on 05-Jan-2016

38 views

Category:

Documents


2 download

DESCRIPTION

Maximum likelihood (cont.). Midterm. Relatedness – is not sharing a common ancestor but sharing a relatively recent CA Homoplasy includes convergence and/or reversal PTP looks at tree length ; g1 looks at skew in tree length Both introgression and LGT result in a dominant and minor history. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Maximum likelihood (cont.)

Maximum likelihood(cont.)

Page 2: Maximum likelihood (cont.)

Midterm

• Relatedness – is not sharing a common ancestor but sharing a relatively recent CA

• Homoplasy includes convergence and/or reversal

• PTP looks at tree length; g1 looks at skew in tree length

• Both introgression and LGT result in a dominant and minor history

Page 3: Maximum likelihood (cont.)

Midterm

• ILD test looks at the sum of the lengths (or likelihood) of the optimal trees from each partition

• Topology tests evaluate whether the data support one tree better than another. It can be used to evaluate support for a clade or to assess discordance, but that is an application

Page 4: Maximum likelihood (cont.)

Midterm

• Parsimony criterion: Tree that can explain the data with the lowest number of character state changes (weighted by the inferred evidential value of each character state transition)

• Assumptions: Single; independence; branch lengths short and fairly even

Page 5: Maximum likelihood (cont.)

Dig

italis

Loph

osp

ermu

mC

ymb

alaria

Asarin

aL. ca

na

den

sisL. vu

lgaris

Ch

aen

orhin

umM

isop

atesA

ntirrhin

um

Page 6: Maximum likelihood (cont.)

Incomplete lineage sorting

Page 7: Maximum likelihood (cont.)

The cause is the retention of a

polymorphism – does not depend on gene

duplication

A CB

A-B coalescen

ceAB-C coalescence

Split 2

Split 1

Page 8: Maximum likelihood (cont.)

Relationship among models9

4

66

35

2

1

Page 9: Maximum likelihood (cont.)

Site-to-site rate heterogeneity• Two main methods:

– Some proportion of invariant sites

– Rates distributed as a discrete approximation to a gamma distribution

• Both use one parameter – I = proportion of invariant sites– α = shape parameter

Page 10: Maximum likelihood (cont.)
Page 11: Maximum likelihood (cont.)

Nested models

• Simpler models have fewer parameters than more complex models

• Two models are “nested” if all parameters in the simpler model are also in the more complex model– Nested: GTR-HKY; HKY-JC; GTR-JC; – Not nested: F81-K2P; JC+I-HKY

Page 12: Maximum likelihood (cont.)

Which of these pairs are nested?

• HKY-K2P

• GTR+Γ-GTR

• HKY+I-HKY+Γ

• HKY+I-HKY

Page 13: Maximum likelihood (cont.)

In the case of nested models

• Log-likelihood under the simpler model = Ls

• Log-likelihood under the complex model = Lc

• It will always be the case that Lc ≥ Ls

• But how much better can we explain the data under the more complex model?

Page 14: Maximum likelihood (cont.)

Log-Likelihood ratios

• The likelihood ratio is 2(Lc-Ls)

• For nested models the expected LR is distributed as a Χ2 with as many degrees of freedom as the number of extra parameters in the more complex model (kc-ks)

Page 15: Maximum likelihood (cont.)

Hierarchical LR tests

• If the LR is significant under a chi-square then favor the more complex model

• If the LR is not significant then stick with the simpler model

Page 16: Maximum likelihood (cont.)

Akaike Information Criterion

• Another approach to choosing among models

• Can be used even among non-nested models

• Pick the model with the lowest AIC:– k = number of parameters in model– AIC = -2 ln L + 2k

Page 17: Maximum likelihood (cont.)

Relationship between MP and ML

• One argument - MP is inherently nonparametric No direct comparison possible

• MP is an ML model that makes particular assumptions

Page 18: Maximum likelihood (cont.)

Parsimony-like likelihood model(see Lewis 1998 for more)

• Estimate branch-length independently for each character (a VERY complex model)

• Only sum over maximum likelihood ancestral states

Page 19: Maximum likelihood (cont.)

Why use MP

• The model is less realistic, but:– We can do more thorough searches and data

exploration (computational efficiency)– Robust results will usually still be supported

Page 20: Maximum likelihood (cont.)

Why use ML

• The model is explicit

• We can statistically compare alternative models of molecular evolution

• We can conduct parametric statistical tests

Page 21: Maximum likelihood (cont.)

Likelihood based topology test

• Kishino-Hasegawa test

• Likelihood ratio test of zero length branches