open source glmm tools: concordia

50
Precursors GLMMs Results Conclusions References Open-source tools for estimation and inference using generalized linear mixed models Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology 3 July 2011 Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology Open-source GLMMs

Upload: ben-bolker

Post on 11-May-2015

3.621 views

Category:

Technology


0 download

DESCRIPTION

Talk on open source tools (mostly R, some BUGS and ADMB). Slightly different from actual presentation.

TRANSCRIPT

Page 1: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Open-source tools for estimation and inferenceusing generalized linear mixed models

Ben Bolker

McMaster UniversityDepartments of Mathematics & Statistics and Biology

3 July 2011

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 2: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Outline

1 PrecursorsDefinitionsExamplesChallenges

2 GLMMsEstimationInference

3 ResultsCoral symbiontsGlyceraArabidopsis

4 ConclusionsConclusions

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 3: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Definitions

Outline

1 PrecursorsDefinitionsExamplesChallenges

2 GLMMsEstimationInference

3 ResultsCoral symbiontsGlyceraArabidopsis

4 ConclusionsConclusions

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 4: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Definitions

Definitions

Fixed effects (FE) Predictors where interest is in specific levels

Random effects (RE) Predictors where interest is in distributionrather than levels (blocks)5

Mixed models Statistical models with both FEs and REs

Linear mixed models Linear effects, normal responses, normal REs

Generalized linear models Linearizable effects, exponential-familyresponses, normal REs (on linearized scale)

Generalized linear mixed models GLMMs = LMMs + GLMs

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 5: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Definitions

Definitions

Fixed effects (FE) Predictors where interest is in specific levels

Random effects (RE) Predictors where interest is in distributionrather than levels (blocks)5

Mixed models Statistical models with both FEs and REs

Linear mixed models Linear effects, normal responses, normal REs

Generalized linear models Linearizable effects, exponential-familyresponses, normal REs (on linearized scale)

Generalized linear mixed models GLMMs = LMMs + GLMs

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 6: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Definitions

GLMMs

Distributions from exponential family(Poisson, binomial, Gaussian, Gamma, NegBinom(k), . . . )

Means = linear functions of predictorson scale of link function (identity, log, logit, . . . )

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 7: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Definitions

GLMMs (cont.)

Linear predictor:η = Xβ + Zu

Random effects:u ∼ MVN(0,Σ)

Response:

Y ∼ D(g−1η, φ

)(φ often ≡ 1)

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 8: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Definitions

Marginal likelihood

Likelihood (Prob(data|parameters)) — requires integrating overpossible values of REs to get marginal likelihood e.g.:

likelihood of i th obs. in block j is L(xij |θi , σ2w )

likelihood of a particular block mean θj is L(θj |0, σ2b)

marginal likelihood is∫L(xij |θj , σ2

w )L(θj |0, σ2b) dθj

Balance (dispersion of RE around 0) with (dispersion of dataconditional on RE)

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 9: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Definitions

Marginal likelihood

Likelihood (Prob(data|parameters)) — requires integrating overpossible values of REs to get marginal likelihood e.g.:

likelihood of i th obs. in block j is L(xij |θi , σ2w )

likelihood of a particular block mean θj is L(θj |0, σ2b)

marginal likelihood is∫L(xij |θj , σ2

w )L(θj |0, σ2b) dθj

Balance (dispersion of RE around 0) with (dispersion of dataconditional on RE)

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 10: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Definitions

Bayesian solution?

Bayesians should not feel smug: they are stuck with thenormalizing constant

Posterior(β,θ,Σ) =Prior(β,θ,Σ)L(xij |β,θ)L(θ|Σ)∫∫∫

(. . .)dβ dθ dΣ(!!)

and similar issues with marginal posteriors

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 11: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Examples

Outline

1 PrecursorsDefinitionsExamplesChallenges

2 GLMMsEstimationInference

3 ResultsCoral symbiontsGlyceraArabidopsis

4 ConclusionsConclusions

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 12: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Examples

Coral protection by symbionts

none shrimp crabs both

Number of predation events

Symbionts

Num

ber

of b

lock

s

0

2

4

6

8

10

1

2

0

1

2

0

2

0

1

2

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 13: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Examples

Environmental stress: Glycera cell survival

H2S

Cop

per

0

33.3

66.6

133.3

0 0.03 0.1 0.32

Osm=12.8Normoxia

Osm=22.4Normoxia

0 0.03 0.1 0.32

Osm=32Normoxia

Osm=41.6Normoxia

0 0.03 0.1 0.32

Osm=51.2Normoxia

Osm=12.8Anoxia

0 0.03 0.1 0.32

Osm=22.4Anoxia

Osm=32Anoxia

0 0.03 0.1 0.32

Osm=41.6Anoxia

0

33.3

66.6

133.3

Osm=51.2Anoxia

0.0

0.2

0.4

0.6

0.8

1.0

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 14: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Examples

Arabidopsis response to fertilization & clipping

panel: nutrient, color: genotype

Log(

1+fr

uit s

et)

0

1

2

3

4

5

unclipped clipped

●●●●● ●

●●

●●

●●

●●

●●●

●●● ●

● ●

●●

●●

● ●

●●

●●

●●

●●

●●

●●

●●

●●

●●●

●●

●●●

● ●

●●

●●

● ●

●●

●●

● ●

●●● ●● ●

●●

●●

●●

●● ●

●●

● ●●● ●●● ●●● ●●● ●● ●● ●● ●

● ●● ●●● ●●●●

●●

● ●

●●●●●● ●●

●●●

● ●

: nutrient 1

unclipped clipped

●●●

●●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

● ●

●●

●●●

●●

●●

●●

●●

●●

●●

●●

● ●●

● ●●

●●

●●

●●

●●

●●

●●

●●

● ●

●●

●●●● ●● ●●● ●●●● ●

●●

●●●●●●● ●●● ●●

●●

●●

●●●● ●●●●●●●

●●

: nutrient 8

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 15: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Challenges

Outline

1 PrecursorsDefinitionsExamplesChallenges

2 GLMMsEstimationInference

3 ResultsCoral symbiontsGlyceraArabidopsis

4 ConclusionsConclusions

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 16: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Challenges

Data challenges: estimation

Small # RE levels (<5–6) [modes at zero]

Crossed REs [unusual setup]

Spatial/temporal correlation structure (“R-side” effects)

Overdispersion

Unusual distributions (Gamma, negative binomial . . . )

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 17: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Challenges

Data challenges: computation

Large n (of course)

Multiple REs (dimensionality)

Crossed REs

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 18: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Challenges

Inference

Any departures from classical LMMs

Small N (<40)

Small n

Inference on components of Σ (boundary effects, df)

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 19: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Challenges

RE examples

Coral symbionts: simple experimental blocks, RE affectsintercept (overall probability of predation in block)

Glycera: applied to cells from 10 individuals, RE again affectsintercept (cell survival prob.)

Arabidopsis: region (3 levels, treated as fixed) / population /genotype: affects intercept (overall fruit set) as well astreatment effects (nutrients, herbivory, interaction)

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 20: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Estimation

Outline

1 PrecursorsDefinitionsExamplesChallenges

2 GLMMsEstimationInference

3 ResultsCoral symbiontsGlyceraArabidopsis

4 ConclusionsConclusions

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 21: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Estimation

Penalized quasi-likelihood (PQL)

alternate steps of estimating GLM using known RE variancesto calculate weights; estimate LMMs given GLM fit2

flexible (e.g. spatial/temporal correlations)

biased for small unit samples (e.g. counts < 5, binary orlow-survival data)

widely used: SAS PROC GLIMMIX, R MASS:glmmPQL

marginal models: generalized estimating equations(geepack, geese)

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 22: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Estimation

Penalized quasi-likelihood (PQL)

alternate steps of estimating GLM using known RE variancesto calculate weights; estimate LMMs given GLM fit2

flexible (e.g. spatial/temporal correlations)

biased for small unit samples (e.g. counts < 5, binary orlow-survival data)

widely used: SAS PROC GLIMMIX, R MASS:glmmPQL

marginal models: generalized estimating equations(geepack, geese)

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 23: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Estimation

Penalized quasi-likelihood (PQL)

alternate steps of estimating GLM using known RE variancesto calculate weights; estimate LMMs given GLM fit2

flexible (e.g. spatial/temporal correlations)

biased for small unit samples (e.g. counts < 5, binary orlow-survival data)

widely used: SAS PROC GLIMMIX, R MASS:glmmPQL

marginal models: generalized estimating equations(geepack, geese)

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 24: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Estimation

Penalized quasi-likelihood (PQL)

alternate steps of estimating GLM using known RE variancesto calculate weights; estimate LMMs given GLM fit2

flexible (e.g. spatial/temporal correlations)

biased for small unit samples (e.g. counts < 5, binary orlow-survival data)

widely used: SAS PROC GLIMMIX, R MASS:glmmPQL

marginal models: generalized estimating equations(geepack, geese)

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 25: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Estimation

Laplace approximation

approximate marginal likelihood

for given β, θ find conditional modes by penalized, iteratedreweighted least squares; then use second-order Taylorexpansion around the conditional modes

more accurate than PQL

reasonably fast and flexible

lme4:glmer, glmmML, glmmADMB, R2ADMB (AD Model Builder)

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 26: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Estimation

(adaptive) Gauss-Hermite quadrature (AGHQ)

as above, but compute additional terms in the integral(typically 8, but often up to 20)

most accurate

slowest, hence not flexible (2–3 RE at most, maybe only 1)

lme4:glmer, glmmML, gamlss.mx:gamlssNP, repeated

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 27: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Estimation

Variations

Hierarchical GLMS (hglm, HGLMMM)

Monte Carlo methods: MCEM1, MCMLE (bernor)18,sequential MC (pomp), data cloning (dclone)

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 28: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Estimation

Bayesian approaches

Monte Carlo approaches: MCMC (Gibbs sampling,Metropolis-Hastings, etc.)

slow but flexible

makes marginal inference easy

must specify priors, assess convergence

specialized: glmmAK, MCMCglmm9, INLA

general: BUGS (glmmBUGS, R2WinBUGS, BRugs, WinBUGS,OpenBUGS, R2jags, rjags, JAGS)

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 29: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Estimation

Extensions

Overdispersion Variance > expected from statistical model

Quasi-likelihood approaches: MASS:glmmPQL

Extended distributions (e.g. negative binomial):glmmADMB, gamlss.mx:gamlssNPObservation-level random effects (e.g.lognormal-Poisson): lme4

Zero-inflation Overabundance of zeros in a discrete distribution

zero-inflated models: glmmADMB, MCMCglmmhurdle models: MCMCglmm

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 30: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Inference

Outline

1 PrecursorsDefinitionsExamplesChallenges

2 GLMMsEstimationInference

3 ResultsCoral symbiontsGlyceraArabidopsis

4 ConclusionsConclusions

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 31: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Inference

Wald tests/CIs

Easy (e.g. typical results of summary): assume quadraticsurface, based on information matrix @ MLE

always approximate, sometimes awful (Hauck-Donner effect)

often bad for variance estimates

available from most direct-maximization packages

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 32: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Inference

Likelihood ratio tests/profile confidence intervals

Model comparison is relatively easy

Profiling is expensive — and not (yet) available . . . (lme4a forLMMs)

in GLM(M) case, numerator is only asymptotically χ2 anyway:Bartlett corrections3;4, higher-order asymptotics: cond

[neither extended to GLMMs!]

OK if N − n, N � 40?

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 33: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Inference

Conditional F tests

What if scale parameter (φ) is estimated(e.g. Gaussian, Gamma, quasi-likelihood) ?

In classical LMMs, −2 log L ∼ F (ν1, ν2)

For non-classical LMMs (unbalanced, crossed, R-side) orGLMMs, ν2 poorly defined:Kenward-Roger, Satterthwaite approximations12;16

unimplemented except in SAS (partially in Genstat)

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 34: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Inference

Tests/CIs of variances [boundary problems]

LRT depends on null hypothesis being within the parameter’sfeasible range6;13

violated e.g. by H0 : σ2 = 0

In simple cases null distribution is a mixture of χ2

distributions (e.g. 0.5χ20 + 0.5χ2

1: emdbook:dchibarsq)

simulation-based testing: RLRsim

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 35: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Inference

Information-theoretic approaches

Above issues apply, but less well understood:7;8

AIC is asymptotic

“corrected” AIC (AICc)10 derived for linear models, widelyused but not tested elsewhere14

For comparing models with different REs,or for AICc , what is p? conditional AIC:8;19 (cAIC) (level offocus issue: see also Deviance Information Criterion (DIC,17)

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 36: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Inference

Bootstrapping

1 fit null model to data

2 simulate “data” from null model

3 fit null and working model, compute likelihood difference

4 repeat to estimate null distribution

confidence intervals?

simulate/refit methods; bootMer in lme4a (LMMs only!)

> pboot <- function(m0, m1) {

s <- simulate(m0)

2 * (logLik(refit(m1, s)) - logLik(refit(m0, s)))

}

> replicate(1000, pboot(fm2, fm1))

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 37: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Inference

Bayesian inference

Marginal highest posterior density intervals (or quantiles)

Computationally “free” with results of stochastic Bayesiancomputation

Easily extended to prediction intervals etc. etc.

Post hoc Markov chain Monte Carlo sampling available forsome packages (glmmADMB, R2ADMB, eventually lme4a)

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 38: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Inference

Bottom line

Large data: computation slow (maximization methodsfastest), inference easy (asymptotics)

Bayesian computation slow, inference easy (posterior samples)

Small data: computation fast

RE variances may be poorly estimated/set to zero (upcoming:penalty/prior term in blmer within arm)inference tricky, may need bootstrapping

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 39: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Coral symbionts

Coral symbionts: comparison of results

Regression estimates−6 −4 −2 0 2

Symbiont

Crab vs. Shrimp

Added symbiont

GLM (fixed)GLM (pooled)PQLLaplaceAGQ

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 40: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Glycera

Glycera fit comparisons

Effect on survival (logit)

−60 −40 −20 0 20 40 60

Osm

Cu

H2S

Anoxia

Osm:Cu

Osm:H2S

Cu:H2S

Osm:Anoxia

Cu:Anoxia

H2S:Anoxia

Osm:Cu:H2S

Osm:Cu:Anoxia

Osm:H2S:Anoxia

Cu:H2S:Anoxia

Osm:Cu:H2S:Anoxia

MCMCglmmglmer(OD:2)glmer(OD)glmmMLglmer

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 41: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Glycera

Glycera: MCMCglmm fit

Effect on survival

−20 −10 0 10 20 30

H2SCu

Osm

Oxygen

Cu : H2S

Osm : Oxygen

Cu : Oxygen

Osm : H2S

H2S : OxygenOsm : Cu

Osm : Cu : H2S

Cu : H2S : Oxygen

Osm : H2S : Oxygen

Osm : Cu : Oxygen

Osm : Cu : H2S : Oxygen

main effects

2−way

3−way

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 42: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Glycera

Parametric bootstrap results

True p value

Infe

rred

p v

alue

0.001

0.005

0.01

0.05

0.1

0.5

0.001

0.005

0.01

0.05

0.1

0.5

Osm

H2S

0.001 0.0050.01 0.05 0.1 0.5

Cu

Anoxia

0.001 0.0050.01 0.05 0.1 0.5

variable

normal

t7

t14

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 43: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Arabidopsis

Arabidopsis: AIC comparison of RE models

∆AIC

clip(gen) X clip(popu)clip(gen) X nut(popu)clip(gen) X int(popu)

nut(gen) X clip(popu)nut(gen) X nut(popu)nut(gen) X int(popu)int(gen) X clip(popu)int(gen) X nut(popu)int(gen) X int(popu)

int(popu)nointeract

0 2 4 6

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 44: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Arabidopsis

Arabidopsis: fits with and without nutrient(genotype)

Regression estimates−1.0 −0.5 0.0 0.5 1.0 1.5

nutrient8

amdclipped

rack2

statusPetri.Plate

statusTransplant

nutrient8:amdclipped

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 45: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Conclusions

Primary tools

lme4: multiple/crossed REs, (profiling): fast

MCMCglmm: Bayesian, very flexible

glmmADMB: negative binomial, zero-inflated etc.

Flexible tools:

AD Model Builder (and interfaces)BUGS/JAGS (and interfaces)INLA15

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 46: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Conclusions

Outlook

Computation: faster algorithms, parallel computation

Inference: mostly computational?

Implementation: extensions (e.g. L1-penalized approaches11),consistency (profile, simulate, predict)

Benefits & costs of staying within the GLMM framework

Benefits & costs of diversity

More info: http://glmm.wikidot.com

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 47: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Conclusions

Acknowledgements

Data: Josh Banta and Massimo Pigliucci (Arabidopsis);Adrian Stier and Sea McKeon (coral symbionts); CourtneyKagan, Jocelynn Ortega, David Julian (Glycera);

Co-authors: Mollie Brooks, Connie Clark, Shane Geange, JohnPoulsen, Hank Stevens, Jada White

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 48: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

[1] Booth JG & Hobert JP, 1999. Journal of theRoyal Statistical Society. Series B, 61(1):265–285.doi:10.1111/1467-9868.00176. URL http://

links.jstor.org/sici?sici=1369-7412(1999)

61%3A1%3C265%3AMGLMML%3E2.0.CO%3B2-C.

[2] Breslow NE, 2004. In DY Lin & PJ Heagerty,eds., Proceedings of the second Seattlesymposium in biostatistics: Analysis of correlateddata, pp. 1–22. Springer. ISBN 0387208623.

[3] Cordeiro GM & Ferrari SLP, Aug. 1998. Journalof Statistical Planning and Inference,71(1-2):261–269. ISSN 0378-3758.doi:10.1016/S0378-3758(98)00005-6. URLhttp://www.sciencedirect.com/science/

article/B6V0M-3V5CVRT-M/2/

190f68a684dd08c569a7836ff59568e4.

[4] Cordeiro GM, Paula GA, & Botter DA, 1994.International Statistical Review / RevueInternationale de Statistique, 62(2):257–274.ISSN 03067734. doi:10.2307/1403512. URLhttp://www.jstor.org/stable/1403512.

[5] Gelman A, 2005. Annals of Statistics, 33(1):1–53.doi:doi:10.1214/009053604000001048.

[6] Goldman N & Whelan S, 2000. Molecular Biologyand Evolution, 17(6):975–978.

[7] Greven S, 2008. Non-Standard Problems inInference for Additive and Linear Mixed Models.Cuvillier Verlag, Gottingen, Germany. ISBN

3867274916. URL http://www.cuvillier.de/

flycms/en/html/30/-UickI3zKPS,3cEY=

/Buchdetails.html?SID=wVZnpL8f0fbc.

[8] Greven S & Kneib T, 2010. Biometrika,97(4):773–789. URL http:

//www.bepress.com/jhubiostat/paper202/.

[9] Hadfield JD, 2 2010. Journal of StatisticalSoftware, 33(2):1–22. ISSN 1548-7660. URLhttp://www.jstatsoft.org/v33/i02.

[10] HURVICH CM & TSAI C, Jun. 1989. Biometrika,76(2):297 –307.doi:10.1093/biomet/76.2.297. URLhttp://biomet.oxfordjournals.org/content/

76/2/297.abstract.

[11] Jiang J, Aug. 2008. The Annals of Statistics,36(4):1669–1692. ISSN 0090-5364.doi:10.1214/07-AOS517. URL http:

//projecteuclid.org/euclid.aos/1216237296.

[12] Kenward MG & Roger JH, 1997. Biometrics,53(3):983–997.

[13] Molenberghs G & Verbeke G, 2007. TheAmerican Statistician, 61(1):22–27.doi:10.1198/000313007X171322.

[14] Richards SA, 2005. Ecology, 86(10):2805–2814.doi:10.1890/05-0074.

[15] Rue H, Martino S, & Chopin N, 2009. Journal ofthe Royal Statistical Society, Series B,71(2):319–392.

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 49: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Conclusions

[16] Schaalje G, McBride J, & Fellingham G, 2002.Journal of Agricultural, Biological &Environmental Statistics, 7(14):512–524. URLhttp://www.ingentaconnect.com/content/

asa/jabes/2002/00000007/00000004/art00004.

[17] Spiegelhalter DJ, Best N et al., 2002. Journal ofthe Royal Statistical Society B, 64:583–640.

[18] Sung YJ, Jul. 2007. The Annals of Statistics,35(3):990–1011. ISSN 0090-5364.doi:10.1214/009053606000001389. URLhttp:

//projecteuclid.org/euclid.aos/1185303995.Mathematical Reviews number (MathSciNet):MR2341695; Zentralblatt MATH identifier:1124.62009.

[19] Vaida F & Blanchard S, Jun. 2005. Biometrika,92(2):351–370.doi:10.1093/biomet/92.2.351. URLhttp://biomet.oxfordjournals.org/cgi/

content/abstract/92/2/351.

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs

Page 50: Open source GLMM tools: Concordia

Precursors GLMMs Results Conclusions References

Conclusions

Extras

Spatial and temporal correlation (R-side effects):MASS:glmmPQL (sort of), GLMMarp, INLA;WinBUGS, AD Model Builder

Additive models: amer, gamm4, mgcv

Penalized methods11

Ben Bolker McMaster University Departments of Mathematics & Statistics and Biology

Open-source GLMMs