bes/sfe talk 2014

25
A unified framework to combine disperate data types in species distribution modelling A unified framework to combine disperate data types in species distribution modelling Slides on Slideshare: http://www.slideshare.net/oharar/bessfe-talk-2014 Bob O’Hara 1 Petr Keil 2 Walter Jetz 2 1 BiK-F, Biodiversity and Climate Change Research Centre Frankfurt am Main Germany Twitter: @bobohara 2 Department of Ecology and Evolutionary Biology Yale University New Haven, CT, USA

Upload: bob-ohara

Post on 07-Jul-2015

690 views

Category:

Science


2 download

DESCRIPTION

Talk on poit processes and distribution models

TRANSCRIPT

Page 1: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

A unified framework to combine disperate datatypes in species distribution modelling

Slides on Slideshare:http://www.slideshare.net/oharar/bessfe-talk-2014

Bob O’Hara1 Petr Keil 2 Walter Jetz2

1BiK-F, Biodiversity and Climate Change Research CentreFrankfurt am Main

GermanyTwitter: @bobohara

2Department of Ecology and Evolutionary BiologyYale University

New Haven, CT, USA

Page 2: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

A ”Real” Curve

0 20 40 60 80 100

020

4060

80

A Curve

Page 3: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

Approximated with a Discretised Curve

0 20 40 60 80 100

020

4060

80

A CurveDiscrete

Page 4: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

Better: linear interpolation

0 20 40 60 80 100

020

4060

80

A CurveDiscreteInterpolated

Page 5: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

With more points, the approximations improve

0 20 40 60 80 100

020

4060

80

A CurveDiscreteInterpolated

Page 6: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

What does this have to do with distribution models?

Page 7: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

What does this have to do with distribution models?

This is how SDMs see the world:

source: http://bit.ly/1l8sG7M

Map produced by Peter Blancher, Science and Technology Branch, Environment Canada, based on data from the

North American Breeding Bird Survey

Page 8: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

DiscretionProjected maps look like this:

−3.0

−2.5

−2.0

−1.5

−1.0

−0.5

0.0

0.5

Page 9: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

Can we be less discrete?This might be better:

−3.0

−2.5

−2.0

−1.5

−1.0

−0.5

0.0

0.5

Page 10: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

Point Processes: Model

−3.0

−2.5

−2.0

−1.5

−1.0

−0.5

0.0

0.5

Model of where individuals are Ahigher intensity means moreindividuals

Page 11: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

Pointless Processes?

How do we use this approach?

Page 12: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

Need to use talk about sausage

Lincolnshire Sausage! by Caro Wallis, https://www.flickr.com/photos/carowallis1/806455819

Page 13: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

Point processes

Represent density by a surface, with intensity λ(s)Number in area A follows Poisson distribution with mean µ(A):

µ(A) =

∫Aλ(s)ds

Raw Blood Sausage by Carlos Lorenzo, https://www.flickr.com/photos/carlos lorenzo/447670758

Page 14: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

Intense Models

Model the expected density through the intensity:

log(λ(s)) =∑

βjXj(s) + ν(s)

Xj(s) are covariates (also continuous in space)ν(s) is a residual spatial term

Gastrocast #91 by Neal Foley, https://www.flickr.com/photos/86571141@N00/345595467

Page 15: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

Point Processes: Model

In practice, approximate the surface with triangles

Page 16: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

How about the data?

Pig faces by Karen,

https://www.flickr.com/photos/unsureshot/167930176

Presence only points: thinnedpoint processAbundance: PoissonPresence/Absence: binomial,cloglog(large) areas:

Pr(n(A) > 0) = 1− e∫A eρ(ξ)dξ

Page 17: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

Abundance to Presence

N ∼ Poisson(λ)

so

Pr(N > 0) = 1− Pr(N = 0) = 1− λ0exp(−λ)0!

= 1− exp(−λ)

or

cloglog(Pr(N > 0)) = log(λ)

i.e. we just use a GLM with a cloglog link function

Page 18: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

Fitting models in PracticeNeed a sausage grinder

Fat Grinder by Danielle Harms,

https://www.flickr.com/photos/daniellemharms/5597688960

R + INLA

Page 19: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

An Example

Data from MoL

Picus tridactylus by Biodiversity Heritage Library,

https://www.flickr.com/photos/biodivlibrary/9514435393

Page 20: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

Data

eBird GBIF (not eBird)

BBS Expert Range

Page 21: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

We run the model

Sausage machine by clement127, https://www.flickr.com/photos/clement127/15004844674

Page 22: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

Predicted Distribution

−0.05

−0.04

−0.03

−0.02

−0.01

0.00

0.01

Page 23: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

Predicted Distribution from Each Data Set

All Data

−0.05

−0.04

−0.03

−0.02

−0.01

0.00

0.01

eBird

−0.0150

−0.0145

−0.0140

−0.0135

−0.0130

−0.0125

−0.0120

GBIF

−0.0150

−0.0148

−0.0146

−0.0144

−0.0142

−0.0140

−0.0138

−0.0136

BBS

−100

−80

−60

−40

−20

Page 24: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

We can make a better sausage

Combining data is betterDon’t lose positional informationBut still have to approximate a surface

Nakki by hugovk, https://www.flickr.com/photos/hugovk/27828598

Page 25: BES/SfE talk 2014

A unified framework to combine disperate data types in species distribution modelling

We can make a better sausage

Great Marbling! by Gabriel Bucataru, https://www.flickr.com/photos/gabstero/4660842166