extremes with covariate effects - practicalities and …jonathan/ukx08.pdf · overview data...

31
Overview Data Modelling Design Issues Conclusions References EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND PROBLEMS Philip Jonathan Shell Technology Centre Thornton, Chester, UK. [email protected] Kevin Ewans Shell International Exploration and Production, Rijswijk, NL. [email protected] 24 September 2008 Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Upload: others

Post on 29-Jul-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References

EXTREMES WITH COVARIATE EFFECTS -PRACTICALITIES AND PROBLEMS

Philip JonathanShell Technology Centre Thornton, Chester, UK.

[email protected]

Kevin EwansShell International Exploration and Production, Rijswijk, NL.

[email protected]

24 September 2008

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 2: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References Overview Procedure-I Procedure-II

Overview

Motivation

Regulatory requirements ad-hoc (if not inconsistent) w.r.t.accommodation of covariate effects and estimation of (e.g.)directional and seasonal design values for coastal and oceanstructures.

Statistics literature provides a framework for rational and consistentestimation, but many issues unresolved.

This talk

Analysis procedure with application to North Sea design

Some pressing issues in modelling and interpretation

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 3: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References Overview Procedure-I Procedure-II

Procedure in a nut shell: I

Hindcast data for multiple locations in neighbourhood. Extractstorm peaks (to eliminate temporal dependence) over threshold u.

Assume extremal characteristics of all locations marginally identical,although dependent. Goal is to estimate distribution of n-year returnvalue qn for any single location.

Estimation using NHPP: storm arrival rate µ, GP shape γ and scaleσ.

Accommodate covariate effects: µ, γ, σ and u vary with covariates(e.g. direction, season, time). u estimated before hand as high(local) quantile (sensitivity to threshold choice).

Maximise likelihood, penalised by parameter roughness w.r.t.covariates. Diagnostics for model fit. Cross-validation for optimalroughness. Block bootstrap for parameter uncertainty pointwise.

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 4: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References Overview Procedure-I Procedure-II

Procedure in a nut shell: II

Simulate to estimate properties of qn.

Estimate qn also for partitions w.r.t. covariates. Estimate andaccommodate storm dissipation effects.

Present findings in engineering terms.

Procedure published here:

Effect of combining locations on estimation uncertainty (Jonathan and Ewans 2007b).

Illustrations of extent of covariate effects on extreme quantile estimates (Jonathan et al. 2008).

Modelling directional extremes in the Gulf of Mexico and Northern North Sea (Jonathan and Ewans 2007a, Ewans and Jonathan2008).

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 5: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References Pre-processing EDA (u,ρ)

Data and pre-processing

Significant wave height HS values from a Northern North Seahindcast, for October 1964 to September 1998 inclusive, at 3 hourintervals.

For approximate location of 2◦ longitude, 61◦ latitude, selected 100grid points on 10 x 10 rectangular lattice covering an area ofapproximately 5◦ longitude, 3◦ latitude centred at the location ofinterest.

For each storm period for each grid point, isolated storm peaksignificant wave height, Hsp

S , and corresponding wave direction, θ.

Estimated threshold for extreme value analysis based on localquantile (with direction)

Estimated directional dissipation for a storm (w.r.t. storm peak HS ,with direction)

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 6: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References Pre-processing EDA (u,ρ)

Exploratory analysis

Quantiles of HspS (θ)

Conditional densities of HspS (θ)

HspS for NE and SW

θ for NE and SW

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 7: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References Pre-processing EDA (u,ρ)

Threshold and dissipation

Threshold u(θ)

Given interval Θ of θ, thresholdestimated empirically as quantileof Hsp

S (Θ)

Local median used for modellinghenceforth

Dissipation ρ(∆θ)

For a given storm event,ρ(∆θ) = HS(∆θ)/Hsp

S

Dissipation quantifies influenceof storm on extremes indirections other than θ

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 8: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP-λCV GP-(γ,σ)BS Poisson-I Poisson-II

Generalised Pareto Modelling: I

Given {Xi}ni=1, {θi}ni=1, distribution of storm peaks above variablethreshold u (θ) assumed GP with cdf FXi |θi ,u:

FXi |θi ,u (x) = P (Xi ≤ x |θi , u (θi ))

= 1−(

1 + γ(θi )σ(θi )

(x − u (θi )))− 1

γ(θi )

+

γ and σ vary smoothly with θ, assumed to follow Fourier form∑pk=0

∑2b=1 Aabktb (kθ).

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 9: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP-λCV GP-(γ,σ)BS Poisson-I Poisson-II

Generalised Pareto Modelling: II

Penalised negative log likelihood is l∗:

l∗ =n∑

i=1

li + λ

(Rγ +

1

wRσ

)Unpenalised negative log likelihood is:

li = log σ (θi ) +

(1

γ (θi )+ 1

)log

(1 +

γ (θi )

σ (θi )(Xi − u (θi ))

)+

Roughness of γ is given by:

Rγ =

∫ 2π

0

(∂2γ

∂θ2

)2

dθ =

p∑k=1

πk4

(2∑

b=1

A21bk

)Analogous expression for roughness of σ

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 10: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP-λCV GP-(γ,σ)BS Poisson-I Poisson-II

Cross-validation for roughness

Optimal λ

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 11: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP-λCV GP-(γ,σ)BS Poisson-I Poisson-II

Forms of γ and σ with block bootstrap

γ(θ), σ(θ), with block bootstrap 95% confidence interval

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 12: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP-λCV GP-(γ,σ)BS Poisson-I Poisson-II

Poisson Modelling: I

Non-homogeneous Poisson process model. The negative log-likelihoodwritten:

l(µ, γ, σ) = lN(µ) + lW (γ, σ)

where lN is the (negative) log-density of the total number of exceedances(with rate argument µ), and lW is the (negative)log-conditional-densityof exceedances given a known total number N). Inferences on µ madeseparately from those on γ and σ.The Poisson process log-likelihood, for arrivals at times {ti}ni=1 in periodP0 is:

lN(µ) = −(n∑

i=1

logµ(ti )−∫

P0

µ(t)dt)

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 13: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP-λCV GP-(γ,σ)BS Poisson-I Poisson-II

Poisson Modelling: II

Or approximately (Chavez-Demoulin and Davison 2005):

l̂N(µ) = −(m∑

j=1

cj logµ(jδ)− δm∑

j=1

µ(jδ))

where {cj}mj=1 is the number of occurrences in each of the msub-intervals. WWe estimate storm occurrence rate adopting a Fourier form for Poissonintensity µ as a function of θ, penalising its roughness Rµ:

l̂∗N(µ) = l̂N(µ) + κRµ

Rµ has form analogous to that of Rγ or Rσ. Cross-validation and blockbootstrapping used similarly.

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 14: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References Sectors 100-year-Drn 100-year-Cns Strategies

Defining appropriate directional sectors

Optimal boundaries to reduce within-sector variability

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 15: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References Sectors 100-year-Drn 100-year-Cns Strategies

100-year storm peak cdf using directional γ,σ

HspS100 for directional sectors and omni-directionally

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 16: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References Sectors 100-year-Drn 100-year-Cns Strategies

100-year storm peak cdf using constant γ,σ

HspS100 for directional sectors and omni-directionally

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 17: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References Sectors 100-year-Drn 100-year-Cns Strategies

Design values for different design strategies

Specification of an omni-directional non-exceedance probability doesnot uniquely define sector design values.

Different strategies possible:

Design to omnidirectional HspS100

Design to equal sector non-exceedence probabilityDesign to optimise a specified cost function

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 18: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References Overview Toy-Outline Toy-Case1 Toy-Case2 q(γ) Seasonal

Issues

Generic

Sample size (c.f. estimates required for long return periods)

Measurement (or hindcast) uncertainty (especially for extremevalues)

Temporal dependence (∴ ”storm peak” analysis)

Spatial dependence (∴ block bootstrap)

Model form (e.g. GP versus Weibull ...) and complexity

Transformation of variables (e.g. weighting locally w.r.t. covariate)

Combination of variables (e.g. joint modelling, structural response -based analysis)

Specific

Reflecting model specification and fitting uncertainty in designvalues

Threshold selectionModel stiffnessDissipation

Specification and interpretation of design conditions in engineeringcontext

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 19: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References Overview Toy-Outline Toy-Case1 Toy-Case2 q(γ) Seasonal

Model selection - a toy problem

Two homogeneous directional sectors S1 and S2, extremes areGP-distributed

γ, σ and u values potentially different between sectors

Random sample size 1250 from each sector corresponding to 25 years

Test

H0: γ1 = γ2, σ1 = σ2

HA: γ1 6= γ2, σ1 6= σ2

Cases (γ1,σ1,u1) and (γ2,σ2,u2)

Case 1 (-0.1,2,4) and (-0.3,4,4)Case 2 (-0.1,3,4) and (-0.3,4,6)

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 20: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References Overview Toy-Outline Toy-Case1 Toy-Case2 q(γ) Seasonal

Case 1 - Model selection

Sector densities (theory) Probability of rejecting H0

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 21: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References Overview Toy-Outline Toy-Case1 Toy-Case2 q(γ) Seasonal

Case 1 - Parameter estimates with threshold

HA: S1 estimates with threshold HA: S2 estimates with threshold

H0: estimates with threshold

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 22: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References Overview Toy-Outline Toy-Case1 Toy-Case2 q(γ) Seasonal

Case 1 - Estimated median 100-year maximum

Median 100-year event: true (dot), HA (dash), H0 (full)

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 23: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References Overview Toy-Outline Toy-Case1 Toy-Case2 q(γ) Seasonal

Case 1 - Implications for long return periods

Median P-year event: true (dot), HA (dash),

H0 (full, for thresholds 6,7,...,11m)

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 24: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References Overview Toy-Outline Toy-Case1 Toy-Case2 q(γ) Seasonal

Case 2 - Model selection

Sector densities (theory) Probability of rejecting H0

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 25: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References Overview Toy-Outline Toy-Case1 Toy-Case2 q(γ) Seasonal

Case 2 - Parameter estimates with threshold

HA: S1 estimates with threshold HA: S2 estimates with threshold

H0: estimates with threshold

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 26: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References Overview Toy-Outline Toy-Case1 Toy-Case2 q(γ) Seasonal

Case 2 - Estimating median 100-year maximum

Median 100-year event: true (dot), HA (dash), H0 (full)

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 27: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References Overview Toy-Outline Toy-Case1 Toy-Case2 q(γ) Seasonal

Interpretation: Ratios of extreme quantiles with γ

σ=1. Moments of HspS100 vary with γ

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 28: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References Overview Toy-Outline Toy-Case1 Toy-Case2 q(γ) Seasonal

Interpretation: seasonal extremes

Design values for short-term deployments in Gulf of Mexico

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 29: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References Future

Future work

Model multiple covariate effects (e.g. more general smoothers).

Model spatial and temporal dependence explicitly (e.g. extremequantiles for region rather than single location).

Improved modelling of dissipation effects.

Jointly model multiple variables (wind, waves, current, e.g.Heffernan and Tawn 2004), compare inferences with response-basedapproaches.

Extremes estimates incorporating uncertainties from model andthreshold specification (e.g. predictive distributions).

Models that better exploit the underlying physics (e.g. forhurricanes)

Influence design practice. Regulators (e.g. API) currently reviewingmethods for seasonal and directional design. Bridge industry andacademia, communicate.

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 30: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References Thanks

References

V. Chavez-Demoulin and A.C. Davison. Generalized additive modelling ofsample extremes. J. Roy. Statist. Soc. Series C: Applied Statistics, 54:207, 2005.

K. C. Ewans and P. Jonathan. The effect of directionality on NorthernNorth Sea extreme wave design criteria. J. Offshore Mechanics ArcticEngineering, 2008. (Accepted October 2007. DOI:10.1115/1.2960859).

J. E. Heffernan and J. A. Tawn. A conditional approach for multivariateextreme values. J. R. Statist. Soc. B, 66:497, 2004.

P. Jonathan and K. C. Ewans. The effect of directionality on extremewave design criteria. Ocean Engineering, 34:1977–1994, 2007a.

P. Jonathan and K. C. Ewans. Uncertainties in extreme wave heightestimates for hurricane dominated regions. J. Offshore MechanicsArctic Engineering, 129:300–305, 2007b.

P. Jonathan, K. C. Ewans, and G. Z. Forristall. Statistical estimation ofextreme ocean environments: The requirement for modellingdirectionality and other covariate effects. Ocean Engineering, 35:1211–1225, 2008.

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates

Page 31: EXTREMES WITH COVARIATE EFFECTS - PRACTICALITIES AND …jonathan/UKX08.pdf · Overview Data Modelling Design Issues Conclusions References GP-I GP-II GP- CV GP-(,˙)BS Poisson-I Poisson-II

Overview Data Modelling Design Issues Conclusions References Thanks

Thanks for [email protected]

Jonathan & Ewans, UK Extremes 2008, Lancaster Extremes with covariates