an introduction to extreme value theory

Post on 05-Nov-2021

14 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

An Introduction to Extreme Value Theory

Petra Friederichs

Meteorological InstituteUniversity of Bonn

COPS Summer School, July/August, 2007

Applications of EVT

Finance• distribution of income has so called fat tails• value-at-risk: maximal daily lost• re-assurance

Hydrology• protection against flood• Q100: maximal flow that is expected once every 100 years

Meteorology• extreme winds • risk assessment (e.g. ICE, power plants)• heavy precipitation events• heat waves, hurricanes, droughts• extremes in a changing climate

In classical statistics: focus on AVERAGE behavior of stochastic process

central limit theorem

In extreme value theory:focus on extreme and rare events

Fisher-Tippett theorem

What is an extreme?

from Ulrike Schneider

Application

Extreme Value Theory

Block Maximum

for

follows a Generalized Extreme Value (GEV) distribution

Peak over Threshold (POT)

very large threshold u

follow a Generalized Pareto Distribution (GPD)

Poisson-Point GPD Process

combines POT with Poisson point process

M n=max {X 1 , , X n}

M n

n∞

{X i−u∣X iu}

Extreme Value Theory

Block Maximum

Example: station precipitation (DWD) at Dresden

1948 – 2004: 57 seasonal (Nov-March and May-Sept) (maxima over approximately 150 days)

M n=max {X 1 , , X n}

DresdenSummer

Winter

GEV – Fisher-Tippett Theorem

G y=exp −[1y−]−1 /

The distribution of

converges to ( )

which is called the Generalized Extreme Value (GEV) distribution.

It has three parameters

location parameter

scale parameter

shape parameter

M n=max {X 1 , , X n}

n∞

G y=exp −exp −y−

≠0

=0

GEV – Types of Distributions

GEV has 3 types depending on shape parameter

Gumbel

Fréchet

Weibull

=0

G x =exp−exp[−x ]

x=y−

=1 /0

G x =exp−[1x]−

=−1 /0

G x =exp−[1−x]

GEV – Types of Distributions

GEV has 3 types depending on shape parameter

Gumbel

exponential tail

Fréchet

so called „fat tail“

Weibull

upper finite endpoint

=0

x=y−

=1 /0

=−1 /0

GEV – Types of Distribution

Conditions to the sample from which the maxima are drawn

must be independently identically distributed (i.i.d.)

Let be the distribution of

is in the domain of attraction of a Gumbel type GEV iff

for all

Exponential decay in the tail of

{X 1 , , X n}

X i

X iF x

F x

limx∞

1−F xtb x 1−F x

=e−t

t0F x

F(x)p=0.9p=0.99

x0.9 x0.99

GEV – Types of Distribution

Conditions to the sample from which the maxima are drawn

must be independently identically distributed (i.i.d.)

Let be the distribution of

is in the domain of attraction of a Frechet type GEV iff

for all

polynomial decay in the tail of

{X 1 , , X n}

X i

X iF x

F x

limx∞

1−F x 1−F x

=−1/

0F x

GEV – Types of Distribution

Conditions to the sample from which the maxima are drawn

must be independently identically distributed (i.i.d.)

Let be the distribution of

is in the domain of attraction of a Weibull type GEV iff

there exists with

and

for all

has a finite upper end point

{X 1 , , X n}

X i

X iF x

F x

limx∞1−F F−

1 x1−F F−

1x−1

=1/

0F x

F F =1F

F

GEV – return level

zm= − log −log 1−1/m

Of interest often is return level

value expected every m observation (block maxima)

calculated using invers distribution function (quantile function)

can be estimated empirically as the

zm

Prob yzm=1−G yzm=1m

zm= G−11−1m = inf y ∣F y

1m

zm=G−11−

1m = −

1−−log 1−1/m−

DresdenSummer

Winter

Maximum daily precipitation for Nov-March (Winter) and May-Sept (Summer)

Block Maxima

Block Maxima

Block Maxima

Peak over Threshold

Peak over Threshold (POT)

very large threshold u

follow a Generalized Pareto Distribution (GPD)

{X i−u∣X iu}

Daily precipitation for Nov-March (green) and May-Sept (red) Dresden

Peak over Threshold

Peak over Threshold (POT)

The distribution of

exceedances over large threshold u

are asymptotically distributed following a

Generalized Pareto Distribution (GPD)

two parameters

scale parameter

shape parameter

advantage: more efficient use of data

disadvantage: how to choose threshold not evident

Y i :=X i−u∣X iu

H y∣X iu=1−1yu−1/

POT – Types of Distribution

GDP has same 3 types as GEV depending on shape parameter

Gumbel

exponential tail

Pareto (Fréchet)

polynomial tail behavior

Weibull

has upper end point

=0

H y =1−exp−yu

0

1−H y ~c y−1/

0

F=u∣∣

Peak over Threshold

POT: threshold u = 15mm

Poisson Point – GPD Process

A= t 2−t1 [1y−u u

]−1/

Poisson point – GPD process with intensity

on

A= t1, t2 × y ,∞

Estimation and Uncertainty

General concept of estimating parameters from a sample

Maximum Likelihood (ML) Method

Assume are draw from a GEV (GPD,...) with unknown parameters

and PDF

{ yi} , i=1, , n

yi~F y∣ , ,

{ yi}

f y∣ , ,=F ' y∣ , ,

Maximum Likelihood Method

L , ,=∏i=1

nf yi∣ , ,

The likelihood L of the sample is then

It is easier to minimize the negative logarithm of the likelihood

in general there is no analytical solution for the minimum with respect to the parameters

Minimize using numerical algorithms.

The estimates maximize the likelihood of the data.

l , ,=−∑i=1

nlog f yi∣ , ,

, ,

Profile log Likelihood

To Take Home

There exists a well elaborated statistical theory for extreme values.

It applies to (almost) all (univariate) extremal problems.

EVT: extremes from a very large domain of stochastic processes follow one of the three types: Gumbel, Frechet/Pareto, or Weibull

Only those three types characterize the behavior of extremes!

Note: Data need to be in the asymptotic limit of a EVD!

References

Coles, S (2001): An Introduction to Statistical Modeling of Extreme Values. Springer Series in Statistics. Springer Verlag London. 208p

Beirlant, J; Y. Goegebeur; J. Segers; J. Teugels (2005): Statistics of Extremes. Theory and Applications. John Wiley & Sons Ltd. 490p

Embrechts, Küppelberg, Mikosch (1997): Modelling Extremal Events for Insurance and Finance. Springer Verlag Heidelberg.648p

Gumbel, E.J. (1958): Statistics of Extremes. (Dover Publication, New York 2004)

R Development Core Team (2003): R: A language and environment for statistical computing, available at http://www.R-project.org

The evd and ismev Packages by Alec Stephenson and Stuard Coles

top related