(better) bootstrap confidence intervals

21
(Better) Bootstrap Confidence Intervals TAU Bootstrap Seminar 2011 Dr. Saharon Rosset Shachar Kaufman Based on Efron and Tibshirani’s “An introduction to the bootstrap” Chapter 14

Upload: eitan

Post on 12-Feb-2016

75 views

Category:

Documents


0 download

DESCRIPTION

(Better) Bootstrap Confidence Intervals. Shachar Kaufman Based on Efron and Tibshirani’s “An introduction to the bootstrap” Chapter 14. TAU Bootstrap Seminar 2011 Dr. Saharon Rosset. Agenda. What’s wrong with the simpler intervals? The (nonparametric) BC a method - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: (Better) Bootstrap Confidence Intervals

(Better)Bootstrap Confidence Intervals

TAU Bootstrap Seminar 2011Dr. Saharon Rosset

Shachar Kaufman

Based on Efron and Tibshirani’s “An introduction to the bootstrap”

Chapter 14

Page 2: (Better) Bootstrap Confidence Intervals

Agenda

• What’s wrong with the simpler intervals?• The (nonparametric) BCa method• The (nonparametric) ABC method– Not really

Page 3: (Better) Bootstrap Confidence Intervals

Example: simpler intervals are bad

Page 4: (Better) Bootstrap Confidence Intervals

Example: simpler intervals are bad

Under the assumption that i.i.d.

Have exact analytical interval Can do parametric-bootstrap

Under the assumption that i.i.d.

Can do nonparametric bootstrap

Page 5: (Better) Bootstrap Confidence Intervals

Why are the simpler intervals bad?• Standard (normal) confidence interval

assumes symmetry around • Bootstrap-t often erratic in practice– “Cannot be recommended for general nonparametric

problems”• Percentile suffers from low coverage– Assumes nonp. distribution of is representative of (e.g. has

mean like does)• Standard & percentile methods assume homogenous

behavior of , whatever is– (e.g. standard deviation of does not change with )

Page 6: (Better) Bootstrap Confidence Intervals

A more flexible inference model

MeanStandard deviationSkewness

Account for higher-order statistics

�̂�∗

Page 7: (Better) Bootstrap Confidence Intervals

A more flexible inference model• If doesn’t work for the data, maybe we could find a transform

and constants and for which we can accept that

• Additional unknowns– allows a flexible parameter-description scale– allows bias: – allows “” to change with

• As we know, “more flexible” is not necessarily “better”• Under broad conditions, in this case it is (TBD)

Page 8: (Better) Bootstrap Confidence Intervals

Where does this new model lead?

Assume known and , and initially that , hence

Calculate a standard -confidence endpoint from this

Now reexamine the actual stdev, this time assuming that

According to the model, it will be

Page 9: (Better) Bootstrap Confidence Intervals

Where does this new model lead?

Ok but this leads to an updated endpoint

Which leads to an updated

If we continue iteratively to infinity this way we end up with the confidence interval endpoint

Page 10: (Better) Bootstrap Confidence Intervals

Where does this new model lead?

• Do this exercise considering and get

• Similarly for with

Page 11: (Better) Bootstrap Confidence Intervals

Enter BCa

• “Bias-corrected and accelerated”• Like percentile confidence interval– Both ends are percentiles , of the bootstap

instances of – Just not the simple

Page 12: (Better) Bootstrap Confidence Intervals

BCa

• Instead

• and are parameters we will estimate– When both zero, we get the good-old percentile

CI• Notice we never had to explicitly find

Page 13: (Better) Bootstrap Confidence Intervals

BCa

• tackles bias

(since is monotone)

• accounts for a standard deviation of which varies with (linearly, on the “normal scale” )

Page 14: (Better) Bootstrap Confidence Intervals

BCa

• One suggested estimator for is via the jackknife

where

and

• You won’t find the rationale behind this formula in the book (though it is clearly related to one of the standard ways to define skewness)

Page 15: (Better) Bootstrap Confidence Intervals

Theoretical advantages of BCa

• Transformation respecting– If the interval for is then the interval for a

monotone is – So no need to worry about finding transforms of

where confidence intervals perform well • Which is necessary in practice with bootstrap-t CI • And with the standard CI (e.g. Fisher corrcoeff trans.)• Percentile CI is transformation respecting

Page 16: (Better) Bootstrap Confidence Intervals

Theoretical advantages of BCa

• Accuracy– We want s.t. – But a practical is an approximation where

– BCa (and bootstrap-t) endpoints are “second order accurate”, where

– This is in contrast to the standard and percentile methods which only converge at rate (“first order accurate”) errors one order of magnitude greater

Page 17: (Better) Bootstrap Confidence Intervals

But BCa is expensive

• The use of direct bootstrapping to calculate delicate statistics such as and requires a large to work satisfactorily

• Fortunately, BCa can be analytically approximated (with a Taylor expansion, for differentiable ) so that no Monte Carlo simulation is required

• This is the ABC method which retains the good theoretical properties of BCa

Page 18: (Better) Bootstrap Confidence Intervals

The ABC method

• Only an introduction (Chapter 22)• Discusses the “how”, not the “why”• For additional details see Diciccio and Efron

1992 or 1996

Page 19: (Better) Bootstrap Confidence Intervals

The ABC method

• Given the estimator in resampling form

– Recall , the “resampling vector”, is an dimensional random variable with components

– Recall • Second-order Taylor analysis of the estimate – as a function of the bootstrap resampling

methodology

Page 20: (Better) Bootstrap Confidence Intervals

The ABC method

• Can approximate all the BCa parameter estimates (i.e. estimate the parameters in a different way)

– , where

• something akin to a Hessian component but along a specific direction not perpendicular to any natural axis (the “least favorable family” direction)

Page 21: (Better) Bootstrap Confidence Intervals

The ABC method

• And the ABC interval endpoint

• Where– with

• Simple and to the point, aint it?