lock_jsm_2010.pptx

8/14/2019 Lock_JSM_2010.pptx

1/12

Rerandomization inRandomized Experiments

Kari Lock and Don Rubin

Harvard UniversityJSM 2010

8/14/2019 Lock_JSM_2010.pptx

2/12

The Gold Standard

Why are randomized experiments so good?

They yield unbiased estimates of the treatment effect

They eliminate (?) confounding factors

ON AVERAGE. For any particular experiment,

covariate imbalance is possible (and likely)

8/14/2019 Lock_JSM_2010.pptx

3/12

Rerandomization

Suppose you are doing a randomized experiment andhave covariate information available beforeconducting

the experiment

You randomize to treatment and control, but get a

bad randomization

Can you rerandomize?Yes, but you first need to specify a concrete

definition of bad

8/14/2019 Lock_JSM_2010.pptx

4/12

Randomize subjects totreated and control

Collect covariate dataSpecify a criteria determining whena randomization is unacceptable;

based on covariate balance

(Re)randomize subjectsto treated and control

Check covariate balance

1)

2)

Conduct experiment

unacceptable acceptable

Analyze results with aFisher randomization test

3)

4)

8/14/2019 Lock_JSM_2010.pptx

5/12

Unbiased

To maintain an unbiased estimate of the treatmenteffect, the decision to rerandomize or not must be

automatic and specified in advance

blind to which group is treated

Theorem: If the treated and control groups are the

same size, and if for every unacceptable randomization

the exact opposite randomization is also unacceptable,

then rerandomization yields an unbiased estimate of

the treatment effect.

8/14/2019 Lock_JSM_2010.pptx

6/12

Mahalanobis Distance

Define overall covariate distance by

M = Dr-1D

2Under adequate sample sizes and pure randomization: ~ kM

Dj : Standardized difference between treated andcontrol covariate means for covariatej

k = number of covariates

D= (D1, , Dk)

r= covariate correlation matrix = cov(D)

Choose aand rerandomize when M > a

8/14/2019 Lock_JSM_2010.pptx

7/12

Rerandomization Based on M

Since M follows a known distribution, easy tospecify the proportion of rejected randomizations

M is affinely invariant

Correlations between covariates are maintained

The variance reduction on each covariateis the

same (and known)

The variance reduction for any linear combinationof the covariates is known

8/14/2019 Lock_JSM_2010.pptx

8/12

Rerandomization

Theorem:

If nT= nCand rerandomizationoccurs when M> a, then

| ,

cov co| v

T C

T C T C a

E M a

M a v

X X 0

X X X X

and

1,2 2 2

, is the incomplete gamma function

,2 2

a

k a

vk ak

2va

| 0,

| 1 (1 ) var .r

T C

T C T C a

E M a

M

Y Y

Y Ya vY R Y

8/14/2019 Lock_JSM_2010.pptx

9/12

Differencein CovariateMeans

Difference in Outcome Means

8/14/2019 Lock_JSM_2010.pptx

10/12

Pure Randomization

Re-Randomization

Standardized Differences in Covariate Means

-4 -2 0 2 4

male

age

collgpaa

actcomp

preflit

likelit

likemath

numbmath 0.14

0.15

0.17

0.14

0.16

0.16

0.16

0.15

, ,

, ,

var( |

var(

)

)

j T j C

j T j C

X X aT

X X

(theoretical va= .16)

8/14/2019 Lock_JSM_2010.pptx

11/12

Pure Randomization

Re-Randomization

var( | .57var(

))

T C

T C

Y Y aY Y

T

(theory = .58)

-1.0 -0.5 0.0 0.5 1.0

Equivalent to

increasing the

sample size by a

factor of 1.7

Difference in Outcome Means Under Null

8/14/2019 Lock_JSM_2010.pptx

12/12

Conclusion

Rerandomization improves covariate balancebetween the treated and control means, and

increases precision in estimating the treatment effect

if the covariates are correlated with the response

Rerandomization gives the researcher more power

to detect a significant result, and more faith that an

observed effect is really due to the treatment

[email protected]

lock_jsm_2010.pptx

Documents