methods to control for confounding - protect home · applying the propensity score 35 • options:...

63
Methods to control for confounding - Introduction & Overview - Nicolle M Gatto 18 February 2015

Upload: others

Post on 22-May-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Methods to control for confounding - Introduction & Overview -

Nicolle M Gatto

18 February 2015

Page 2: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Learning Objectives

At the end of this confounding control overview, you will be able to:

• Understand how confounding arises in pharmacoepidemiology and the problem it poses for causal inference

• Differentiate (exposure probability) weighting approaches to confounding control, including propensity score control, matching, and stratification, and inverse probability weighting

• Relate these weighting approaches to randomization

• List key assumptions necessary for these approaches to estimate causal effects

• Distinguish between point treatment and time-dependent confounding

2

Page 3: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Outline

• Ideal comparisons for causal inference

• How confounding impedes causal inference

• The gold standard

• Mimicking ideal comparisons without randomization:

– Typical multivariate regression

– Propensity score methods

– Inverse probability weighting (IPW)

• Practical considerations

• Take home messages

3

Page 4: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Aim of comparative

pharmacoepidemiology studies

4

Does drug A protect against outcome Z?

Does drug A cause outcome Y?

? A Z

? A Y

Here, the exposure of interest, A, is a “point-treatment”

Page 5: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

The essential problem

5

Drug A

L1

L2

• Outside the context of study, many reasons people take one medication over another

…Li

Page 6: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

The essential problem

6

Drug A Adverse effect Y

…Li

?

L1

L2

• These reasons are often related to outcomes of interest

Page 7: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

The essential problem

7

Drug A Adverse effect Y

…Li

?

L1

L2

• Confounding: When risk factors for the disease are associated with use of the medication of interest

Page 8: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

The essential problem

8

Drug A Adverse effect Y

L

?

• For simplicity let’s consider L as the totality of confounding

• “Point-treatment” confounding

Page 9: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

The ideal (hint: it’s not the RCT)

9

Target Population N

N Y+if A+ (Y+if A+ / N)

N Y+if A- (Y+if A- / N)

A type of counterfactual comparison

= RDcausal

Page 10: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

The ideal (hint: it’s not the RCT)

10

• We’ve defined RDcausal (for now) as the effect – with all else equal - in the total population if everyone was exposed versus no one was exposed

• But at least one of these conditions is counter-to-fact

• Instead we use substitutes for the ideal risks

Page 11: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

11

A+ Y+in A+ (Y+in A+ / A+)

A- Y+in A- (Y+in A- / A-)

The real world substitute

In target Population N, people become A+ or A-

An observable estimate

N

= RDcrude

f1

1-f1

Page 12: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

12

A+ Y+in A+ (Y+in A+ / [f1N])

A- Y+in A- (Y+in A- / A-)

The real world substitute

In target Population N, people become A+ or A-

An observable estimate

N

= RDcrude

f1

1-f1

Page 13: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

(Y+in A- / [(1-f1)N])

= RDcrude

13

A+ Y+in A+ (Y+in A+ / [f1N])

A- Y+in A-

The real world substitute

In target Population N, people become A+ or A-

An observable estimate

N

f1

1-f1

Page 14: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

(Y+in A- / [(1-f1)N])

= RDcrude

14

A+ Y+in A+ ([f2f1Y+if A+] / [f1N])

A- Y+in A-

The real world substitute

In target Population N, people become A+ or A-

An observable estimate

N

f1

1-f1

Page 15: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

([f3(1-f1)Y+if A- ]/

[(1-f1)N])

= RDcrude

15

A+ Y+in A+ ([f2f1Y+if A+] / [f1N])

A- Y+in A-

The real world substitute

In target Population N, people become A+ or A-

An observable estimate

N

f1

1-f1

Page 16: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

([f3(1-f1)Y+if A- ]/

[(1-f1)N])

≠ RDcrude

16

A+ Y+in A+ ([f2f1Y+if A+] / [f1N])

A- Y+in A-

The real world substitute

In target Population N, people become A+ or A-

With confounding

by L

N

RDcausal

f1

1-f1

Page 17: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Possible solutions

• Try to avoid confounding through design, e.g.

– Randomize

– Match

– Restrict

• Try to control confounding through analysis, e.g.:

– Traditional analysis methods

– Techniques that weight by causes of the treatment (aka mimic randomization)

– Techniques that weight by causes of the outcome

17

Page 18: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

18

A+ Y+in A+ (Y+in A+ / A+)

A- Y+in A- (Y+in A- / A-)

An observable estimate

N

Why randomization is “the gold standard”

Target Population N, randomized 1:1 to A+ or A-

= RDcrude

f1

1-f1

Page 19: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

19

A+ Y+in A+ (Y+in A+ / 0.5N)

A- Y+in A- (Y+in A- / A-)

An observable estimate

N

Why randomization is “the gold standard”

Target Population N, randomized 1:1 to A+ or A-

= RDcrude

f1

1-f1

Page 20: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

20

A+ Y+in A+ (0.5Y+if A+ / 0.5N)

A- Y+in A- (Y+in A- / A-)

An observable estimate

N

Why randomization is “the gold standard”

Target Population N, randomized 1:1 to A+ or A-

= RDcrude

f1

1-f1

Page 21: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

21

A+ Y+in A+ (0.5Y+if A+ / 0.5N)

A- Y+in A- (0.5Y+if A- / 0.5N)

An observable estimate

N

Why randomization is “the gold standard”

Target Population N, randomized 1:1 to A+ or A-

RDcausal = RDcrude

f1

1-f1

Page 22: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

22

A+ Y+in A+ (0.25Y+if A+ / 0.25N)

A- Y+in A- (0.75Y+if A- / 0.75N)

N

Why randomization is “the gold standard”

Target Population N, randomized 1:3 to A+ or A-

RDcausal = RDcrude

f1

1-f1

Page 23: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Why randomization is “the gold standard”

23

Drug A Adverse effect Y

L

?

• If randomization is successful, no confounding at baseline

Page 24: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Possible solutions

• Try to avoid confounding through design, e.g.

– Randomize

– Match

– Restrict

• Try to control confounding through analysis, e.g.:

– Traditional analysis methods

– Techniques that weight by causes of the treatment (aka mimic randomization)

– Techniques that weight by causes of the outcome

24

Page 25: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Basic requirements of these methods

• Appropriately hypothesize factors that contribute to confounding

• Can measure these factors (validly)

...‘there is no unmeasured confounding’

25

Page 26: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Option 1: Traditional confounding control

• For example, multivariate regression analysis

• Estimates RDcausal? Yes, when:

– All variables that contribute to confounding are included in the model

– Statistical model is specified correctly

– Effect of AY across strata of confounding

variables is homogeneous

26

Page 27: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Option 2: Propensity score (PS) methods

• Estimates RDcausal? Yes, when:

– All variables that contribute to confounding are included in the model

– Statistical model is specified correctly

– Effect of AY across strata of confounding

variables is homogeneous

(to estimate RDcausal as we’ve defined it: the total population effect)

27

Page 28: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Option 3: Inverse probability weighting (IPW)

• Estimates RDcausal? Yes, when:

– All variables that contribute to confounding are included in the model

– Statistical model is specified correctly

– Positivity holds

There are no strata of L for which there are no subjects who received a particular exposure level

28

Page 29: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

To estimate RDcausal in total population

29

Characteristic Traditional methods

Propensity score control

IPW

Requires all confounding factors measured?

Yes Yes Yes

Statistical correctly specified

Yes Yes Yes

Can handle large number of confounding variables?

No Yes Yes

Requires homogeneity? Yes Yes No

Requires positivity? No No Yes

Page 30: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Estimating the propensity score

30

• Traditional PS: probability of being exposed conditional on predictors of exposure

• Minimally sufficient PS: probability of being exposed conditional on variables that contribute to confounding

Page 31: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Estimating the propensity score

31

• Traditional PS: probability of being exposed conditional on predictors of exposure

• Minimally sufficient PS: probability of being exposed conditional on variables that contribute to confounding

– A weight is assigned to each person according to the conditional probability of exposure given the confounder(s)

Page 32: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Estimating the propensity score: Example 1

32

Study Population

(N)

Y+

Y-

Y+

Y-

Y+

Y-

Y+

Y-

A+

A-

A+

A-

L+

L-

Weight these people by pr(A+ | L+)

Page 33: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Estimating the propensity score: Example 2

33

Study Population

(N)

Y+

Y-

Y+

Y-

Y+

Y-

Y+

Y-

A+

A-

A+

A-

L+

L-

Weight these people by pr(A+ | L-)

Page 34: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Estimating the propensity score

34

Example SAS code:

**estimate propensity scores**; proc logistic descending data=dataset1; model A=L1 L2 L4 L5 L7 L8; output out=probA1_PS p=prA1_PS xbeta=logit_ps; run;

Page 35: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Applying the propensity score

35

• Options: PS control, stratification & matching

• After estimating PS for each person, use it to control confounding by:

Adjusting for the PS as a covariate

Stratifying the effect estimate by the PS

Matching A+ and A- people by PS

Page 36: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Applying the propensity score

36

**PS adjusted (as covariate) RR**; proc genmod data=probA1_PS; model Y=A prA1_PS / dist=poisson; run;

Example SAS code:

Page 37: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Applying the propensity score

37

**PS quintile stratification for CMH RR**; proc rank data=probA1_PS groups=5 out=tempPS; ranks rnks; var prA1_PS; run; proc sort data=tempPS; by A Y; proc freq data=tempPS order=data; tables rnks*A*Y / cmh; output out=tempPS1 mhrrc2 lgrrc2; run; data PSstrat_5; set tempPS1; beta=log(1/_MHRRC2_); SE=(log(U_MHRRC2)-log(L_MHRRC2))/3.92; run;

Example SAS code:

Page 38: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Estimating the inverse probability weight

38

• Another way to use the propensity score!

• IPW: Inverse of the exposure probability conditional on variables that contribute to confounding

– A weight is assigned to each person according to the inverse of the conditional probability of his/her exposure status given the confounder(s)

Page 39: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Estimating the IPW: Example 1

39

Study Population

(N)

Y+

Y-

Y+

Y-

Y+

Y-

Y+

Y-

A+

A-

A+

A-

L+

L-

Weight these people by 1/ pr(A- | L+)

Page 40: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Estimating the IPW: Example 2

40

Study Population

(N)

Y+

Y-

Y+

Y-

Y+

Y-

Y+

Y-

A+

A-

A+

A-

L+

L-

Weight these people by 1 / pr(A+ | L-)

Page 41: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Estimating the IPW

41

**estimate propensity scores**; proc logistic descending data=dataset1; model A=L1 L2 L4 L5 L7 L8; output out=probA1_PS p=prA1_PS xbeta=logit_ps; run; data weights_PS; set probA1_PS; prA0_PS=1-prA1_PS; if A=1 then IPW=1/prA1_PS; else IPW=1/prA0_PS; run;

Example SAS code:

Page 42: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Applying the IPW

42

**IP weighted RR in total population**; proc genmod data=weights_PS; class ID; model Y=A /dist=poisson; weight IPW; repeated subject=id; estimate 'beta' A 1; run;

Example SAS code:

Page 43: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

How do these methods relate to RCTs?

• They use weighting to mimic randomization

• PS methods estimate effect using weighting across x number of ‘new populations’

• IPW weights people by the IPW, creating a pseudo-population

43

Page 44: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

The pseudo-population

44

Study Population

(N)

Y+

Y-

Y+

Y-

Y+

Y-

Y+

Y-

A+

A-

A+

A-

L+

L-

1/ pr(A- | L+)

1/ pr(A+ | L+)

1/ pr(A+ | L-)

1/ pr(A- | L-)

Page 45: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

The pseudo-population

45

Study Population

(N)

Y+

Y-

Y+

Y-

Y+

Y-

Y+

Y-

A+

A-

A+

A-

L+

L-

1/ 0.1 = 10.0

1/ 0.9 = 1.1

1/ 0.05 = 20.0

1/ 0.95 = 1.05

Page 46: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Some practical considerations

46

Page 47: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

47

Propensity score balancing

0

200

400

600

800

1000

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

Untreated

Treated

Propensity Score

Page 48: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

48

Propensity score balancing

0

200

400

600

800

1000

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

Untreated

Treated

Propensity Score

Page 49: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

49

How will PS be estimated?

For example:

• Covariates to be included

• Interaction terms to be included

• Type of model used to estimate the probability of exposure

• Type of regression used to model the outcome

Page 50: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

50

How will IPW be estimated?

For example:

• Covariates to be included

• Interaction terms to be included

• Type of model used to estimate the probability of exposure

• Type of regression used to model the outcome

• Stabilized or unstabilized weights

• Robust variance estimation

Page 51: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

To estimate RDcausal in total population

51

Characteristic Traditional methods

Propensity score control

IPW

Requires all confounding factors measured?

Yes Yes Yes

Statistical correctly specified

Yes Yes Yes

Can handle large number of confounding variables?

No Yes Yes

Requires homogeneity? Yes Yes No

Requires positivity? No No Yes

Page 52: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

To estimate RDcausal in treated

52

Characteristic Traditional methods

Propensity score control

IPW

Requires all confounding factors measured?

Yes Yes Yes

Statistical correctly specified

Yes Yes Yes

Can handle large number of confounding variables?

No Yes Yes

Requires homogeneity? Yes No* No#

Requires positivity? No No Yes

* If PS matching or SMR weighting to combine PS strata # If SMR weighting used to create pseudo-population

Page 53: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Instrumental variable analysis: A solution?

• Stay tuned...

53

Page 54: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

What if we are interested in a time

varying treatment effect?

54

A0 Y A1

• Treatment (A) varies with time

Page 55: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

What if we are interested in a time

varying treatment effect?

55

A0 Y A1

• Treatment (A) varies with time

the causal effect if everyone in the population had been A++ versus if everyone

had been A--

Page 56: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

What if we are interested in a time

varying treatment effect?

56

A0

L1

Y A1

L0

• Treatment (A) varies with time

• Covariate (L) varies with time

Page 57: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

What if we are interested in a time

varying treatment effect?

57

A0

L1

Y A1

L0

• Treatment (A) varies with time

• Covariate (L) varies with time and:

– Predicts the outcome

Page 58: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

What if we are interested in a time

varying treatment effect?

58

A0

L1

Y A1

L0

• Treatment (A) varies with time

• Covariate (L) varies with time and:

– Predicts the outcome

– Predicts exposure

Page 59: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

What if we are interested in a time

varying treatment effect?

59

A0

L1

Y A1

L0

• Treatment (A) varies with time

• Covariate (L) varies with time and:

– Predicts the outcome

– Predicts exposure

– Is affected by previous exposure

How do we

address the

confounding and

mediational

effects of L?

Page 60: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Inverse probability weighting: A solution?

• Stay tuned...

60

Page 61: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Take home messages

• These weighting approaches require all confounding factors are hypothesized and measured

• To estimate the causal effect in the total population, only IPW does not require homogeneity of the treatment effect

• The choice of confounding control method for point-treatment confounding should be driven by:

– Causal question / target population

– Practical considerations

61

Page 62: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

References

62

Hernan MA and Robins JM. Estimating causal effects from epidemiological data. J Epidemiol Community Health 2006;60:578-86 Hernan MA et al. Marginal structural models to estimate the causal effect of zidovudine on the survival of HIV-positive men. Epidemiology 2000;11:561-70 Robins JM et al. Marginal structural models and causal inference in epidemiology. Epidemiology 2000;11:550-60 Sato T and Matsuyama Y. Marginal Structural Models as a Tool for Standardization. Epidemiology 2003;14:680-6 Sturmer T, Rothman KJ, Glynn RJ Insights into different results from different causal contrasts in the presence of effect-measure modification. Pharmacoepidemiol Drug Saf 2006; 15, 698-709

Page 63: Methods to control for confounding - PROTECT Home · Applying the propensity score 35 • Options: PS control, stratification & matching • After estimating PS for each person, use

Thank you

63