precision medicine: lecture 02 causal inference

117
Precision Medicine: Lecture 02 Causal Inference Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete Department of Biostatistics Gillings School of Global Public Health University of North Carolina at Chapel Hill Fall, 2019

Upload: others

Post on 23-Jan-2022

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Precision Medicine: Lecture 02 Causal Inference

Precision Medicine: Lecture 02Causal Inference

Lina Montoya, Michael R. Kosorok,Nikki L. B. Freeman and Owen E. Leete

Department of BiostatisticsGillings School of Global Public Health

University of North Carolina at Chapel Hill

Fall, 2019

Page 2: Precision Medicine: Lecture 02 Causal Inference

AcknowledgmentsDrawn from lectures by:

I Maya Petersen, MD PhD

I Laura Balzer, PhD

I Jen Ahern, PhD

Resources:

I Introduction to Causal Inference coursehttps://www.ucbbiostat.com/

I Petersen, Maya L., and Mark J. van der Laan. ”Causalmodels and learning from data: integrating causalmodeling and statistical estimation.” Epidemiology(Cambridge, Mass.) 25.3 (2014): 418.

I Balzer, Laura B., and Maya L. Petersen. ”InvitedCommentary: Machine Learning in Causal Inference –How Do I Love Thee? Let Me Count the Ways.”American Journal of Epidemiology (2021).

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 2/ 112

Page 3: Precision Medicine: Lecture 02 Causal Inference

Outline

Introduction

Defining causal questions and inference

The Causal Roadmap applied to the average treatment effect

The Causal Roadmap applied to Precision Medicine causalquestions

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 3/ 112

Page 4: Precision Medicine: Lecture 02 Causal Inference

Introduction

I In precision medicine, we often want toknow: who should get whichtreatment?I And at what time?I And at what dose?I And in what sequence?I Etc.

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 4/ 112

Page 5: Precision Medicine: Lecture 02 Causal Inference

IntroductionI In precision medicine, we often want to know: who should

get which treatment?I And at what time?I And at what dose?I And in what sequence?I Etc.

I Further: Is giving treatment in an individualized waybetter than giving it in a non-individualized way(e.g., giving everyone the same treatment)?

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 5/ 112

Page 6: Precision Medicine: Lecture 02 Causal Inference

Introduction

I In precision medicine, we often want to know: who shouldget which treatment?I And at what time?I And at what dose?I And in what sequence?I Etc.

I Further: Is giving treatment in an individualized waybetter than giving it in a non-individualized way (i.e.,giving everyone the same treatment)?

I These are fundamentally causal questions

I The field of causal inference provides formalframeworks for answering causal questions

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 6/ 112

Page 7: Precision Medicine: Lecture 02 Causal Inference

Outline

Introduction

Defining causal questions and inference

The Causal Roadmap applied to the average treatment effect

The Causal Roadmap applied to Precision Medicine causalquestions

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 7/ 112

Page 8: Precision Medicine: Lecture 02 Causal Inference

What makes a question causal?

I A causal question asks about the world under manipulatedconditionsI For example, manipulating the way a treatment was

administeredI Question: What would outcomes look like if all participants

were exposed VS. if the same participants, over the sametime-frame, and under the same conditions were not exposed?

We are not time travelers → answering causal questions can be hard

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 8/ 112

Page 9: Precision Medicine: Lecture 02 Causal Inference

Contrast this with other kinds of questions

I Descriptive questionsI E.g.: What was the average age of participants in our study?I Think: Table 1 in applied journal papers

I Statistical questionsI E.g.: Which risk factors are associated with breast cancer?I E.g.: How likely were women who took vitamins to have breast

cancer?I Think: risk scores/prediction problems

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 9/ 112

Page 10: Precision Medicine: Lecture 02 Causal Inference

Statistical vs. Causal Inference

I Statistical inference - inference on a statistical parameterI Statistical parameter - summary of an observed data

distribution based on a sample drawn from that distributionI May require statistical assumptions, e.g., on the distribution

of the observed data

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 10/ 112

Page 11: Precision Medicine: Lecture 02 Causal Inference

Statistical vs. Causal Inference

I Causal inference - inference on a causal parameterI Causal parameter - summary of a distribution we do not (fully)

observe based on a sample drawn from that distributionI Causal parameters provide a summary of how a data

distribution would change under manipulated conditions, sothey require assumptions beyond statistical ones, namely,about the processes that generated the data

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 11/ 112

Page 12: Precision Medicine: Lecture 02 Causal Inference

Statistical vs. Causal Inference

Example: We sample individuals from some underlying population

and on each subject observe: A=vitamin use, Y=breast cancerI Statistical parameter

I How likely were women who took vitamins to have breastcancer vs. those who did not take vitamins?

I P(Y = 1|A = 1)− P(Y = 1|A = 0)

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 12/ 112

Page 13: Precision Medicine: Lecture 02 Causal Inference

Statistical vs. Causal Inference

Example: We sample individuals from some underlying population

and on each subject observe: A=vitamin use, Y=breast cancerI Statistical parameter

I How likely were women who took vitamins to have breastcancer vs. those who did not take vitamins?

I P(Y = 1|A = 1)− P(Y = 1|A = 0)

I Causal parameterI What is the probability of breast cancer had all women had

taken vitamins vs. if all women had not taken vitamins?I ???

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 13/ 112

Page 14: Precision Medicine: Lecture 02 Causal Inference

Statistical vs. Causal Inference

I Statistical questions (based on what we observe) aredifferent than causal questions (based on what we wish tohave observed)

I Answering causal questions is hard – yet, in public health(and beyond), we ask causal questions all the time...

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 14/ 112

Page 15: Precision Medicine: Lecture 02 Causal Inference

OK, so I think I’m asking a causal question...?

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 15/ 112

Page 16: Precision Medicine: Lecture 02 Causal Inference

Causal Roadmap to the Rescue

A general Roadmap1for tackling causal questions...

1. Specify data and causal model representing realbackground knowledge

2. Specify causal question

3. Specify causal parameter that answers causal question

4. Specify observed data and link to causal model

5. Identification

6. Statistical estimation and inference

7. Interpret results

1Petersen & van der Laan, 2014Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 16/ 112

Page 17: Precision Medicine: Lecture 02 Causal Inference

Outline

Introduction

Defining causal questions and inference

The Causal Roadmap applied to the average treatment effect

The Causal Roadmap applied to Precision Medicine causalquestions

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 17/ 112

Page 18: Precision Medicine: Lecture 02 Causal Inference

Running example - Average Treatment Effect

I Antiretroviral Therapy (ART) potent and effective, but20-40% of HIV-infected patients in sub-Saharan Africalost to follow-up within two years after enrolled in care2

I Want to know the impact of SMS text messagereminders on patient retention in HIV care

2Geng, et. al., 2010Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 18/ 112

Page 19: Precision Medicine: Lecture 02 Causal Inference

Why use The Roadmap?I Usual approach might be...

I Get EHR data/questionnaires measured at first clinic visit,SMS app information, clinic retention records starting at firstvisit

I Since the outcome is binary, decide to use logistic regressionfor analysis

I Estimate conditional odds ratios by exponentiating theregression coefficient on the prevention package

I What’s the problem here?I You’re allowing the tool (e.g., logistic regression) to define the

question, rather than starting with the question you care aboutand picking a tool that allows you to answer that question

I Solution: causal roadmap!

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 19/ 112

Page 20: Precision Medicine: Lecture 02 Causal Inference

The Roadmap

1. Specify data and causal model representing realbackground knowledge

2. Specify causal question

3. Specify causal parameter that answers causal question

4. Specify observed data and link to causal model

5. Identification

6. Statistical estimation and inference

7. Interpret results

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 20/ 112

Page 21: Precision Medicine: Lecture 02 Causal Inference

Step 1: Data and causal model

What are the data, and how do those variables in the data relateto each other?

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 21/ 112

Page 22: Precision Medicine: Lecture 02 Causal Inference

Step 1: Data and causal model

Data:I Measured variables

I X : the set of baseline covariatesI age, sex, SES, viral load at baseline, time on ART, CD4

count...

I A : the exposure/interventionI intervention to send SMS messages, where a ∈ A = {send

SMS = 1, standard of care = 0}I Y : the outcome

I lapse in care one year after baseline, where y ∈ Y = {no lapsein care = 1, lapse in care= 0}

I Unmeasured variablesI U : the set of unmeasured factors

I genetic factors, cultural factors...

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 22/ 112

Page 23: Precision Medicine: Lecture 02 Causal Inference

Step 1: Data and causal model

I Causal modeling formalizes what we actually know abouthow the variables relate to each other - however limitedI What measured variables may affect each other?I What is the role of background factors?I What is the functional form of these relationships?

I Can encode this knowledge with directed acyclic graphs(DAGs) and structural causal models

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 23/ 112

Page 24: Precision Medicine: Lecture 02 Causal Inference

Step 1: Data and causal model

I Structural causal model:

X = fX (UX )

A = fA(UA,X )

Y = fY (UY ,A,X )

I No assumptions on thebackground factors(UX ,UA,UY )

I No assumptions on thefunctional form of theequations (fX , fA, fY )

Double-headed arrows = shared causes

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 24/ 112

Page 25: Precision Medicine: Lecture 02 Causal Inference

Step 1: Data and causal model

No unmeasured confounding setting? How would SCM and graphchange?

I Structural causal model:

X = fX (UX )

A = fA(UA,X )

Y = fY (UY ,A,X )

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 25/ 112

Page 26: Precision Medicine: Lecture 02 Causal Inference

Step 1: Data and causal model

No unmeasured confounding setting? How would SCM and graphchange?

I Structural causal model:

X = fX (UX )

A = fA(UA,X )

Y = fY (UY ,A,X )

I Background factors are allindependent (nounmeasured commoncauses)

I Still no functional formassumptions

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 26/ 112

Page 27: Precision Medicine: Lecture 02 Causal Inference

Step 1: Data and causal model

RCT setting? How would SCM and graph change?

I Structural causal model:

X = fX (UX )

A = fA(UA,X )

Y = fY (UY ,A,X )

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 27/ 112

Page 28: Precision Medicine: Lecture 02 Causal Inference

Step 1: Data and causal model

RCT setting? How would SCM and graph change?

I Structural causal model:

X = fX (UX )

A = UA

Y = fY (UY ,A,X )

I A determined by flip of acoin UA ∼ Bernoulli(0.5)

I A not a function of X(exclusion restriction)

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 28/ 112

Page 29: Precision Medicine: Lecture 02 Causal Inference

Step 1: Data and causal model

In this example, this will be our SCM

I Structural causal model:

X = fX (UX )

A = UA

Y = fY (UY ,A,X )

I RCT part of ADAPT-R = Adaptive Strategies forPreventing and Treating Lapses of Retention in HIV Care

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 29/ 112

Page 30: Precision Medicine: Lecture 02 Causal Inference

The Roadmap

1. Specify data and causal model representing realbackground knowledge

2. Specify causal question

3. Specify causal parameter that answers causal question

4. Specify observed data and link to causal model

5. Identification

6. Statistical estimation and inference

7. Interpret results

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 30/ 112

Page 31: Precision Medicine: Lecture 02 Causal Inference

Step 2: Causal question

I What is the effect of SMS text message reminders onpatient retention in HIV care?

I Consider one hypothetical experimentI What would be the difference in one-year patient retention had

everyone received SMS reminders versus if no one had receivedSMS reminders?

I Why is this an example of a causal question?

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 31/ 112

Page 32: Precision Medicine: Lecture 02 Causal Inference

Step 2: Causal question

I What is the effect of SMS text message reminders onpatient retention in HIV care?

I Consider one hypothetical experimentI What would be the difference in patient retention had everyone

received SMS reminders versus if no one had received SMSreminders?

I Why is this an example of a causal question? Because we’reasking about the probability of retention under differentconditions than observed (we only observe retention under oneSMS condition)

I Many other hypothetical experiments possibleI Note: giving SMS to everyone (or no one) is an example

of a static (vs. dynamic) treatment ruleI Assigning SMS, regardless of patient covariates

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 32/ 112

Page 33: Precision Medicine: Lecture 02 Causal Inference

The Roadmap

1. Specify data and causal model representing realbackground knowledge

2. Specify causal question

3. Specify causal parameter that answers causalquestion

4. Specify observed data and link to causal model

5. Identification

6. Statistical estimation and inference

7. Interpret results

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 33/ 112

Page 34: Precision Medicine: Lecture 02 Causal Inference

Step 3: Causal parameter

I Counterfactual (potential) outcomesare defined by modifications to thedata generating process described bythe model

X = fX (UX )

A = a

Ya = fY (UY , a,X )

I Ya: the counterfactual outcome, ifpossibly contrary to fact, a personreceived exposure level A=aI Y1: counterfactual one-year retention

outcome under SMS (A=1)I Y0: counterfactual one-year retention

outcome under standard of care (A=0)

“Surgically modified” model

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 34/ 112

Page 35: Precision Medicine: Lecture 02 Causal Inference

Step 3: Causal parameter

Note: other notation for counterfactual outcomes...

I Y (a), Y a, Y (a)

I In this lecture, we’ll use Ya

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 35/ 112

Page 36: Precision Medicine: Lecture 02 Causal Inference

Step 3: Causal parameter

I Use counterfactuals to define the target causal parameter= summary of distribution we do not (fully) observe

I Examples:I Treatment-specific mean

I E.g.: The expected counterfactual one-year retention outcomehad all patients received SMS

E[Y1]

I Average treatment effectI E.g.: The difference in the expected counterfactual one-year

retention outcome had all patients received SMS vs. standardof care:

E[Y1]− E[Y0]

I Causal relative risk: E[Y1]/E[Y0]I Causal odds ratio: E[Y1]/(1−E[Y1])

E[Y0]/(1−E[Y0])I etc...

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 36/ 112

Page 37: Precision Medicine: Lecture 02 Causal Inference

Step 3: Causal parameter

I Use counterfactuals to define the target causal parameter= summary of distribution we do not (fully) observe

I Examples:I Treatment-specific mean

I E.g.: The expected counterfactual one-year retention outcomehad all patients received SMS

E[Y1]

I Average treatment effectI E.g.: The difference in the expected counterfactual one-year

retention outcome had all patients received SMS vs. standardof care:

E[Y1]− E[Y0]

I Causal relative risk: E[Y1]/E[Y0]I Causal odds ratio: E[Y1]/(1−E[Y1])

E[Y0]/(1−E[Y0])I etc...

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 36/ 112

Page 38: Precision Medicine: Lecture 02 Causal Inference

Step 3: Causal parameter

I Use counterfactuals to define the target causal parameter= summary of distribution we do not (fully) observe

I Examples:I Treatment-specific mean

I E.g.: The expected counterfactual one-year retention outcomehad all patients received SMS

E[Y1]

I Average treatment effectI E.g.: The difference in the expected counterfactual one-year

retention outcome had all patients received SMS vs. standardof care:

E[Y1]− E[Y0]

I Causal relative risk: E[Y1]/E[Y0]

I Causal odds ratio: E[Y1]/(1−E[Y1])E[Y0]/(1−E[Y0])

I etc...

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 36/ 112

Page 39: Precision Medicine: Lecture 02 Causal Inference

Step 3: Causal parameter

I Use counterfactuals to define the target causal parameter= summary of distribution we do not (fully) observe

I Examples:I Treatment-specific mean

I E.g.: The expected counterfactual one-year retention outcomehad all patients received SMS

E[Y1]

I Average treatment effectI E.g.: The difference in the expected counterfactual one-year

retention outcome had all patients received SMS vs. standardof care:

E[Y1]− E[Y0]

I Causal relative risk: E[Y1]/E[Y0]I Causal odds ratio: E[Y1]/(1−E[Y1])

E[Y0]/(1−E[Y0])

I etc...

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 36/ 112

Page 40: Precision Medicine: Lecture 02 Causal Inference

Step 3: Causal parameter

I Use counterfactuals to define the target causal parameter= summary of distribution we do not (fully) observe

I Examples:I Treatment-specific mean

I E.g.: The expected counterfactual one-year retention outcomehad all patients received SMS

E[Y1]

I Average treatment effectI E.g.: The difference in the expected counterfactual one-year

retention outcome had all patients received SMS vs. standardof care:

E[Y1]− E[Y0]

I Causal relative risk: E[Y1]/E[Y0]I Causal odds ratio: E[Y1]/(1−E[Y1])

E[Y0]/(1−E[Y0])I etc...

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 36/ 112

Page 41: Precision Medicine: Lecture 02 Causal Inference

Step 3: Causal parameter

I Use counterfactuals to define the target causal parameter= summary of distribution we do not (fully) observe

I Examples:I Treatment-specific mean

I E.g.: The expected counterfactual retention outcome had allpatients received SMS

E[Y1]

I Average treatment effect (i.e., causal risk difference if binaryoutcome)

I E.g.: The difference in the expected counterfactual one-yearretention outcome had all patients received SMS vs. standardof care:

E[Y1]− E[Y0]

I Causal relative risk: E[Y1]/E[Y0]I Causal odds ratio: E[Y1]/(1−E[Y1])

E[Y0]/(1−E[Y0])I etc...

Which of the above parameters are more causal?

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 37/ 112

Page 42: Precision Medicine: Lecture 02 Causal Inference

Step 3: Causal parameter

I Use counterfactuals to define the target causal parameter= summary of distribution we do not (fully) observe

I Examples:I Treatment-specific mean

I E.g.: The expected counterfactual retention outcome had allpatients received SMS

E[Y1]

I Average treatment effect (i.e., causal risk difference if binaryoutcome)

I E.g.: The difference in the expected counterfactual one-yearretention outcome had all patients received SMS vs. standardof care:

E[Y1]− E[Y0]

I Causal relative risk: E[Y1]/E[Y0]I Causal odds ratio: E[Y1]/(1−E[Y1])

E[Y0]/(1−E[Y0])I etc...

Nothing “more” or “less causal” about any of these parameters

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 38/ 112

Page 43: Precision Medicine: Lecture 02 Causal Inference

Step 3: Causal parameter

Focus first on average treatment effect (ATE)

I E.g.: The difference in the expected counterfactualone-year retention outcome had all patients received SMSvs. standard of care:

E[Y1]− E[Y0]

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 39/ 112

Page 44: Precision Medicine: Lecture 02 Causal Inference

The Roadmap

1. Specify data and causal model representing realbackground knowledge

2. Specify causal question

3. Specify causal parameter that answers causal question

4. Specify observed data and link to causal model

5. Identification

6. Statistical estimation and inference

7. Interpret results

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 40/ 112

Page 45: Precision Medicine: Lecture 02 Causal Inference

Step 4a: Specify observed data

I Since we are not time-travelers, we need to specify thedata that can actually be observed

I Remember: for one patient, we actually observe:

O = (X ,A,Y ) ∼ P

I X : baseline covariates (e.g., age, sex, SES, viral load atbaseline, time on ART, CD4 count...)

I A : exposure (e.g., indicator of receiving SMS text messages)I Y : outcome (e.g., indicator of HIV care retention during one

year)I With probability distribution P

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 41/ 112

Page 46: Precision Medicine: Lecture 02 Causal Inference

Step 4b: Link observed data to causal model

I How does the data we observe O relate to themanipulated, causal model?

I We assume the observed data were generated by samplingn times from a data generating system specified by(described by) our causal modelI Here we will assume (but don’t have to) that our dataset are

1, 153 i.i.d. copies O1,O2, ...,On drawn from P

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 42/ 112

Page 47: Precision Medicine: Lecture 02 Causal Inference

Step 4b: Link observed data to causal model

I How does the data we observe O relate to themanipulated, causal model?

I We assume the observed data were generated by samplingn times from a data generating system specified by(described by) our causal modelI Here we will assume (but don’t have to) that our dataset are

1, 153 i.i.d. copies O1,O2, ...,On drawn from P

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 42/ 112

Page 48: Precision Medicine: Lecture 02 Causal Inference

Step 4c: Specify a statistical model

I Formally, a statistical model M is the set of possibledistributions of the observed dataI P ∈M

I Our causal model implies the statistical modelI Here, our causal model only places restrictions on our

statistical model through the treatment mechanismI No other functional form assumptionsI E.g.: our causal model says exposure

A = UA ∼ Bernoulli(0.5), i.e., P(A = 1|X ) = 0.5I We specified this in our causal model (in step 1)

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 43/ 112

Page 49: Precision Medicine: Lecture 02 Causal Inference

Step 4c: Continuum of Statistical Models(informally)

I Non-parametric model: no restrictions on the set ofpossible distributions P

I Semi-parametric model: some restrictions on the set ofpossible distributions P

I Parametric model: assumes that we know P up to a finitenumber of unknown parameters

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 44/ 112

Page 50: Precision Medicine: Lecture 02 Causal Inference

Step 4c: Continuum of Statistical Models(informally)

I Non-parametric model: no restrictions on the set ofpossible distributions P

I Semi-parametric model: some restrictions on the set ofpossible distributions P

I Parametric model: assumes that we know P up to a finitenumber of unknown parameters

Note: The point is not that parametric (or semiparametric) modelsare bad...the point is that our statistical model should accuratelyreflect our knowledge (it should contain the true distribution of O)

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 45/ 112

Page 51: Precision Medicine: Lecture 02 Causal Inference

The Roadmap

1. Specify data and causal model representing realbackground knowledge

2. Specify causal question

3. Specify causal parameter that answers causal question

4. Specify observed data and link to causal model

5. Identification

6. Statistical estimation and inference

7. Interpret results

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 46/ 112

Page 52: Precision Medicine: Lecture 02 Causal Inference

Step 5: Identification

I What we want: E[Y1 − Y0] (causal parameter - functionof counterfactual distribution)

I What we have: sample from observed data distribution

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 47/ 112

Page 53: Precision Medicine: Lecture 02 Causal Inference

Step 5: Identification

I What we want: E[Y1 − Y0] (causal parameter - functionof counterfactual distribution)

I What we have: sample from observed data distribution

Can we write our causal parameter as something we can estimatewith the observed data?

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 48/ 112

Page 54: Precision Medicine: Lecture 02 Causal Inference

Step 5: Identification

I What we want: E[Y1 − Y0] (causal parameter - functionof counterfactual distribution)

I What we have: sample from observed data distribution

Can we write our causal parameter as something we can estimatewith the observed data?

What are the assumptions required to do this?

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 49/ 112

Page 55: Precision Medicine: Lecture 02 Causal Inference

Step 5: Identification

We already made the following assumptions:I Temporality:

I E.g.: exposure precedes outcomeI Indicated by arrows on the causal graph from A to YI Equivalently Y as a function of A in the structural equations

I Consistency: Ya = Y when A = AI Counterfactuals defined through modifications to the causal

model, which also provides a description of the study underexisting conditions (i.e., observed exposure)

I Stable Unit Treatment Value Assumption(SUTVA): no interference between unitsI Indicated by the causal model where the outcome Y is only a

function of each patient’s exposure A

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 50/ 112

Page 56: Precision Medicine: Lecture 02 Causal Inference

Step 5: Identification

We need the following assumptions:

I No unmeasured confounding (AKArandomization/strong ignorability assumption):

Ya ⊥ A|X for a ∈ A

I X is the adjustment set (i.e., confounders) - determined bybackdoor criterion

I Glossing over here- but this takes careful consideration!

I Positivity (AKA experimental treatment assignment):sufficient support in the exposure within possible values ofour adjustment set X

mina∈A

P(A = a|X = x) > 0 for all w for which P(X = x) > 0

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 51/ 112

Page 57: Precision Medicine: Lecture 02 Causal Inference

Step 5: Identification

We need the following assumptions:

I No unmeasured confounding (AKA randomizationassumption):

Ya ⊥ A|X for a ∈ A

I X is the adjustment set (i.e., confounders)

I Positivity (AKA experimental treatment assignment):sufficient support in the exposure within possible values ofour adjustment set X

mina∈A

P(A = a|X = x) > 0 for all w for which P(X = x) > 0

Do these assumptions hold in our running example?

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 52/ 112

Page 58: Precision Medicine: Lecture 02 Causal Inference

Step 5: Identification

We need the following assumptions:

I No unmeasured confounding (AKA randomizationassumption):

Ya ⊥ A|X for a ∈ A

I X is the adjustment set (i.e., confounders)

I Positivity (AKA experimental treatment assignment orETA): sufficient support in the exposure within possiblevalues of our adjustment set X

mina∈A

P(A = a|X = x) > 0 for all w for which P(X = x) > 0

Yes! Both hold by RCT design (SMS randomized, P(A|X ) isknown)

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 53/ 112

Page 59: Precision Medicine: Lecture 02 Causal Inference

Step 5: Identification

With the aforementioned assumptions:

E[Y1] = E[E[Y1|X ]] by the tower rule

= E[E[Y1|A = 1,X ]] under randomization

= E[E[Y |A = 1,X ]] by consistency and positivity

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 54/ 112

Page 60: Precision Medicine: Lecture 02 Causal Inference

Step 5: IdentificationWith these assumptions:

E[Y1] = E[E[Y1|X ]] by the tower rule

= E[E[Y1|A = 1,X ]] under randomization

= E[E[Y |A = 1,X ]] by consistency and positivity

and

E[Y0] = E[E[Y0|X ]] by the tower rule

= E[E[Y1|A = 0,X ]] under randomization

= E[E[Y |A = 0,X ]] by consistency and positivity

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 55/ 112

Page 61: Precision Medicine: Lecture 02 Causal Inference

Step 5: IdentificationSo under the aforementioned assumptions:

E[Y1 − Y0] =E[E[Y |A = 1,X ]− E[Y |A = 0,X ]]

=∑x

[E[Y |A = 1,X = x ]

− E[Y |A = 0,X = x ]]P(X = x)

I The RHS is our statistical parameter

I Also called the G-computation formula3

I “Difference in the expected outcome given the exposureand the confounders and the expected outcome given noexposure and the confounders, averaged (standardized)with respect to the confounder distribution”

I (summation generalizes to integral for continuous X )3Robins, 1986

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 56/ 112

Page 62: Precision Medicine: Lecture 02 Causal Inference

The Roadmap

1. Specify data and causal model representing realbackground knowledge

2. Specify causal question

3. Specify causal parameter that answers causal question

4. Specify observed data and link to causal model

5. Identification

6. Statistical estimation and inference

7. Interpret results

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 57/ 112

Page 63: Precision Medicine: Lecture 02 Causal Inference

Step 6: Statistical estimation and inference

I Recall our statistical parameter:

Ψ(P) = E[E[Y |A = 1,X ]− E[Y |A = 0,X ]]

I Many estimators availableI Simple substitution estimator based on G-computation formulaI Inverse probability of treatment weighting (IPTW)4

I Augmented IPTW (AIPW)5

I Targeted maximum likelihood estimation (TMLE)6

I ...

5Hernan & Robins, 2006; Rosenbaum & Rubin, 19836Robins, Rotnitzky, & Zhao, 1994; Scharfstein, Rotnitzky, & Robins, 1999;

Robins, 19996Rosenblum & van der Laan, 2010; van der Laan & Rose, 2011

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 58/ 112

Page 64: Precision Medicine: Lecture 02 Causal Inference

Step 6: Statistical estimation and inference

I Pause to recall the “usual” approach1. Run logistic regression of the outcome (retention) Y on the

exposure (SMS messages) A and baseline confounders X

Logit(E[Y |A,X ]) = β0 + β1A + β2X1 + ...+ β19X18

2. Exponentiate the coefficient on the exposure and interpret theassociation in terms of a conditional odds ratio that ”holdsother factors constant”

I But: our target parameter Ψ(P) does not equal eβ1!I Relies on outcome regression model being correctly specifiedI Letting the estimation approach drive the question...

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 59/ 112

Page 65: Precision Medicine: Lecture 02 Causal Inference

Step 6: Estimation with simple sub. estimator basedon G-comp.

I Focuses on estimation of the “outcome regression”I Expected outcome, given the exposure and confoundersI Q(A,X ) = E[Y |A,X ]

1

n

n∑i=1

[Q̂(Ai = 1,Xi )− Q̂(Ai = 0,Xi )

]

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 60/ 112

Page 66: Precision Medicine: Lecture 02 Causal Inference

Step 6: Estimation with IPTW

I Can think of confounding as a problem of biased samplingI Certain exposure-covariate subgroups are over-represented

relative to what we would see in a randomized trialI Other exposure-covariate subgroups are under-represented

I Apply weights to up-weight under-representedobservations and down-weight over-representedobservations

I Average weighted outcomes and compare

1

n

n∑i=1

[I[Ai = 1]

P̂(Ai = 1|Xi )Yi

]− 1

n

n∑i=1

[I[Ai = 0]

P̂(Ai = 0|Xi )Yi

]

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 61/ 112

Page 67: Precision Medicine: Lecture 02 Causal Inference

Step 6: Challenge of estimation

I But we often do not know enough to correctly specifythese regressionsI G-comp. relies on a regression based on a correctly specified

parametric model of the conditional expected outcomeQ(A,X )

I Inverse weighting also relies on a regression based on acorrectly specified parametric model of the propensity scoreP(A = 1|X )

I But parametric restrictions → misspecification → biasand misleading inferenceI Wrong answers and wrong conclusionsI Increasing sample size (i.e., “big data”) will not solve this

problem → more precision around the wrong answer

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 62/ 112

Page 68: Precision Medicine: Lecture 02 Causal Inference

Step 6: Incorporating machine learning inestimation?

I Estimate complex relationships in data flexibly

I Avoid introducing unsubstantiated assumptions duringestimation of outcome regression or propensity score

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 63/ 112

Page 69: Precision Medicine: Lecture 02 Causal Inference

Step 6: Prediction vs. causal inference

Can we use ML instead of parametric regressions in IPTW orG-computation?I No reliable way to obtain statistical inference

I No theory to support that ML in place of parametricregressions in “single robust” estimators have a normal (or anylimit) distribution

I ML has different goals than causal effect estimation

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 64/ 112

Page 70: Precision Medicine: Lecture 02 Causal Inference

Step 6: Incorporating ML in estimation

I AIPW

1

n

n∑i=1

I[Ai = 1]

P̂(Ai |Xi )(Yi − Q̂(Ai ,Xi )) + Q̂(1,Xi )

−1

n

n∑i=1

I[Ai = 0]

P̂(Ai |Xi )(Yi − Q̂(Ai ,Xi )) + Q̂(0,Xi )

I TMLE

1

n

n∑i=1

I[Ai = 1]

P̂(Ai |Xi )(Yi − Q̂∗(Ai ,Xi )) + Q̂(1,Xi )

−1

n

n∑i=1

I[Ai = 0]

P̂(Ai |Xi )(Yi − Q̂∗(Ai ,Xi )) + Q̂∗(0,Xi )

I Q̂∗ targeted toward parameter we care about (not Q)

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 65/ 112

Page 71: Precision Medicine: Lecture 02 Causal Inference

Step 6: Incorporating ML in estimation

AIPW and TMLEI Double-robust: can incorporate machine learning to avoid

unsubstantiated assumptions, while maintaining validstatistical inferenceI Consistent estimate if either the outcome regression or

propensity score is consistently estimateI Asymptotically linear under regularity conditions (by the

Central Limit Theorem, normally distributed)I Efficient: lowest possible variance if both outcome regression

and propensity score are consistently estimated at reasonablerates

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 66/ 112

Page 72: Precision Medicine: Lecture 02 Causal Inference

Step 6: Statistical estimation and inference

I Recall our statistical parameter:

Ψ(P) = E[E[Y |A = 1,X ]− E[Y |A = 0,X ]]

I Many estimators availableI Simple substitution estimator based on G-computation formulaI Inverse probability of treatment weighting (IPTW)I Augmented IPTW (AIPW)I Targeted maximum likelihood estimation (TMLE)I ...

Which of the above estimators are more causal?

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 67/ 112

Page 73: Precision Medicine: Lecture 02 Causal Inference

Step 6: Statistical estimation and inference

I Recall our statistical parameter:

Ψ(P) = E[E[Y |A = 1,X ]− E[Y |A = 0,X ]]

I Many estimators availableI Simple substitution estimator based on G-computation formulaI Inverse probability of treatment weighting (IPTW)I Augmented IPTW (AIPW)I Targeted maximum likelihood estimation (TMLE)I ...

Nothing “more” or “less causal” about these estimators – bythemselves, these are not “causal methods”

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 68/ 112

Page 74: Precision Medicine: Lecture 02 Causal Inference

The Roadmap

1. Specify data and causal model representing realbackground knowledge

2. Specify causal question

3. Specify causal parameter that answers causal question

4. Specify observed data and link to causal model

5. Identification

6. Statistical estimation and inference

7. Interpret results

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 69/ 112

Page 75: Precision Medicine: Lecture 02 Causal Inference

Step 7: Interpretation

I To what degree have identifiability assumptions have beenmet? This informs the strength of interpretation

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 70/ 112

Page 76: Precision Medicine: Lecture 02 Causal Inference

Step 7: Interpretation

I If there are concerns about identifiability (e.g., temporalordering unclear, key confounder not measured) theparameter can be interpreted as an association(statistical interpretation)I Interpretation: The difference in the outcome associated with

everyone exposed, compared to everyone unexposed,accounting for the measured confounders

I E.g.: The difference in the probability retention associatedwith SMS messages was X, after controlling for age, sex, SES,viral load, CD4 count,...

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 71/ 112

Page 77: Precision Medicine: Lecture 02 Causal Inference

Step 7: Interpretation

I If we believe identifiability assumptions are met, theparameter can be interpreted as the average treatmenteffectI Interpretation: The difference in the expected outcome if

everyone were exposed compared with if everyone wereunexposed

I E.g.: There would be a X difference in the probability ofretention if everyone had received SMS messages vs. standardof care

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 72/ 112

Page 78: Precision Medicine: Lecture 02 Causal Inference

Step 7: Interpretation

I If we believe identifiability assumptions are met, theparameter can be interpreted as the average treatmenteffectI Interpretation: The difference in the expected outcome if

everyone were exposed compared with if everyone wereunexposed

I E.g.: There would be a X difference in the probability ofretention if everyone had received SMS messages vs. standardof care

I Note that we are in the RCT setting, but RCTinterpretation requires:I Effective randomizationI Perfect complianceI Perfect follow up

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 73/ 112

Page 79: Precision Medicine: Lecture 02 Causal Inference

Step 7: Interpretation

I If we believe identifiability assumptions are met, theparameter can be interpreted as the average treatmenteffectI Interpretation: The difference in the expected outcome if

everyone were exposed compared with if everyone wereunexposed

I E.g.: There would be a X difference in the probability ofretention if everyone had received SMS messages vs. standardof care

I Note that we are in the RCT setting, but RCTinterpretation requires:I Effective randomizationI Perfect complianceI Perfect follow up

Where can we incorporate this knowledge?

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 74/ 112

Page 80: Precision Medicine: Lecture 02 Causal Inference

Step 7: Interpretation

I If we believe identifiability assumptions are met, theparameter can be interpreted as the average treatmenteffectI Interpretation: The difference in the expected outcome if

everyone were exposed compared with if everyone wereunexposed

I E.g.: There would be a X difference in the probability ofretention if everyone had received SMS messages vs. standardof care

I Note that we are in the RCT setting, but RCTinterpretation requires:I Effective randomizationI Perfect complianceI Perfect follow up

Step 1 (data and causal model) and step 2 (causalquestion), which will inform later steps.

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 75/ 112

Page 81: Precision Medicine: Lecture 02 Causal Inference

Outline

Introduction

Defining causal questions and inference

The Causal Roadmap applied to the average treatment effect

The Causal Roadmap applied to Precision Medicine causalquestions

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 76/ 112

Page 82: Precision Medicine: Lecture 02 Causal Inference

Tie back to Precision Medicine

So what does this have to do with Precision Medicine?

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 77/ 112

Page 83: Precision Medicine: Lecture 02 Causal Inference

Remember...

I In precision medicine, we often want to know: who shouldget which treatment?I And at what time?I And at what dose?I And in what sequence?I Etc.

I Further: Is giving treatment in an individualized waybetter than giving it in a non-individualized way (i.e.,giving everyone the same treatment)?

I These are fundamentally causal questions

I The field of causal inference provides formalframeworks for answering causal questions

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 78/ 112

Page 84: Precision Medicine: Lecture 02 Causal Inference

Running exampleI Antiretroviral Therapy (ART) potent and effective, but

20-40% of HIV-infected patients in sub-Saharan Africalost to follow-up within two years after enrolled in care

I Many diverse barriers to HIV care retentionI Want to know the impact of SMS text message reminders

on patient retention in HIV careI Further: Who benefits the most from SMS text

messages? Is sending SMS reminders to those whorespond well to them better than SMS for all?

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 79/ 112

Page 85: Precision Medicine: Lecture 02 Causal Inference

Step 1: Data and causal model

Data:I U : the set of unmeasured factors

I genetic factors, cultural factors...

I X : the set of baseline covariatesI age, sex, SES, viral load at baseline, time on ART, CD4

count...

I A : the exposureI intervention to stay in care, where a ∈ A = {SMS = 1,

standard of care = 0}I Y : the outcome

I lapse in care one year after follow-up, where y ∈ Y = {nolapse in care = 1, lapse in care= 0}

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 80/ 112

Page 86: Precision Medicine: Lecture 02 Causal Inference

Step 1: Causal model

I Structural causal model:

X = fX (UX )

A = UA ∼ Bernoulli(0.5)

Y = fY (UY ,A,X )

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 81/ 112

Page 87: Precision Medicine: Lecture 02 Causal Inference

Step 2 & 3: Causal Questions & Parameters

I “Simple” Dynamic Treatment Rule & its Value

I Dynamic Treatment Rule & its Value

I Optimal Dynamic Treatment Rule & its Value

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 82/ 112

Page 88: Precision Medicine: Lecture 02 Causal Inference

Step 2: Causal Questions - “Simple” DynamicTreatment Rule

“Simple” Dynamic Treatment Rule

I If all participants had been given SMS based on theirage, what proportion of participants would not have hada lapse in care?

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 83/ 112

Page 89: Precision Medicine: Lecture 02 Causal Inference

Step 2: Causal Questions - “Simple” DynamicTreatment Rule

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 84/ 112

Page 90: Precision Medicine: Lecture 02 Causal Inference

Step 2: Causal Questions - “Simple” DynamicTreatment Rule

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 85/ 112

Page 91: Precision Medicine: Lecture 02 Causal Inference

Step 2: Causal Questions - “Simple” DynamicTreatment Rule

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 86/ 112

Page 92: Precision Medicine: Lecture 02 Causal Inference

Step 2: Causal Questions - “Simple” DynamicTreatment Rule

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 87/ 112

Page 93: Precision Medicine: Lecture 02 Causal Inference

Step 3: Causal Parameters - “Simple” DynamicTreatment Rule

I Define the simple dynamic treatment rule:I d(age) = a dynamic treatment rule for assigning A based on

each participants’s age.

A =

{SMS, if age ≤ 24

standard of care, if age > 24

Why is this a dynamic (vs. static) treatment rule?

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 88/ 112

Page 94: Precision Medicine: Lecture 02 Causal Inference

Step 3: Causal Parameters - “Simple” DynamicTreatment Rule

I Define the simple dynamic treatment rule:I d(age) = a dynamic treatment rule for assigning A based on

each participants’s age.

A =

{SMS, if age ≤ 24

standard of care, if age > 24

Why is this a dynamic (vs. static) treatment rule? Because we’reassigning treatment based on some function of X , the covariates(e.g., age)

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 89/ 112

Page 95: Precision Medicine: Lecture 02 Causal Inference

Step 3: Causal Parameters - “Simple” DynamicTreatment Rule

I Counterfactuals

X = fX (UX )

A = d(age)

Yd(age) = fY (UY ,X , d(age))

I Yd(age) is the counterfactual retention status for aparticipant if his/her/their intervention A were assignedbased on his/her/their age

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 90/ 112

Page 96: Precision Medicine: Lecture 02 Causal Inference

Step 3: Causal Parameters - “Simple” DynamicTreatment Rule

Causal parameter: E[Yd(age)]

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 91/ 112

Page 97: Precision Medicine: Lecture 02 Causal Inference

Step 2: Causal Questions - Dynamic Treatment Rule

Dynamic Treatment Rule

I If all participants had been assigned SMS based on theircharacteristics (i.e., covariates), what proportion ofparticipants would not have had a lapse in care within oneyear?

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 92/ 112

Page 98: Precision Medicine: Lecture 02 Causal Inference

Step 2: Causal Questions - Dynamic Treatment Rule

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 93/ 112

Page 99: Precision Medicine: Lecture 02 Causal Inference

Step 2: Causal Questions - Dynamic Treatment Rule

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 94/ 112

Page 100: Precision Medicine: Lecture 02 Causal Inference

Step 2: Causal Questions - Dynamic Treatment Rule

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 95/ 112

Page 101: Precision Medicine: Lecture 02 Causal Inference

Step 3: Causal Parameters - Dynamic TreatmentRule

I Define d(X ) as a dynamic treatment rule for assigning Abased on the covariates XI X ={age, sex, SES, viral load, CD4 count, etc...}

I The counterfactual outcomes are now:

X = fX (UX )

A = d(X )

Yd = fY (UY ,X , d(X ))

I Yd is the counterfactual one year retention status for anindividual if the intervention A were assigned using d(X )

I Causal parameter: probability of no lapse under dynamictreatment rule

E[Yd ]

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 96/ 112

Page 102: Precision Medicine: Lecture 02 Causal Inference

Step 2: Causal Questions - Optimal DynamicTreatment Rule

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 97/ 112

Page 103: Precision Medicine: Lecture 02 Causal Inference

Step 2: Causal Questions - Optimal DynamicTreatment Rule

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 98/ 112

Page 104: Precision Medicine: Lecture 02 Causal Inference

Step 2: Causal Questions - Optimal DynamicTreatment Rule

Optimal Dynamic Treatment Rule

I What is the dynamic rule for assignment to SMS messagesthat yields the highest probability of no lapse based onparticipants’ characteristics (i.e., covariates)?

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 99/ 112

Page 105: Precision Medicine: Lecture 02 Causal Inference

Step 3: Causal Parameters - Optimal DynamicTreatment Rule

I Define the optimal dynamic treatment rule7:

d∗(X ) ∈ argmaxd∈D

E[Yd ]

where D is the set of all dynamic treatment rules

I Can also define as function of conditional averagetreatment effect (CATE)

d∗(X ) ≡ I[E[Y1 − Y0|X ] > 0

]

7Murphy, 2003; Robins, 2004Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 100/ 112

Page 106: Precision Medicine: Lecture 02 Causal Inference

Step 3: Causal Parameters - Optimal DynamicTreatment Rule

I Counterfactuals

X = fX (UX )

A = d∗(X )

Yd∗ = fY (UY ,X , d∗(X ))

I Yd∗ is the counterfactual one year retention status for anindividual if the intervention A were assigned based on theoptimal treatment rule d∗(X )

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 101/ 112

Page 107: Precision Medicine: Lecture 02 Causal Inference

Step 2: Causal Questions - Value of OptimalDynamic Treatment Rule

Value of Optimal Dynamic Treatment Rule

I Had all participants followed their optimal rule forSMS messages, what proportion of participants wouldhave been retained within a year?

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 102/ 112

Page 108: Precision Medicine: Lecture 02 Causal Inference

Step 3: Causal Parameters - Value of OptimalDynamic Treatment Rule

Causal parameter: E[Yd∗ ]Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 103/ 112

Page 109: Precision Medicine: Lecture 02 Causal Inference

Step 2 & 3: Causal Questions and Parameters -Value of Optimal Dynamic Treatment Rule

Evaluating a personalized vs. non-personalized intervention

I Had all participants followed their optimal rule for SMSmessages, what proportion of participants would havebeen retained within a year?

I How does this compare to giving everyone SMS?

Causal parameter: E[Yd∗ ]− E[Y1]

(can do the same contrast with the other values of the rule)

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 104/ 112

Page 110: Precision Medicine: Lecture 02 Causal Inference

Step 4: Specify Observed Data

I Assume that observed data O = (X ,A,Y ) ∼ P weregenerated by sampling 1, 153 i .i .d . times from a datagenerating system described by the SCM

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 105/ 112

Page 111: Precision Medicine: Lecture 02 Causal Inference

Step 5: Identification

I Identification criteriaI Randomization (no unmeasured confounding)

Yd ⊥ A|X ∀ d ∈ D

I Positivity

mina∈A

P(A = a|X = x) > 0 for all w for which P(X = x) > 0

Both hold by RCT design (SMS randomized, P(A|X ) is known)

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 106/ 112

Page 112: Precision Medicine: Lecture 02 Causal Inference

Step 5: Statistical Estimands

1. True ODTRI d∗(X ) ∈ argmaxd∈D E[Q(A = d(X ),X )]I Identify CATE as B(X ) = Q(1,X )− Q(0,X ), the “blip

function”, then

d∗(X ) ≡ I[B(X ) > 0]

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 107/ 112

Page 113: Precision Medicine: Lecture 02 Causal Inference

Step 5: Statistical Estimands

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 108/ 112

Page 114: Precision Medicine: Lecture 02 Causal Inference

Step 5: Statistical Estimands

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 109/ 112

Page 115: Precision Medicine: Lecture 02 Causal Inference

Step 5: Statistical Estimands

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 110/ 112

Page 116: Precision Medicine: Lecture 02 Causal Inference

Step 5: Statistical Estimands

1. True ODTRI d∗(X ) ∈ argmaxd∈D E[Q(A = d(X ),X )]I Identify CATE as B(X ) = Q(1,X )− Q(0,X ), the “blip

function”, then

d∗(X ) ≡ I[B(X ) > 0]

2. True value of true ODTR: E[Q(A = d∗(X ),X )]

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 111/ 112

Page 117: Precision Medicine: Lecture 02 Causal Inference

Step 6 & 7: Estimation and Interpretation

Up next in the class!

Lina Montoya, Michael R. Kosorok, Nikki L. B. Freeman and Owen E. Leete 112/ 112