siscr 2019 6 2-erb - si.biostat.washington.edusi.biostat.washington.edu/sites/default/files... · 0...
TRANSCRIPT
Module 6: Introduction to Survival AnalysisSummer Institute in Statistics for Clinical Research
University of WashingtonJuly, 2019
Elizabeth R. Brown, ScDMember, Fred Hutchinson Cancer Research Center
andResearch Professor
Department of BiostatisticsUniversity of Washington
SESSION 2: ONE-SAMPLE METHODS
OUTLINE
• Session 2: – Censored data – Risk sets– Censoring assumptions– Kaplan-Meier Estimator–Median estimator– Standard errors and Cis– Example
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 2
OUTLINE
• Session 2: – Censored data – Risk sets– Censoring assumptions– Kaplan-Meier Estimator–Median estimator– Standard errors and Cis– Example
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 3
CLINICAL TRIAL
|
|
|
|
|
|
0 2 4 6 8
calendar time
id
|
|
|
|
|
|
65
43
21
D
D
L
A
D
D
Start End
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 4
CENSORED DATA
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 5
|
|
|
|
|
|
0 2 4 6 8
survival time
id
65
43
21
D
D
L
A
D
D
CENSORED DATA
SISCR 2019: Module 6: Intro Survival Elizabeth Brown
“Censored” observations give some information about their survival time.
id Y �1 5 12 3 13 6.5 04 2 05 4 16 1 1|
|
|
|
|
|
0 2 4 6 8
survival time
id
65
43
21
D
D
L
A
D
D
2 - 6
CENSORED DATA
SISCR 2019: Module 6: Intro Survival Elizabeth Brown
“Censored” observations give some information about their survival time.
id Y �1 5 12 3 13 6.5 04 2 05 4 16 1 1|
|
|
|
|
|
0 2 4 6 8
survival time
id
65
43
21
D
D
L
A
D
D
2 - 7
ESTIMATION
• Can we use the partial information in the censored observations?
• Two off-the-top-of-the-head answers:– Full sample: Yes. Count them as observations
that did not experience the event ever and estimate S(t) as if there were not censored observations.
– Reduced sample: No. Omit them from the sample and estimate S(t) from the reduced data as if they were the full data.
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 8
CENSORED DATA
|
|
|
|
|
|
0 2 4 6 8
survival time id
65
43
21
D
D
L
A
D
D
Problem: How to estimate:
Pr[T > 3.5] Pr[T > 6]
Full Sample: 46 = .67 2
6 = .33
Reduced Sample: 24 = .5 0
4 = 0
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 9
CENSORED DATA
Based on the data and estimates on the previous page,
Q: Are the Full Sample estimates biased? Why or why not?
A:
Q: Are the Reduced Sample estimates biased? Why or why not?
A:
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 10
CENSORED DATA
|
|
|
|
|
|
0 2 4 6 8
survival time
id
65
43
21
D
D
L
A
D
D
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 11
RISK SETS
|
|
|
|
|
|
0 2 4 6 8
survival time
id
6
5
4
3
2
1
D
D
L
A
D
D
R1 R2 R3 R4
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 12
OUTLINE
• Session 2: – Censored data – Risk sets– Censoring assumptions– Kaplan-Meier Estimator–Median estimator– Standard errors and Cis– Example
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 13
RISK SETS
|
|
|
|
|
|
0 2 4 6 8
survival time
id
6
5
4
3
2
1
D
D
L
A
D
D
R1{1,2,3,4,5,6}
R2{1,2,3,5}
R3{1,3,5}
R4{1,3}
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 14
CENSORED DATA ASSUMPTION
• Important assumption: subjects who are censored at time t are at the same risk of dying at t as those at risk but not censored at time t.–When would you expect this to be true (or false)
for subjects lost to follow-up?
–When would you expect this to be true (or false) still alive at the time of the analysis?
SISCR 2019: Module 6: Intro Survival Elizabeth Brown
2 - 15
OUTLINE
• Session 2: – Censored data – Risk sets– Censoring assumptions– Kaplan-Meier Estimator–Median estimator– Standard errors and Cis– Example
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 16
CENSORED DATA ASSUMPTION
• Important assumption: subjects who are censored at time t are at the same risk of dying at t as those at risk but not censored at time t.
• This means the risk set at time t is an unbiased sample of the population still alive at time t.
• Can use information from the unbiased risk sets to estimate S(t) using the method of Kaplan and Meier (Product-Limit Estimator).
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 17
USING RISK SETS INFO TO ESTIMATE S(t)
• Repeatedly use the fact that for t2 > t1,
Pr[T > t2] = Pr[T > t2 and T > t1] = Pr[T > t2|T > t1]Pr[T > t1]
• An observation censored between t1 and t2 can contribute to the estimation of Pr[T > t2] by its unbiased contribution to estimation of Pr[T > t1].
0 t2t1
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 18
OUTLINE
• Session 2: – Censored data – Risk sets– Censoring assumptions– Kaplan-Meier Estimator–Median estimator– Standard errors and Cis– Example
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 19
PRODUCT-LIMIT (KAPLAN-MEIER) ESTIMATE
Notation: Let t(1), t(2) . . . , t(J) be the ordered failure times in thesample in ascending order.
t(1) = smallest Y� for which �� = 1 (t(1) = 1 )
t(2) = 2nd smallest Y� for which �� = 1 (t(2) = 3 )...t(J) = largest Y� for which �� = 1 (t(4) = 5 )
Q: Does J = the number of observed deaths in the sample?
A:
Q:When does J = n?
A: SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 20
t(j)
|
|
|
|
|
|
0 2 4 6 8
survival time
id
6
5
4
3
2
1
D
D
L
A
D
D
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 21
MORE NOTATION
For each t(j):
D(j) = number that die at time t(j)S(j) = number known to have survived beyond t(j)
(by convention: includes those known to have beencensored at t(j))
N(j) = number "at risk" of being observed to die at time t(j)(ie: number still alive and under observation just before t(j))
S(j) = N(j) �D(j)
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 22
FOR EXAMPLE DATA● ●● ●
0 2 4 6 8
time
● ●
t(j) N(j) D(j) S(j) Product-limit (Kaplan-Meier) Estimator:
1 6 1 5
3 4 1 3 S(t) = j:t(j)t(1�D(j)N(j)) = j:t(j)t(
S(j)N(j))
4 3 1 25 2 1 1
for t in S(t)
[0, 1) 1 (empty product)
[1,3 ) 1 ⇥ 56 = .833
[3,4 ) 1 ⇥ 56 ⇥
34 = .625
[4,5 ) 1 ⇥ 56 ⇥
34 ⇥
23 = .417
[5,� ) 1 ⇥ 56 ⇥
34 ⇥
23 ⇥
12 = .208SISCR 2019: Module 6: Intro Survival
Elizabeth Brown 2 - 23
K-M ESTIMATOR
0 1 2 3 4 5 6
0.0
0.2
0.4
0.6
0.8
1.0
Survival Function Estimate
t
S(t)
Note: does not descend to zero here (since last observation is censored).
Q: Since the estimate jumps only at observed death times, how does information from the censored observations contribute to it?
A:
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 24
OUTLINE
• Session 2: – Censored data – Risk sets– Censoring assumptions– Kaplan-Meier Estimator–Median estimator– Standard errors and Cis– Example
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 25
MEDIAN SURVIVAL CENSORED DATA
0.0
0.2
0.4
0.6
0.8
1.0
Median Estimate, Censored Data
t
S(t)
1 2 3 median 5 6
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 26
OUTLINE
• Session 2: – Censored data – Risk sets– Censoring assumptions– Kaplan-Meier Estimator–Median estimator– Standard errors and CIs– Example
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 27
KM STANDARD ERRORS
Greenwood’s Formula:
•’V�r(S(t)) = S2(t)P
j:t(j)tD(j)
N(j)S(j)
• se(S(t)) =∆’V�r(S(t))
• Pointwise CI: (S(t)� z �2se(S(t)), S(t) + z �
2se(S(t)))
– Can include values < 0 or > 1.
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 28
LOG –LOG KM STANDARD ERRORS
Use complementary log log transformation to keep CI within (0,1):
•’V�r(log(� log(S(t)))) =P
j:t(j)tD(j)
N(j)S(j)
[log(S(t))]2
• se =∆’V�r(log(� log(S(t))))
• CI for log(� log(S(t))) :(log(� log(S(t)))� z �
2se, log(� log(S(t))) + z �
2se)
• CI for S(t) : ([S(t)]ez�/2se , [S(t)]e
�z�/2se)
– CI remains within (0,1).
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 29
GREENWOOD’S FORMULA
0 1 2 3 4 5 6
0.0
0.2
0.4
0.6
0.8
1.0
Survival Function Estimate
t
S(t)
SISCR 2019: Module 6: Intro Survival
Elizabeth Brown2 - 30
COMPLEMENTARY LOG-LOG
0 1 2 3 4 5 6
0.0
0.2
0.4
0.6
0.8
1.0
Survival Function Estimate
t
S(t)
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 31
MEDIAN CONFIDENCE INTERVAL
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 32
Confidence interval for the median is obtained by inverting the signtest of H0 : median = M (Brookmeyer and Crowley, 1982).
• With complete data T1, T2, . . . , Tn, the sign test ofH0 : median = M is performed by seeing if the observedproportion, P[Y > M] is too big or too small (BinomialDistribution or Normal Approximation).
• With censored data (Y1, �1), (Y2, �2), . . . , (Yn, �n) givingincomplete data about T1, T2, . . . , Tn, we cannot always tellwhether T� > M:
When Y� M, �� = 1 observed death before M we know T� MWhen Y� > M observed death after M we know T� > MWhen Y� M, �� = 0 censored before M we don’t know if
T� M or T� > M
MEDIAN CONFIDENCE INTERVAL
Solution: Following Efron (self-consistency of KM), we estimatePr[T > M] when Y� M, �� = 0 using S(M)
S(Y�).
• For complete data, we let U� =⇢1 T� > M0 T� M
and our test is based onPn
�=1U�.
• For censored data, we let U� =
8<:
1 Y� > MS(M)S(Y�)
Y� M; �� = 00 Y� M; �� = 1
and our test is based onPn
�=1U�.
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 33
MEDIAN CONFIDENCE INTERVAL
• It turns out, this is the same as basing our test ofH0 :median = M on a test of H0 : S(M) = 1
2 .
• So a 95% CI for the median contains all potential M for whichthe test of H0 : S(M) = 1
2 cannot reject at � = .05 (2 sided).
• Since S(M) only changes value at observed event times, thetest need only be checked at M = t(1), t(2), . . . , t(J).
• Originally proposed for Greenwood’s formula CIs for S(M), butany good CIs are OK.
• Implemented in many software packages.
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 34
MEDIAN CONFIDENCE INTERVAL
Median Confidence Interval, Censored Data
t
S(t)
1 2 3 median 5 6
00.5
1
| ●
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 35
OUTLINE
• Session 2: – Censored data – Risk sets– Censoring assumptions– Kaplan-Meier Estimator–Median estimator– Standard errors and Cis– Example
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 36
COLON CANCER EXAMPLE
• Clinical trial at Mayo Clinic (Moertel et al. (1990) NEJM)
• Stage B2 and C colon cancer patients; adjuvant therapy
• Three arms– Observation only– Levamisole– 5-FU + Levamisole
• Stage C patients only• Two treatment arms only
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 37
COLON CANCER EXAMPLE
0 500 1000 1500 2000 2500 3000
0.0
0.2
0.4
0.6
0.8
1.0
Days from Diagnosis
Surv
ival P
roba
bilit
y
LevLev+5FU
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 38
COLON CANCER EXAMPLE
0 500 1000 1500 2000 2500 3000
0.0
0.2
0.4
0.6
0.8
1.0
Days from Diagnosis
Surv
ival P
roba
bilit
y
LevLev+5FU
Greenwood's Formula
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 39
COLON CANCER EXAMPLE
0 500 1000 1500 2000 2500 3000
0.0
0.2
0.4
0.6
0.8
1.0
Days from Diagnosis
Surv
ival P
roba
bilit
y
LevLev+5FU
Complementary log−log Transformation
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 40
PRESENTATION
N Events Median(days)
95% CI
LevamisoleOnly
310 161 2152 (1509, ∞)
5FU + Levamisole
304 123 -- (2725, ∞)
SISCR 2019: Module 6: Intro Survival Elizabeth Brown
2 - 41
COLON CANCER EXAMPLE
0 500 1000 1500 2000 2500 3000
0.0
0.2
0.4
0.6
0.8
1.0
Days from Diagnosis
Surv
ival P
roba
bilit
y
LevLev+5FU
Complementary log−log Transformation
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 42
ESTIMATION
• Estimate S(t) using KM curve (nonparametric).– Pointwise standard errors and CIs– Almost always presented– Not appropriate when the event of interest
happens only to some (more on this Friday)• Median: based on KM curve: often presented (too
often?)
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 43
TO WATCH OUT FOR
• Mean survival time hard to estimate without parametric assumptions– Censoring means incomplete information about
largest times– Mean over restricted time interval may be useful in
some settings (some on this tomorrow)• Median estimate more complicated than median of
times• Even with CIs, evaluating differences between curves
visually is subjective• Interpretation of survival function estimates depends on
validity of censoring assumptions
SISCR 2019: Module 6: Intro Survival Elizabeth Brown 2 - 44