partially identifled prevalence estimation under ... · the signal-tandmobiel°r study † 6 years...

23
Partially Identified Prevalence Estimation under Misclassification using the Kappa-Coefficient Anne Kunz Munich Cancer Registry IBE, Biometry and Bioinformatics, University of Munich (LMU) [email protected] Thomas Augustin Department of Statistics, University of Munich (LMU) [email protected] Helmut K¨ uchenhoff Department of Statistics, University of Munich (LMU) kuechenhoff@stat.uni-muenchen.de Anne Kunz, Thomas Augustin, Helmut K¨ uchenhoff, LMU M¨ unchen

Upload: others

Post on 15-Aug-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Partially Identifled Prevalence Estimation under ... · The Signal-Tandmobiel°R Study † 6 years longitudinal oral health study (1996 - 2001) † 4468 children in Flanders (Belgium)

Partially Identified Prevalence Estimation under Misclassificationusing the Kappa-Coefficient

Anne Kunz

Munich Cancer Registry

IBE, Biometry and Bioinformatics, University of Munich (LMU)

[email protected]

Thomas Augustin

Department of Statistics, University of Munich (LMU)

[email protected]

Helmut Kuchenhoff

Department of Statistics, University of Munich (LMU)

[email protected]

Anne Kunz, Thomas Augustin, Helmut Kuchenhoff, LMU Munchen

Page 2: Partially Identifled Prevalence Estimation under ... · The Signal-Tandmobiel°R Study † 6 years longitudinal oral health study (1996 - 2001) † 4468 children in Flanders (Belgium)

1. Misclassification

• Interest in

Yi ={

1 diseased0 not diseased

p := P(Y = 1) prevalence

• Often only available

Y ∗i =

{1 test positive0 test negative

p∗ := P(Y ∗ = 1) naive prevalence

• Y ∗i ←→ Yi misclassification

Anne Kunz, Thomas Augustin, Helmut Kuchenhoff, LMU Munchen 1

Page 3: Partially Identifled Prevalence Estimation under ... · The Signal-Tandmobiel°R Study † 6 years longitudinal oral health study (1996 - 2001) † 4468 children in Flanders (Belgium)

Anne Kunz, Thomas Augustin, Helmut Kuchenhoff, LMU Munchen 2

Page 4: Partially Identifled Prevalence Estimation under ... · The Signal-Tandmobiel°R Study † 6 years longitudinal oral health study (1996 - 2001) † 4468 children in Flanders (Belgium)

The Signal-Tandmobiel R© Study

• 6 years longitudinal oral health study (1996 - 2001)

• 4468 children in Flanders (Belgium)

• Annual examinations

• Additional calibration studies 1996, 1998, 2000

• Presence/absence of caries

• Y ∗i =

{1 caries observed0 no caries observed

←→ Yi ={

1 caries0 no caries

Anne Kunz, Thomas Augustin, Helmut Kuchenhoff, LMU Munchen 3

Page 5: Partially Identifled Prevalence Estimation under ... · The Signal-Tandmobiel°R Study † 6 years longitudinal oral health study (1996 - 2001) † 4468 children in Flanders (Belgium)

2. Misclassification Bias

Y = 1 Y = 0

Y ∗ = 1 P(Y ∗ = 1|Y = 1)

sensitivity

P(Y ∗ = 1|Y = 0)

false positive

Y ∗ = 0 P(Y ∗ = 0|Y = 1)

false negative

P(Y ∗ = 0|Y = 0)

specificity

Assume throughout reasonable quality of the test: sens + spec > 1. (1)

Prevalence estimation based on misclassified data may be severely biased.Anne Kunz, Thomas Augustin, Helmut Kuchenhoff, LMU Munchen 4

Page 6: Partially Identifled Prevalence Estimation under ... · The Signal-Tandmobiel°R Study † 6 years longitudinal oral health study (1996 - 2001) † 4468 children in Flanders (Belgium)

p∗ = p · sens + (1− p) · (1− spec) = (2)

= p · sens + 1− spec− p =

= p (sens + spec− 1) + (1− spec)

Anne Kunz, Thomas Augustin, Helmut Kuchenhoff, LMU Munchen 5

Page 7: Partially Identifled Prevalence Estimation under ... · The Signal-Tandmobiel°R Study † 6 years longitudinal oral health study (1996 - 2001) † 4468 children in Flanders (Belgium)

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

p* = p sens + (1−p)(1−spec)

p

p*

sens

= 1

, sp

ec =

1

sens =

1 , s

pec = 0

.8sens = 1 , spec = 0.6

sens = 1 , spec = 0.4

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

p* = p sens + (1−p)(1−spec)

pp*

sens

= 1

, sp

ec =

1

sens = 0.8 , spec = 0.8

sens = 0.6 , spec = 0.6

sens = 0.8 , spec = 0.4

Figure 1: Illustration of misclassification bias (deviation from the anglebisector).

Anne Kunz, Thomas Augustin, Helmut Kuchenhoff, LMU Munchen 6

Page 8: Partially Identifled Prevalence Estimation under ... · The Signal-Tandmobiel°R Study † 6 years longitudinal oral health study (1996 - 2001) † 4468 children in Flanders (Belgium)

3. Correcting for Misclassification

3.1 The Extreme Cases

• If sens and spec are precisely known, (2) yields (together with (1)) anunbiased corrected prevalence estimator

p =p∗ + spec− 1

sens + spec− 1. (3)

• If there is complete ignorance on sens and spec then the correctedprevalence estimator is vacuous:

p = [0; 1].

Anne Kunz, Thomas Augustin, Helmut Kuchenhoff, LMU Munchen 7

Page 9: Partially Identifled Prevalence Estimation under ... · The Signal-Tandmobiel°R Study † 6 years longitudinal oral health study (1996 - 2001) † 4468 children in Flanders (Belgium)

3.2 Use Additional Knowledge: kappa Coefficient

• Quite often two replicated measurements (→ two raters)

• In particular in medicine, common characterization of the quality ofmeasurements by kappa, a coefficient of inter-rater agreement

Y ∗(2) Y ∗(1) 1 01 p11 p10

0 p01 p00 κ =po − pe

1− pe(4)

with pjk = P (Y ∗(1) = j, Y ∗(2) = k), po = p00 + p11

pe = (p00 + p01) · (p00 + p10) + (p10 + p11) · (p01 + p11)

Anne Kunz, Thomas Augustin, Helmut Kuchenhoff, LMU Munchen 8

Page 10: Partially Identifled Prevalence Estimation under ... · The Signal-Tandmobiel°R Study † 6 years longitudinal oral health study (1996 - 2001) † 4468 children in Flanders (Belgium)

Classification of κ according to Landis & KochLandis and Koch (1977) proposed a widely used classification of the kappa-statistic (see Table 1) to ”maintain consistent nomenclature when describ-ing the relative strength of agreement associated with kappa statistics[...]. Although these divisions are clearly arbitrary, they do provide useful’benchmarks’ for the discussion”.

Table 1: Classification of κ according to Landis & Koch (and alternativecommon terminology)

kappa-statistic strength of agreement

≤ 0.00 poor0.01 - 0.20 slight (insufficient)0.21 - 0.40 fair (satisfactory)0.41 - 0.60 moderate (sufficient)0.61 - 0.80 substantial (good)0.81 - 1.00 almost perfect (excellent)

Anne Kunz, Thomas Augustin, Helmut Kuchenhoff, LMU Munchen 9

Page 11: Partially Identifled Prevalence Estimation under ... · The Signal-Tandmobiel°R Study † 6 years longitudinal oral health study (1996 - 2001) † 4468 children in Flanders (Belgium)

Under the assumptions

(A1) Independent conditional distributions Y ∗1 |Y and Y ∗

2 |Y for both replicates

(A2) Equal sensitivity and specificity for both replicates

the following equation can be deduced:

κ =p (1− p) (sens + spec− 1)2

(spec− p (sens + spec− 1)) · (1− spec + p (sens + spec− 1))(5)

Together with (see (2) above)

p∗ = p · sens + (1− p) · (1− spec)

this does not yield a unique solution.

Anne Kunz, Thomas Augustin, Helmut Kuchenhoff, LMU Munchen 10

Page 12: Partially Identifled Prevalence Estimation under ... · The Signal-Tandmobiel°R Study † 6 years longitudinal oral health study (1996 - 2001) † 4468 children in Flanders (Belgium)

3.3 Point Identification through Additional ConstraintsTraditional way to proceed: add additional assumptions leading to pointidentification: sens = spec, or more general by sens

spec =: γ known.

Then the system described by (4) and (5) indeed has a unique solution,leading to

p (γ, p∗, κ) =(1− p∗) · γ − p∗ −√w

(p∗ − 1) · γ2 + (1−√w) · γ − p∗ −√w(6)

with w =√

(p∗ − 1)2 · γ2 − 2 · p∗ · (p∗ − 1) · (2 · κ− 1) · γ + (p∗)2

Anne Kunz, Thomas Augustin, Helmut Kuchenhoff, LMU Munchen 11

Page 13: Partially Identifled Prevalence Estimation under ... · The Signal-Tandmobiel°R Study † 6 years longitudinal oral health study (1996 - 2001) † 4468 children in Flanders (Belgium)

3.4 Partial Identification – Basic Ideas

• Point identification only possible under non-testable assumptions

• Manski’s (2003) law of decreasing credibility: The credibility of inferencedecreases with the strength of the assumptions maintained.

• Look at empirical evidence alone, consider all models compatible withthe data.

• Parallel development in econometrics partial identification (initiated byManski) and in biometrics systematic sensitivity analysis (e.g. Vanstee-landt et al. (2006, Stat. Sinica))

Anne Kunz, Thomas Augustin, Helmut Kuchenhoff, LMU Munchen 12

Page 14: Partially Identifled Prevalence Estimation under ... · The Signal-Tandmobiel°R Study † 6 years longitudinal oral health study (1996 - 2001) † 4468 children in Flanders (Belgium)

3.5 Identification Regions for Prevalence EstimationsLet κ > 0 and (A1), (A2) and (1) be satisfied. Then the identifi-cation regions I(z||p∗, κ) = {z satisfying (2) and (5) for given p∗, κ}, forz ∈ {p, sens, spec} are

I(p||p∗, κ) :=[

p∗

p∗ + κ−1(1− p∗);

p∗

p∗ + κ (1− p∗)

](7)

I(sens||p∗, κ) := [p∗ + κ (1− p∗) ; 1] (8)

I(spec||p∗, κ) := [1− p∗ + p∗κ ; 1] (9)

κ → 0 : Ip(p∗, κ) = [0, 1]

κ → 1 : Ip(p∗, κ) = p∗, sens = spec = 1

Anne Kunz, Thomas Augustin, Helmut Kuchenhoff, LMU Munchen 13

Page 15: Partially Identifled Prevalence Estimation under ... · The Signal-Tandmobiel°R Study † 6 years longitudinal oral health study (1996 - 2001) † 4468 children in Flanders (Belgium)

(a) p∗ = 0.1 (b) p∗ = 0.3 (c) p∗ = 0.5

Figure 2: Identification regions for prevalence p

p∗pmax

pmin

@R

¡ª

XXy

Anne Kunz, Thomas Augustin, Helmut Kuchenhoff, LMU Munchen 14

Page 16: Partially Identifled Prevalence Estimation under ... · The Signal-Tandmobiel°R Study † 6 years longitudinal oral health study (1996 - 2001) † 4468 children in Flanders (Belgium)

4. Confidence Intervals forPartially Identified Prevalence Estimation

• Identification regions reflect lack of knowledge (”ignorance“), but do not

take into account sample variability (”uncertainty“).

• Construction of confidence regions (Imbens & Manski (2004, Economet-rica), Stoye (2009, Econometrica); similarly: Vansteelandt, Goetghebeur,Kenword, Molenberghs (2006, Stat. Sinica) used here)

• Identification parameter γ := sensspec ; given γ all parameters would be

identified (cp. section 3.3) 1

1Vansteelandt et. al. (2006) use the term sensitivity parameter, which, however, is not used here toavoid terminological conflict with the sensitivity sens.

Anne Kunz, Thomas Augustin, Helmut Kuchenhoff, LMU Munchen 15

Page 17: Partially Identifled Prevalence Estimation under ... · The Signal-Tandmobiel°R Study † 6 years longitudinal oral health study (1996 - 2001) † 4468 children in Flanders (Belgium)

• γ can be shown to vary between γmin and γmax, where

[γmin, γmax] =[κ + p∗ − κ · p∗; 1

κ · p∗ − p∗ + 1

]. (10)

• Construct confidence interval[L(p∗, κ); U(p∗, κ)

]such that

infγ∈ [γmin, γmax]

Prγ(p ∈ [L(p∗, κ);U(p∗, κ)

]) ≥ 1− α (11)

• Use suitable confidence intervals[L(p∗, κ, γ);U(p∗, κ, γ)

]given γ and

take

[L(p∗, κ);U(p∗, κ)

]:=

γ∈ [γmin, γmax]

[L(p∗, κ, γ);U(p∗, κ, γ)

](12)

Anne Kunz, Thomas Augustin, Helmut Kuchenhoff, LMU Munchen 16

Page 18: Partially Identifled Prevalence Estimation under ... · The Signal-Tandmobiel°R Study † 6 years longitudinal oral health study (1996 - 2001) † 4468 children in Flanders (Belgium)

Consider for fixed γ the point estimator for p derived from (6)

p (γ, p∗, κ) =(1− p∗) · γ − p∗ −

√w

(p∗ − 1) · γ2 + (1−√

w) · γ − p∗ −√

w(13)

with w =√

(p∗ − 1)2 · γ2 − 2 · p∗ · (p∗ − 1) · (2 · κ− 1) · γ + (p∗)2

The asymptotic variance is given by the delta method

V ar(p (γ, p∗, κ)) = DTp Σ Dp (14)

with, Dp as the vector of derivatives of p (γ, p∗, κ) with respect to p∗

and κ, and Σ the covariance matrix of p∗ and κ.

Anne Kunz, Thomas Augustin, Helmut Kuchenhoff, LMU Munchen 17

Page 19: Partially Identifled Prevalence Estimation under ... · The Signal-Tandmobiel°R Study † 6 years longitudinal oral health study (1996 - 2001) † 4468 children in Flanders (Belgium)

Since the relationship (13) between γ and p is monotone, the choice ofthe confidence intervals in (12) can be improved. If all the confidenceintervals given γ are small compared to the uncertainty region, anasymptotic confidence interval is given by

[p(γmax)− z1−α ·

√V ar(p(γmax)); p(γmin) + z1−α ·

√V ar(p(γmin))

]

(15)

sens + spec > 1

Since γmin, γmax estimate γmin and γmax consistently, this is an asymp-totic level (1− α)−confidence interval.

Anne Kunz, Thomas Augustin, Helmut Kuchenhoff, LMU Munchen 18

Page 20: Partially Identifled Prevalence Estimation under ... · The Signal-Tandmobiel°R Study † 6 years longitudinal oral health study (1996 - 2001) † 4468 children in Flanders (Belgium)

5. Results from the Signal-Tandmobiel R© Study

Table 2: Signal-Tandmobielr study: Estimation of p∗ per year

year n p∗ se(p∗)1996 (age 6) 3378 0.118 0.0061998 (age 8) 3657 0.280 0.0072000 (age 10) 3415 0.380 0.008

• For illustration interpret validation measurement as a replicate,

• additionally satisfying (A1) and (A2)

• Further assumption: treat validation sample as a random sample

Anne Kunz, Thomas Augustin, Helmut Kuchenhoff, LMU Munchen 19

Page 21: Partially Identifled Prevalence Estimation under ... · The Signal-Tandmobiel°R Study † 6 years longitudinal oral health study (1996 - 2001) † 4468 children in Flanders (Belgium)

Table 3: Signal-Tandmobielr study: Estimation of κ per year

year n κ se(κ)1996 120 0.575 0.0841998 157 0.602 0.0662000 148 0.746 0.057

Table 4: Signal-Tandmobielr study: Estimated identification regions for p,sens and spec

p ˆsens ˆspecyear p∗ κ min max min max min max

1996 0.118 0.575 0.072 0.189 0.625 1.000 0.950 1.0001998 0.280 0.602 0.190 0.393 0.714 1.000 0.889 1.0002000 0.380 0.746 0.314 0.451 0.843 1.000 0.903 1.000

Anne Kunz, Thomas Augustin, Helmut Kuchenhoff, LMU Munchen 20

Page 22: Partially Identifled Prevalence Estimation under ... · The Signal-Tandmobiel°R Study † 6 years longitudinal oral health study (1996 - 2001) † 4468 children in Flanders (Belgium)

1996 1997 1998 1999 2000

0.0

0.1

0.2

0.3

0.4

0.5

year

1996 1997 1998 1999 2000

0.0

0.1

0.2

0.3

0.4

0.5

year

1996 1997 1998 1999 2000

0.0

0.1

0.2

0.3

0.4

0.5

year

1996 1997 1998 1999 2000

0.0

0.1

0.2

0.3

0.4

0.5

year

naive estimation, asympt. CICI with matrix methodidentification regionconfidence interval

Figure 3: Signal-Tandmobielr: confidence limits for pAnne Kunz, Thomas Augustin, Helmut Kuchenhoff, LMU Munchen 21

Page 23: Partially Identifled Prevalence Estimation under ... · The Signal-Tandmobiel°R Study † 6 years longitudinal oral health study (1996 - 2001) † 4468 children in Flanders (Belgium)

6. Conclusion

• Prevalence estimation based on kappa is an instance where the idea ofpartial identification (econometrics) and systematic sensitivity analysis(biometrics) allows for reliable but still informative conclusions.

• Introducing successively additional assumptions would lead to smaller,but less credible regions.

• Optimize confidence interval (finite sample correction)!

Anne Kunz, Thomas Augustin, Helmut Kuchenhoff, LMU Munchen 22