1 chapter 10 screening for disease. 2 terminology reliabilityagreementreliability ≡ agreement of...

1

Chapter 10

Screening for Disease

2

Terminology• ReliabilityReliability ≡ agreementagreement of

ratings/diagnoses, “reproducibility”– Inter-rater reliabilityInter-rater reliability ≡

agreement between two independent raters

– Intra-rater reliabilityIntra-rater reliability ≡ agreement of the same rater with him/herself

• ValidityValidity ≡ ability to discriminate without error

• AccuracyAccuracy ≡ a combination of reliability and validity

3

Inter-Rater Reliability• Two independent

raters• Cross-tabulate • Observed Observed

proportion in proportion in agreement agreement NOTNOT adequate because a certain amount of agreement is due to chance

Rater B

Rater A + − Total

+ a b g1

− c d g2

Total f1 f2 N

4

Kappa (κ)

1 1 2 2e 2= [Expected agreement due to chance]f g f g

pN

Rater B

Rater A + − Total

+ a b g1

− c d g2

Total f1 f2 N

o = [Observed agreement; chance corrected]a d

p notN

o e

e

1

p p

p

[Agreement corrected for chance]

5

κ Benchmarks

6

Example 1: Flip two coins

o e

e

.5 .5 0.0 [no extra agreement above chance]

1 1 .5

p p

p

Toss B

Toss A Heads Tails Total

Heads 25 25 50

Tails 25 25 50

Total 50 50 100

To what extent are results reproducible?

1 1 2 2e 2 2

50 50 50 50.5

100

f g f gp

N

25 25= = .5 [Overall agreement is 50%]

100o

a dp

N

7

Example 2

obs exp

exp

.91 .6276 .758

1 1 .6276

p p

p

Rater B

Rater A + − Total

+ 20 4 24

− 5 71 76

Total 25 75 100

To what extent are these diagnoses reproducible?

“substantial” agreement

1 1 2 2exp 2 2

25 24 75 76.6276

100

f g f gp

N

20 71= = .9100

100obs

a dp

N

8

§10.3 Validity• Compare screening test results to a gold

standard (“definitive diagnosis”)• Each patient is classified as either true positive

(TP), true negative (TN), false positive (FP), or false negative (FN)

Test D+ D− Total

T+ TP FP TP+FP

T− FN TN FN+TN

Total TP+FN FP+TN N

9

SensitivityTest D+ D− Total

T+ TP FP TP+FP

T− FN TN FN+TN

Total TP+FN FP+TN N

SEN ≡ proportion of cases that test positive

FNTP

TP

w/diseasethose

TPSEN

10

Specificity

SPEC ≡ proportion of noncases that test negative

Test D+ D− Total

T+ TP FP TP+FP

T− FN TN FN+TN

Total TP+FN FP+TN N

FPTN

TN

disease w/out those

TNSPEC

11

Predictive Value PositiveTest D+ D− Total

T+ TP FP TP+FP

T− FN TN FN+TN

Total TP+FN FP+TN N

PVP ≡ proportion of positive tests that are true cases

FPTP

TP

positive test whothose

TPPVP

12

Predictive Value NegativeTest D+ D− Total

T+ TP FP TP+FP

T− FN TN FN+TN

Total TP+FN FP+TN N

PVN ≡ proportion of negative tests that are true non-cases

FNTN

TN

negative test whothose

TNPVN

13

Prevalence

• [True] prevalence = (TP + FN) / N

• Apparent prevalence = (TP + FP) / N

Test D+ D− Total

T+ TP FP TP+FP

T− FN TN FN+TN

Total TP+FN FP+TN N

15

ExampleLow Prevalence Population

D+ D− Total

T+

T−

Total 1000 1,000,000

Conditions: N = 1,000,000; Prevalence = .001

Prevalence = (those with disease) / N

Therefore:

(Those with disease) = Prevalence × N

= .001× 1,000,000 = 1000

16

Example: Low Prevalence Population

D+ D− Total

T+

T−

Total 1000 999,000 1,000,000

Number of non-cases, i.e., TN + FP

1,000,000 – 1,000 = 999,000

17


D+ D− Total

T+ 990

T−

Total 1000

TP = SEN × (those with disease) = 0.99 × 1000 = 990

Assume test SENsitivity = .99, i.e., Test will pick up 99% of those with disease

18


D+ D− Total

T+ 990

T− 10Total 1000

FN = 1000 – 990 = 10

It follows that:

19


D+ D− Total

T+

T− 989,010Total 999,000

TN = SPEC × (those without disease) = 0.99 × 999,000 = 989,010

Suppose test SPECificity = .99i.e., it will correctly identify 99% of the noncases

20


D+ D− Total

T+ 9,990T− 989,010

Total 999,000

FPs = 999,000 – 989,010 = 9,900

It follows that:

21


D+ D− Total

T+ 990 9,990 10,980

T− 10 989,010 989,020

Total 1000 999,000 1,000,000

PVPT = TP / (TP + FP) = 990 / 10,980 = 0.090

Strikingly low PVP!

It follows that the Predictive Value Positive is :

22


D+ D− Total

T+ 990 9,990 10,980

T− 10 989,010 989,020

Total 1000 999,000 1,000,000

PVNT= TN / (all those who test negative) = 989010 / 989020 = .9999

It follows that the Predictive Value Negative is:

23

Example: High prevalence population

D+ D− Total

T+ 99,000 9,000 108,000

T− 1,000 891,000 892,000

Total 100,000 900,000 1,000,000

SEN = 99000 / 100,000 = 0.99

SPEC = 891,000 / 900,000 = 0.99

Prev = 100000 / 1,000,000 = 0.10

Same test parameters but used in population with true prevalence of .10

24

Example: High prevalence population

D+ D− Total

T+ 99,000 9,000 108,000

T− 1,000 891,000 892,000

Total 100,000 900,000 1,000,000

PVP = 99,000 / 108,000 = 0.92

PVN = 891,000 / 892,000 = 0.9989

Prevalence = 100000 / 1,000,000 = 0.10

An HIV screening test is used in one million people. Prevalence in population is now 10%. SEN and SPEC are again 99%.

25

PVPT and Prevalence• As PREValence

goes down, PVPT is affected

• Figure shows relation between PVP, PREV, & SPEC (test SEN = constant .99)

26

Screening Strategy• First stage

high SENS (don’t want to miss cases)

• Second stage high SPEC (sort out false positives from true positives)

27

Selecting a Cutoff Point• There is often an overlap in test results

for diseased and non-diseased population

• Sensitivity and specificity are influenced by the chosen cutoff point used to determine positive results

• Example: Immunofluorescence test for HIV based on optical density ratio (next slide)

28

Low Cutoff High sensitivity and low specificity

29

High Cutoff Low sensitivity and high specificity

30

Intermediate Cutoff moderate sensitivity & moderate specificity

1 chapter 10 screening for disease. 2 terminology reliabilityagreementreliability ≡ agreement of...

Documents