a unified approach for assessing agreement lawrence lin, baxter healthcare a. s. hedayat, university...

A Unified Approach for Assessing Agreement

Lawrence Lin, Baxter Healthcare A. S. Hedayat, University of Illinois at

Chicago Wenting Wu, Mayo Clinic

Outline

IntroductionExisting approachesA unified approachSimulation studiesExamples

Introduction Different situations for agreement

Two raters, each with single readingMore than two raters, each with single readingMore than two raters, each with multiple readings• Agreement within a rater• Agreement among raters based on means• Agreement among raters based on individual

readings

Existing Approaches (1)

Agreement between two raters, each with single reading

Categorical data: • Kappa and weighted kappa

Continuous data: • Concordance Correlation Coefficient (CCC)• Intraclass Correlation Coefficient (ICC)

Agreement among more than two raters, each with single reading

Lin (1989): no inferenceBarnhart, Haber and Song (2001, 2002): GEEKing and Chinchilli (2001, 2001): U-statisticsCarrasco and Jover (2003): variance components

Agreement among more than two raters, each with multiple readings

Barnhart (2005)• Intra-rater/ inter-rater (based on

means) /total (based on individual observations) agreement

• GEE method to model the first and second moments

Unified Approach

Agreement among k (k≥2) raters, with each rater measures each of the n subjects multiple (m) times.Separate intra-rater agreement and inter-rater agreementMeasure relative agreement, precision, accuracy, and absolute agreement, Total Deviation Index (TDI) and Coverage Probability (CP)

Unified Approach - summary

Using GEE method to estimate all agreement indices and their inferencesAll agreement indices are expressed as functions of variance componentsData: continuous/binary/ordinaryMost current popular methods become special cases of this approach

Unified Approach - model

Set up

subject effect subject by rater effect error effect

rater effect

ijlijjiijl ey ),0(: 2

),0(: 2eijle

),0(: 2ij

ni ,...,2,1

kj ,...,2,1

ml ,...2,1

)1()(1

mj ,...,2,1

Unified Approach - targets

Intra-rater agreement: overall, are k raters consistent with themselves?

Inter-rater agreement: Inter-rater agreement (agreement based on mean): overall, are k raters agree with each other based on the average of m readings?Total agreement (agreement based on individual reading): overall, are k raters agree with each other based on individual of the m readings?

Unified Approach – agreement(intra)

: for over all k raters, how well is each rater in reproducing his readings?

]...,|)1()([

])1()([1

,,2,11

tindependenyyymyyE

ijmijij

lijijl

Intrac

Intrac,

Unified Approach – precision(intra) and MSD

: for any rater j, the proportion of the variance that is attributable to the subjects (same as )Examine the absolute agreement independent of the total data range:

Intrac,

2 2)( eijlijl yyEMSD

Unified Approach – TDI(intra) : for each rater j, % of observations are within unit of their replicated readings from the same rater.

is the cumulative normal distribution is the absolute value

21 2)2

)( Intra )100*()( Intra

Unified Approach – CP(intra)

: for each rater j, of observations are within unit of their replicated readings from the same rater

)( Intra %100*)( Intra

)]2/(1[21 2)( eIntra

Unified Approach – agreement(inter)

: for over all k raters, how well are raters in reproducing each others based on the average of the multiple readings?

],,...,,|)1()([

)]1()([

..2.11

tindependenyyykyyE

Interc

Interc,

Unified Approach – precision(inter) : for any two raters, the proportion of the variance that is attributable to the subjects based on the average of the m readings

)var()var(

),cov(

ijijInter

me /222

Unified Approach – accuracy(inter)

: how close are the means of different raters:

eIntera

InteraInterInterc ,,

Intera,

Unified Approach – TDI(inter)

: for overall k raters, % of the average readings are within unit of the replicated averaged readings from the other rater.

)100*(

me /222)2

11( 2221

)( Inter

Unified Approach – CP(inter)

: for each rater j, of averaged readings are within unit of replicated averaged readings from the other rater

)( Inter %100*)( Inter

)]/222/(1[21 222)( meIntra

Unified Approach – agreement(total)

: for over all k raters, how well are raters in reproducing each others based on the individual readings?

],,...,,|)1()([

])1()([

tindependenyyykyyE

ikllili

jliijl

totalc

Totalc,

Unified Approach – precision(total)

: for any two raters, the proportion of the variance that is attributable to the subjects based on the individual readings

)var()var(

),cov(

lijijl

lijijltotal

Unified Approach – accuracy(total)

: how close are the means of different raters (accuracy)

etotala

totala,

totalatotalTotalc ,,

Unified Approach – TDI(total)

: for overall k raters, % of the readings are within unit of the replicated readings from the other rater.

)(Total)100*(

2221 222)2

)(Total

Unified Approach – CP(total)

: for each rater j, of readings are within unit of replicated readings from the other rater

)(Total %100*)(Total

)]222/(1[21 222)( eTotal

Unified Approach

is the inverse cumulative normal distribution is a central Chi-squre distribution with df=1

11(1 Q

Statistics INTRA INTER TOTAL M=1

Agreement

Precision

Accuracy NA

me /2222

me /222

22 e me /222 222 222 222 e 22 22 e

IntraMSDQ InterMSDQ TotalMSDQ MSDQ

IntraMSD

InterMSD

TotalMSD

Estimation and Inference

Estimate all means, variance components,

and their variances and covariances by GEE methodEstimate all indices using above estimatesEstimate variances of all indices using above estimates and delta method

222221 ,,,,,,,, ek

Estimation and Inference (2)

: the covariance of two replications,

and ,with coming from rater and

coming from rater

''lljjjl

1 1' 1 1'''

'l l'l

: the variance from each combination

of (i, j), i.e., each cell. Thus is the average of all cells’ variances.

: the variance of replication of rater : the covariance of two replications, and

, both of them coming from rater .

mBA e /222

Using GEE method to estimate all indices through estimating the means and all variance components: 2222

21 ,,,,,...,, ek

iiiii YYHF

1' 0)(

Estimation and Inference (7)myyyA ijmijijj /)..( 21

)1(/)(1

kkyyBk

jjijij

)1(/]))([(2 21

1 1' 1 1'''.''.

kkmyyyyCk

lljlijjlijl

lijijl kmyyD

2. /])1/()([

kmyyyykmyyEk

lljlijljlijl

ljlijl

1 1''.'.

2. /]))([(2/)(

is the working variance-covariance structure of , “working” means assume following normal distribution is the derivative matrix of expectation of with respective to all the parameters

iYYiYY

),,,,,...,(/ 22221 ekiiF

Estimation and Inference (9)GEE method provides:

estimates of all meansestimates of all variance componentsestimates of variances for all variance componentsEstimates of covariances between any two variance components

Delta method is used to estimate the variances for all indices

22222,

)var()],cov(2)var()[var()1()var(

eIntracIntracIntrac

2222,,

)],cov(),[cov()1(2

eeIntracIntrac

22222,

)var(/)var()[var()()var()1()var(

eIntercIntercInterc

222222

]/),cov(2/),cov(2),cov(2

222222,,

]/),cov(),cov(),[cov()1(2

eIntercInterc

)var()var()[var()()var()1()var(

eTotalcTotalcTotalc

222222

)],cov(2),cov(2),cov(2

222222,,

)],cov(),cov(),[cov()1(2

eTotalcTotalc

22222222

]/),cov(2/)var()[var()()var()1()var(

eeInterInterInter

]/),cov(),[cov()1(2

eInterInter

2222222

)],cov(2)var()[var()()var()1()var(

eeTotalTotalTotal

)],cov(),[cov()1(2

eTotalTotal

2222222,

/),cov(2/)var()var()[var()1()var(

eeInteraIntera

222222,,

)],cov(/),cov(),[cov()1(2

eInteraIntera

)var(]/),cov(2),cov(2

Interae

222222,

),cov(2)var()var()[var()1()var(

eeTotalaTotala

)var()],cov(2),cov(2

Totalae

222222,,

)],cov(),cov(),[cov()1(2

eTotalaTotala

)var(4)var( 22eIntra

]),cov(),cov(

),cov()var(

)var()[var(4)var(2222

mmmeee

)],cov(),cov(),cov()var()var()[var(4)var( 2222222222eeeTotal

Estimation and Inference (18)Transformations for variances

Z-transformation: CCC-indices and precision indices

Logit-transformation: accuracy and CP indices

Log-transformation: TDI indices

)ln( 2e

Simulation Studythree types of data: binary/ordinary/normalthree cases for each type of data

k=2, m=1 / k=4, m=1 / k=2, m=3

for each case: 1000 random samples with sample size n=20for binary and ordinary data: inferences obtained through transformation vs. no-transformationFor normal data: transformation

Simulation Study (2)Conclusions:

Algorithm works well for three types of data, both in estimates and in inferencesFor binary and ordinary data: no need for transformationFor normal data, Carrasco’s method is superior than us, but for categorical data, our is superior. For ordinal data, both Carrasco’s method and ours are similar.

Example One

Sigma method vs. HemoCue method in measuring the DCHLb level in patients’ serum299 samples: each sample collected twice by each method Range: 50-2000 mg/dL

Example One – HemoCue method

HemoCue method first readings vs. second readings

Example One – Sigma method

Sigma method first readings vs. second readings

Example One – HemoCue vs. Sigma

HemoCue’s averages vs. Sigma’s averages

Example One – analysis result (1)

Statistics Estimates 95% CI* Allowance

ccc_inter 0.9866 0.9818 0.9775

ccc_total 0.9859 0.9809

precision_intra 0.9986 0.9982 0.9943

precision_inter 0.9866 0.9818

precision_total 0.9860 0.9809

accuracy_inter 0.9999 0.9974

accuracy_total 0.9999 0.9974

Example One – analysis result (2)

*: for all CCC, precision, accuracy and CP indices, the 95% lower limits are

reported. For all TDI indices, the 95% upper limit are reported.

Statistics Estimates 95% CI* Allowance

TDIintra(0.9) 41.0903 47.2713 75

TDIinter(0.9) 127.273 149.799 150

TDItotal(0.9) 130.548 152.678

CPintra(75) 0.9973 0.9942 0.9

CPinter(150) 0.9475 0.9170 0.9

CPintra(150) 0.9412 0.9102

Example Two

Hemagglutinin Inhibition (HAI) assay for antibody to Influenza A (H3N2) in rabbit serum samples from two labs64 rabbit serum samples: measured twice by each labAntibody level: negative/positive/highly positive

Example Two – Lab one

Second Reading

First Reading

Negative

Positive Highly positive

Negative

Positive 0 49 0

Highly positive

Example Two – Lab two

Second Reading

First Reading

Negative Positive Highly positive

Negative 2 0 0

Positive 0 22 2

Highly positive

0 5 33

Example Two: Lab one vs. lab two

Lab Two First Reading

Lab OneFirst Reading

Negative Positive Highly positiv

Negative 2 5 0

Positive 0 19 30

Highly positive

Example Two: lab one vs. lab two

Lab Two Second Reading

Lab OneSecond Reading

Negative Positive Highly positive

Negative 2 4 0

Positive 0 23 27

Highly positive

Example TwoStatistics Estimates 95% CI* Allowance

ccc_inter 0.37225 0.22039 0.4375

ccc_total 0.35776 0.20970

precision_intra

0.88361 0.79692 0.75

precision_inter

0.56795 0.4359

precision_total

0.53489 0.39999

accuracy_inter

0.65543 0.51586

accuracy_total

0.66885 0.53561

Conclusions (1)

When data are continuous and m goes to ∞:

agreement indices are the same as that proposed by Barnhart (2005), both in estimates and inferencesimprovements• Precision indices, accuracy indices TDIs

and CP• Variance components

Conclusions (2)When m=1:

agreement index degenerates into OCCC as proposed by King (2002), Carrasco (2003) for continuous data Improvements:

• For categorical data:– King’s method: approximates to kappa and weighted

kappa, our estimates (without transformation) are exactly the same as kappa and weighted kappa, both in estimate and in inference.

– Our estimates superior to Carrasco’s estimates when precision and accuracy are high

• Covariates adjustment become available

Conclusions (3)

When data are continuous, k=2 and m=1:

agreement index degenerates to the original CCC by Lin (1989)

When data are binary, k=2 and m=1:

agreement index degenerates into kappa, both in estimate and inference

Conclusions (4)When data are ordinary, k=2 and m=1:

agreement index degenerates into weighted kappa with below weight set, both in estimate and in inference.

jiwij ,...,2,1,,

Conclusions (5)

Unified approach Relative agreement indices: CCC with precision and accuracy – data rangeAbsolute agreement: Total deviation indices and Coverage Probability – normal assumptionLink function need more workRequire balanced data

ReferencesBarkto, John J (1966): The intraclass correlation coefficient as a measure of reliability. Pshchological Reports 19, 3-11.Barnhart, H. X. and Williamson, J. M. (2001). Modeling concordance correlation via GEE to evaluate reproducibility. Biometrics 57, 931-940.Barnhart, H. X. Song, Jingli and Haber, Michael J. (2005): Assessing intra, inter and total agreement with replicated readings. Statistics in Medicine 19: 255-270.Carrasco, J. L. and Jover, L. (2003). Estimating the generalized concordance correlation coefficient through variance components. Biometrics 59, 849-858.Fleiss, J., Cohen, J. and Everitt, B (1969). Large sample standard errors of kappa and weighted kappa. Psychological Bulletin 72, 323-327.King, Tonya S. and Chinchilli, Vernon M. (2001): A generalized concordance correlation coefficient for continuous and categorical data. Statistics in Medicine 20: 2131-2147.Lin, L. I. (1989). A concordance correlation coefficient to evaluate reproducibility. Biometrics 45, 255-268.Lin, L. I., Hedayat, A. S., Sinha, B., and Yang, M. (2002). Statistical methods in assessing agreement: models, issues & tools. Journal of American Statistical Association 97(457), 257-270.Wu, Wenting. A unified approach for assessing agreement. Ph.D. thesis, UIC, 2006

a unified approach for assessing agreement lawrence lin, baxter healthcare a. s. hedayat, university...

approach slide

rater slide

total agreement agreement

multiple readings agreement

inter rater agreement

unified approach model

unified approach cpintra

unified approach tditotal

Documents

sadeq hedayat, blind owl

liberty mutual insurance v. jaren baxter, joanne baxter

expense claims for: mr. adam hedayat, board of directors

toop morvari-hedayat

richard baxter

hedayat - tarane haaye khayyam

larry baxter

discretized streams: fault-tolerant streaming computation at...

sharhi hedayat

baxter ranch

droom de kunsten · hedayat video en projectie maryam...

tekst: ruud wenting fotografie: bertel kolthof m.m.v....

sadeq hedayat traducteur de la métamorphose de kafka par...

kamyar m. hedayat jean-claude lapraz la théorie

hedayat, sadeq - the mongol's shadow

introducing gasless spray tip - baxter€¦ · baxter...

40402104 sadeq hedayat life and legend of iranian writer...

baxter healthcare of pr – aibonito - newmoa · baxter...

shama e hedayat aur tablighi jamat

sadegh hedayat - short storiesl