proportions and confidence intervals in biostatistics

35
Biostatis tics Lecture 9

Upload: helpwithassignment

Post on 13-Jan-2015

144 views

Category:

Education


0 download

DESCRIPTION

Understanding Proportions and Confidence Intervals in Biostatistics. Get more details at http://www.helpwithassignment.com/statistics-assignment-help

TRANSCRIPT

Page 1: Proportions and Confidence Intervals in Biostatistics

Biostatistics

Lecture 9

Page 2: Proportions and Confidence Intervals in Biostatistics

Lecture 8 Review –Proportions and confidence

intervals• Calculation and interpretation of:

– sample proportion– 95% confidence interval for population proportion

• Calculation and interpretation of:––

difference in sample proportions95% confidence interval for difference proportions

in population

Page 3: Proportions and Confidence Intervals in Biostatistics

Single proportion – Inference

Estimated proportion of vivax malaria (p) = 15/100 = 0.15

Standard error of p

p(1 − p)

0.15(1 − 0.15)s e ( p ). . = = = 0.036

n 100

• 95% Confidence interval for π (population proportion)

Lower limit = p - 1.96×s.e.(p) = 0.079

Upper limit = p + 1.96×s.e.(p) = 0.221

Interpretation..

“We are 95% confident, the population proportion (π) of people with vivax

malaria is between 0.079 and 0.221

(or between 7.9% and 22.1%)”

Page 4: Proportions and Confidence Intervals in Biostatistics

Comparing two proportions2×2 table

••

ProportionProportion

Proportion

of all subjects experiencing outcome, p = d/n

in exposed group, p1 = d1/n1

in unexposed group, p0 = d0/n0

With outcome

(diseased)

Without outcome

(disease-free)

Total

Exposed

(group 1)

d1 h1 n1

Unexposed

(group 0)

d0 h0 n0

Total d h n

Page 5: Proportions and Confidence Intervals in Biostatistics

Comparing two proportionsExample – TBM trial

Death during 9 months post start of treatment

Treatment group Yes No Total

Dexamethasone

(group 1)

87(p1=0.318)

187 274

Placebo

(group 0)

112(p0=0.413)

159 271

Total 199 346 545

Page 6: Proportions and Confidence Intervals in Biostatistics

Comparing two proportions - Inference

Example:- TBM trial

Estimate of difference in population proportions

= p1-p0 = -0.095

s.e.(p1-p0) = 0.041

95% CI for difference in population proportions (π1-π0):

-0.095 ± 1.96×0.041

-0.175 up to -0.015 OR -17.5% up to -1.5%

Interpretation:-

“We are 95% confident, that the difference in population proportions is

between -17.5% (dexamethasone reduces the proportion of deaths by a

large amount) and -1.5% (dexamethasone marginally reduces the

proportion of deaths)”.

Page 7: Proportions and Confidence Intervals in Biostatistics

Comparing two proportions (absolute difference):-

Risk difference

Example:- TBM trial

Outcome measure: Death during nine

months treatment.

following start of

Dexamethasone

p1 (incidence risk) = d1/n1 = 87/274 = 0.318

Placebo

p0 (incidence risk)

= d0/n0 = 112/271 = 0.413

p1 – p0 (risk difference) = 0.318 – 0.413 = -0.095 (or -9.5%)

Page 8: Proportions and Confidence Intervals in Biostatistics

Lecture 9 – Measures of association

• 2×2 table (RECAP)

• Measures of association

––

Risk differenceRisk ratio

– Odds ratio

• Calculation & interpretation of confidence interval for

each measure of association

Page 9: Proportions and Confidence Intervals in Biostatistics

2×2 table

••

ProportionProportion

Proportion

of all subjects experiencing outcome, p = d/n

in exposed group, p1 = d1/n1

in unexposed group, p0 = d0/n0

With outcome

(diseased)

Without outcome

(disease-free)

Total

Exposed

(group 1)

d1 h1 n1

Unexposed

(group 0)

d0 h0 n0

Total d h n

Page 10: Proportions and Confidence Intervals in Biostatistics

2×2 table - Measures of association

• Differentbetween

measures of associationoutcome and exposure

• Can calculate confidence intervals and test statistics foreach measure

Measure of Effect Formula

Risk difference(lecture 8)

p1-p0

Risk ratio (relative risk) p1 / p0

Odds ratio (d1/h1) / (d0/h0)

Page 11: Proportions and Confidence Intervals in Biostatistics

2×2 table – TBM trial example

Death during 9 months post start

of treatment

Treatment group Yes No Total Incidence risk of death (p)

Odds of death

Dexamethasone

(group 1)87 (d1)

187 (h1)

274 (n1)

d1 / n1

= 0.318

d1 / h1

= 0.465

Placebo

(group 0)112 (d0)

159 (h0)

271 (n0)

d0 / n0

= 0.413

d0 / h0

= 0.704

Total 199 346 545

Page 12: Proportions and Confidence Intervals in Biostatistics

2×2 table – TBM trial example

• Risk difference = p1-p0 = 0.318 – 0.413 = -0.095 (or -9.5%)

• Risk ratio = p1/p0 = 0.318 / 0.413 = 0.77

• Odds ratio = (d1/h1) / (d0/h0) = 0.465 / 0.704 = 0.66

Death during 9 months post start of treatment

Treatment group Yes No Total

Dexamethasone

(group 1)

87 (d1) 187 (h1) 274 (n1)

Placebo

(group 0)

112 (d0) 159 (h0) 271 (n0)

Total 199 346 545

Page 13: Proportions and Confidence Intervals in Biostatistics

2×2 table – Calculation of Odds Ratio

Commonly given formula for odds ratio(a×d) / (b×c) = (87×159) / (187×112) = 0.66

Death during 9 months post start of treatment

Treatment group Yes No Total

Dexamethasone

(group 1)

87 (a) 187 (b) 274 (n1)

Placebo

(group 0)

112 (c) 159 (d) 271 (n0)

Total 199 346 545

Page 14: Proportions and Confidence Intervals in Biostatistics

2×2 table – Calculation of Odds Ratio

Odds ratio for not dying

= (a×d) / (b×c) = (187×112) / (=1/0.66)

(87×159) = 1.51

Death during 9 months post start of treatment

Treatment group No Yes Total

Dexamethasone

(group 1)

187 (a) 87 (b) 274 (n1)

Placebo

(group 0)

159 (c) 112 (d) 271 (n0)

Total 346 199 545

Page 15: Proportions and Confidence Intervals in Biostatistics

Differences in measures of association• When there is no association between exposure and outcome,

– risk difference = 0

– risk ratio (RR) = 1

– odds ratio (OR) = 1

• Risk difference can be negative or positive

• RR & OR are always positive

• For rare outcomes, OR ~ RR

• OR is always further from 1 than corresponding RR

– If RR > 1 then OR > RR

– If RR < 1 the OR < RR

Page 16: Proportions and Confidence Intervals in Biostatistics

Interpretation of measures of association

• RR & OR < 1, associated with a reduced risk / odds (may beprotective)

– RR = 0.8 (reduced risk of 20%)

• RR & OR > 1, associated with an increased risk / odds

– RR = 1.2 (increased risk of 20%)

• RR & OR – further the risk is from 1, stronger the associationbetween exposure and outcome (e.g. RR=2 versus RR=3).

Page 17: Proportions and Confidence Intervals in Biostatistics

Inference

• Obtain a sample estimate, q, of the population parameter (e.g.difference in proportions)

• REMEMBER different samples would give different estimatesof the population parameter (e.g. sample 1 q1, sample 2 q2,…)

• Derive:

– Standard error of q (i.e. s.e.(q))

– Confidence interval (i.e. q ± (1.96 × s.e.(q) )

Page 18: Proportions and Confidence Intervals in Biostatistics

Ratios – Risk ratio (RR) or Odds ratio (OR)

• Usual confidence intervals formula,q ± (1.96×s.e.(q)), is problematic for ratios.

When q is close to zero and s.e.(q) large,

calculated lower limit of confidence interval may benegative…

Page 19: Proportions and Confidence Intervals in Biostatistics

Risk ratio (RR)

• Solution Calculate the logarithm of(logeRR) and its standard error

RR

1 −

1 1 −

1s.e.(lo g e

RR ) = +d1 n1 d0 n0

95% CI for logarithm of RR :-Upper limitLower limit

= logeRR= logeRR

+ 1.96×s.e.(logeRR)

- 1.96×s.e.(logeRR)

95% CI for Risk ratio (RR):-Upper limit = antilog (upper limit of CI for logeRR)Lower limit = antilog (lower limit of CI for logeRR)

Page 20: Proportions and Confidence Intervals in Biostatistics

Log to the base e & antiloge (exponential)

• ‘Natural logarithms’ use the mathematical constant, e, as

their base, e=2.71828……1618 – Scottish

Mathematician: John Napier

ex• antilogex = exp(x) =

e = 2.718 loge2.718 = 1 antiloge1 = 2.718

e2 = 7.388 loge7.388 = 2 antiloge2 = 7.388

e3 = 20.079 loge20.079 = 3 antiloge3 = 20.079

101 = 10 log1010 = 1 antilog101 = 10

102 = 100 log10100 = 2 antilog102 = 100

103 = 1000 log101000 = 3 antilog103 = 1000

Page 21: Proportions and Confidence Intervals in Biostatistics

2×2 table – TBM trial example

Risk ratio = p1/p0 = 0.318 / 0.413 =logeRR = loge(0.77) = -0.26

0.77

1 −

1 +

1 −

1 s.e.(lo g RR ) =e = 0.11

87 274 112 271

95% CI for logeRR: -0.48 up to -0.04

95% CI for RR: exp(-0.48) up to exp(-0.04) = 0.62 up

to 0.96

Death during 9 months post start of treatment

Treatment group Yes No Total

Dexamethasone

(group 1)

87 (d1) 187 (h1) 274 (n1)

Placebo

(group 0)

112 (d0) 159 (h0) 271 (n0)

Total 199 346 545

Page 22: Proportions and Confidence Intervals in Biostatistics

Using Statacsi 87 112 187 159

| Exposed Unexposed | Total-----------------+------------------------+------------

Cases |

Noncases |87

187112

159|

|199

346-----------------+------------------------+------------

Total |

||||

274 271 |

||||

545

Risk .3175182 .4132841 .3651376

Point estimate [95% Conf. Interval]|------------------------+------------------------

Risk difference

Risk ratio Prev. frac. ex. Prev. frac. pop

|

|||

-.0957659

.7682808

.2317192

.1164974

|

|||

-.1762352 -.0152966.6139856 .9613505.0386495 .3860144

+-------------------------------------------------chi2(1) = 5.39 Pr>chi2 = 0.0202

Remember the warning about how the table is presented-Stata requires presentation with outcome by rows and exposure by columns

Results are close to those obtained by hand

Page 23: Proportions and Confidence Intervals in Biostatistics

2×2 table – TBM trial example

Interpretation…..

Dexamethasone was associated with an estimated decreased risk of 23% (estimated RR=0.77) for death during 9 months post start of treatment.

We are 95% confident, that the population risk ratio, lies between0.62 (decreased risk of 38%) and 0.96 (decreased risk of 4%).

Death during 9 months post start of treatment

Treatment group Yes No Total

Dexamethasone

(group 1)

87 (d1) 187 (h1) 274 (n1)

Placebo

(group 0)

112 (d0) 159 (h0) 271 (n0)

Total 199 346 545

Page 24: Proportions and Confidence Intervals in Biostatistics

95% confidence interval for Odds ratio (OR)

• Calculate the logarithm of OR (logeOR) and its standard error.

1 1 1 1Woolf’s formulas.e.(lo g OR ) =

e+ + +

d1 h1 d0 h0

95% CI for logarithm of OR :-Upper limit = logeOR + 1.96×s.e.(logeOR)

Lower limit = logeOR - 1.96×s.e.(logeOR)

95% CI for Odds ratio (OR):-

Upper limit = exp (upper limit of CI for logeOR)Lower limit = exp (lower limit of CI for logeOR)

Page 25: Proportions and Confidence Intervals in Biostatistics

2×2 table – TBM trial example

Odds Ratio = (d1/h1)/ (d0/h0) = 0.66logeOR = loge(0.66) = -0.42

1 +

1 +

1 +

1 s.e.(lo g OR ) =e = 0.18

87 187 112 159

95% CI for logeOR: -0.77 up to -0.07

95% CI for OR: exp(-0.77) up to exp(-0.07) = 0.46 up

to 0.93

Death during 9 months post start of treatment

Treatment group Yes No Total

Dexamethasone

(group 1)

87 (d1) 187 (h1) 274 (n1)

Placebo

(group 0)

112 (d0) 159 (h0) 271 (n0)

Total 199 346 545

Page 26: Proportions and Confidence Intervals in Biostatistics

Using Stata

. csi 87 112 187 159, or

| Exposed Unexposed | Total-----------------+------------------------+------------

Cases |

Noncases |87

187112

159|

|199

346-----------------+------------------------+------------

Total |

||

274 271 |

||

545

Risk .3175182 .4132841 .3651376|

||

|Point estimate [95% Conf. Interval]|------------------------+------------------------

Risk difference

Risk ratio Prev. frac. ex. Prev. frac. pop

Odds ratio

|

||||

-.0957659

.7682808

.2317192

.1164974

.6604756

|

||||

-.1762352

.6139856

.0386495

-.0152966

.9613505

.3860144

.4652544 .937623 (Cornfield)+-------------------------------------------------

chi2(1) = 5.39 Pr>chi2 = 0.0202

For OR, by default Stata uses Cornfield’s formula for se. You can requestthe Woolf formula as csi 87 112 187 159, or woolf

Page 27: Proportions and Confidence Intervals in Biostatistics

Test statistic forRisk ratio (RR) & Odds

ratio (OR)

Null hypothesis:-population RR = 1 or population OR = 1

• For risk ratio:-

log e RR − log e1 − 0.26 − 0z = = = −2.4

s.e.(lo g

RR )

0.11e

2-sided p-value = 0.016

Page 28: Proportions and Confidence Intervals in Biostatistics

Test statistic forRisk ratio (RR) & Odds

Null hypothesis:-

ratio (OR)

population RR = 1 or population OR = 1

• For odds ratio:-

log e OR − log e1 − 0.42 − 0z = = = −2.3

s.e.(lo g

OR )

0.18e

2-sided p-value = 0.021

Page 29: Proportions and Confidence Intervals in Biostatistics

Comparing the outcome measure of two exposure groups(groups 1 & 0)

1 0 1 01 0

s.e.( p ) + s.e.( p )1 0

p=

Outcome variable – data type

Population parameter

Estimate of population parameter

from sample

Standard error 95% Confidence interval for population

parameter

Numerical

µ1−µ0 x1 x0 s.e.( x 1 − x 0 )

2 2= s.e.( x 1 ) + s.e.( x 0 )

x1 − x0

± 1.96 × s.e.( x 1 − x 0 )

Categorical π1−π0 p − p s.e.( p − p )

2 2

p − p±1.96× s.e.(

1 − p

0 )

Page 30: Proportions and Confidence Intervals in Biostatistics

Comparing the outcome measure of two exposure groups(groups 1 & 0)

s.e.(lo g RR ) =e − + −

1 1 0 0

Outcome variable – data type

Population parameter

Estimate of

population parameter

from sample

Standard error of loge(parameter)

95% Confidence interval of loge(population parameter)

Categorical

Population risk ratio

p1/p0 1 1 1 1d1 n1 d0 n0

log eRR

± 1.96 × s.e.(log eRR )

Categorical Population odds ratio

(d1/h1) / (d0/h0)

1 1 1 1s.e.(lo ge OR ) =

d +

h +

d +

hlogeOR

±1.96× s.e.(log eOR)

Page 31: Proportions and Confidence Intervals in Biostatistics

Calculation of p-values for comparing two groups

z =

s.e.( p − p )

z =e

s.e.(log ( OR ))

Outcome variable – data type

Population parameter Population parameter under null hypothesis

Test statistic

Numerical

µ1−µ0 µ1−µ0=0 x1 − x0

s.e.( x 1 − x 0 )

Categorical

π1-π0

Population risk ratio

Population odds ratio

π1-π0=0

Population risk ratio=1

Population odds ratio=1

z = p1 − p0

1 0

loge ( RR)

s.e.(log ( RR ))

z = loge (OR)

e

Page 32: Proportions and Confidence Intervals in Biostatistics

Comparing the outcome measure of two exposure groups(TBM trial: dexamethasone versus placebo)

Outcome variable – data type

Population parameter under null hypothesis

Estimate of population parameter

from sample

95% confidence interval for population parameter

Two-sided p-value

Categorical Population risk

difference= 0

p1-p0

= -0.095-0.175, -0.015 0.020

Categorical

Population risk ratio

= 1

p1/p0

= 0.770.62, 0.96 0.016

Categorical Population odds ratio

= 1

(d1/h1) / (d0/h0)

= 0.660.46, 0.93 0.021

Page 33: Proportions and Confidence Intervals in Biostatistics

Using Stata – p-value calculated using Chi-squared test

. csi 87 112 187 159, or

| Exposed Unexposed | Total-----------------+------------------------+------------

Cases |

Noncases |87

187112

159|

|199

346-----------------+------------------------+------------

Total |

||

274 271 |

||

545

Risk .3175182 .4132841 .3651376|

||

|Point estimate [95% Conf. Interval]|------------------------+------------------------

Risk difference

Risk ratio Prev. frac. ex. Prev. frac. pop

Odds ratio

|

||||

-.0957659

.7682808

.2317192

.1164974

.6604756

|

||||

-.1762352

.6139856

.0386495

-.0152966

.9613505

.3860144

.4652544 .937623 (Cornfield)+-------------------------------------------------

chi2(1) = 5.39 Pr>chi2 = 0.0202

For OR, by default Stata uses Cornfield’s formula for se. You can requestthe Woolf formula as csi 87 112 187 159, or woolf

Page 34: Proportions and Confidence Intervals in Biostatistics

Lecture 9 - Objectives

• Calculate and interpret the measures ofassociation and theirand test statistics

confidence intervals

–––

Risk differenceRisk ratioOdds ratio

Page 35: Proportions and Confidence Intervals in Biostatistics

Thank You

www.HelpWithAssignment.com