10 rates vs. risksbiostat/bio1/week2/lectures_day5.pdf · 1 10 rates vs. risks a rate is a the...
TRANSCRIPT
1
10 Rates vs. risks
A rate is a the number of events occurring per unit of time
The incidence rate of disease occurrence is the
occurrence of new cases per unit of time in a well-
defined population
e.g. cases of a rare disease in Sweden per year
Incidence rate vs. mortality rate
When death is the event whose incidence we are
measuring, we refer to the mortality rate.
2
Reporting of rates
3
It is common to express rates per 100,000 individuals
e.g. If 32 children develop diabetes in a population of
200,000 children in a year. The incidence is
” 32 per 200,000 persons per year
or
32/200,000person-years = 16/100,000 person-years.
Sometimes other denominators used: e.g.
Per 100 or 1000 or 10,000 individuals
Reporting of rates (you saw in Epi 1)
4
http://seer/cancer.gov/statfacts/html/prost.html
http://seer/cancer.gov/statfacts/html/colorect.html
http://seer/cancer.gov/statfacts/html/pancreas.html
What about Swedish rates ( www.sos.se)?
3
Rates (continued)
Commonly used rates are incidence rate (IR) and mortality rate .
Person time is the summed time the people in the sample have
been observed, before an event.
Rates require observations of incidence in time. Thus, they are
estimated from cohort studies.
IR and MR are usually expressed in terms of number of events
per 1000 (or 10 000 or 100 000) person-years.
Other terms for IR: Incidence density, hazard
timeperson
incidencesof#
PT
IIR
Other terms for Incidence Rate
“instantaneous risk” (or probability);
Incidence density
“hazard” (especially for mortality rates);
“person-time incidence rate”;
a“force of morbidity”.
Related Terms that are not rates:
Cumulative incidence: The proportion of people having
an incidence during the follow up period.
Prevalence: The proportion of people having the
disease at a certain time.
4
Calculating Incidence Density
(with exact individual follow-up)
A hypothetical cohort of 12 subjects.
Followed for the period of 5.5 years.
7 withdrawals among non-cases
three (7,8,12) lost to follow-up;
two (3,4) due to death;
two (5,10) due to study termination.
PT = 2.5+3.5+…+1.5 = 26.
ID=5/26=0.192 per (person-) year
or 1.92 per 10 (person-)years.
Example *of Incidence Rates (from Epi I)
* Kesavan et al. Prospective Study of Magnesium and Iron Intake and Pancreatic Cancer
in Men. Am J Epidemiol 2010;171:233–241
How are these
calculated?
5
Population-Time Without Individual Data
e.g., some population-based registries.
Person-years computed using the mid-year population.
For rare events, periods of several years may be used.
Need to remove those not at risk (e.g., women for
prostate cancer incidence).
Incidence Density: Remarks
Any fluctuations in the instantaneous rate are obscured and can lead to
misleading conclusions. e.g.,
1000 persons followed for 1 year
100 persons followed for 10 years
produce the same number of person-years. If the average time to
disease onset is 5 years, ID in the first cohort will be lower.
6
Confidence interval for a rate….
Before we learn about confidence intervals for rates we need to
learn about a probability distribution called Poisson.
Recall the Binomial Distribution
If an experiment consists of n ”trials” each with probability ”p” of
success, we can find the probability of any numner of events
(0,1,2….n)
The formula for the probability involves n and p
The expected number of successes =np
If n is large and p is small, we can actually calculate the probabilities using
only ”np” in an alternative formula, called the Poisson formula
!
)(x
eeventsxP
x
P(0 events) = e-λ
λ = np
7
Example:
Comparison of Binomial and Poisson probabilities
http://www.graphpad.com/quickcalcs/probability1.cfm
Example: 100 trials, each with p=.02
No. successes Prob (Binomial) Prob (Poisson)
0 0.133=(.98)100 0.135 =e-2
1 0.271 0.271
2 0.273 0.271
3 or more 0.323 0.323
Check Poisson tables
( in book or page provided)
8
Poisson Distribution
Describes the probability of the number of occurences of an event
in a specified time, provided these events occur
At random
Independently
using only the “expected number” of events in the calculation
Example of Poisson in text book
(slightly modified)
If expected number of admissions at maternity unit per day =6,
we can find the probability of any number
P(X = 3) = 0.0892
P(X < 2) = P(X = 0) + P(X = 1) = .0025 + .0149 = 0.0174
P(X ≤ 14) = 1- (.0009+.0003+.0001) = .9987
9
Use the Poisson distribution to assess whether the
following report in Metro (25/10/2011) indicates a
significant rise in infant deaths:
”Plötslig spädsbarnsdöd ökar igen i Sverige. Förra året dog 27 barn,
jmfort med 13 barn 2007”
”Infant mortality may be increasing in Sweden. Last year there were
27 infant deaths compared to 13 in 2007”
The Poisson distribution can also be applied to events that occur
within some area or volume instead of time interval, e.g.
cases of some disease per squared-kilometer,
white cells count in a certain amount of blood
Provided the assumptions of random and independent occurrences
is reasonable
2014-04-25
1
11 Testing and comparing rates
Mean = λ
Variance = λ
Std. Deviation =
Remark: A peculiar characteristic of the Poisson distribution is that
the mean is equal to the variance. For any given set of observed
count data, the goodness of the Poisson assumption can be
assessed, to some extent, by looking at how close the mean and
the variance are.
Mean and variance of Poisson
2014-04-25
2
Example: if expected number of patients at an emergency
department is 2 per minute, then the probability distribution
(check that you agree)
x | Probability
-----+------------
0 | .1353
1 | .2706
2 | .2706
3 | .1804
4 | .0902
5 | .0360
6 | .0120
7 | .0034
8 | .0008
9 | .0001
10 | .0000
... | ...
------------------
0.1
.2.3
Pro
ba
bili
ty
0 1 2 3 4 5 6 7 8 9 10
Poisson(lambda = 2)
Quite skewed: but for large expected number , the distribution
becomes approximately normal
Remember: the number of events d has:
Mean = λ
Variance = λ
Std. Deviation =
So we can construct a CI using the mean and SE
2014-04-25
3
Confidence Interval for a rate
We use the poisson distribution as a model for calculating
confidence intervals for rates. We assume the number of events,
d~Poisson(λ)
T
d
nobservatio oftime-personTotal
eventsofNumberRate
T
Rateevents) of rs.e.(Numberates.e.
T
d
T)(
However, it is better to do the CI on the log-scale
Note this only depends on the number and not on T
6
d
1)( rate logs.e.
2014-04-25
4
CI for rate example (22.1 from text)
Guatemala study, 57 lower respiratory tractinfections in 873 child-
years i.e., rate of 65.3/1000py.
estimate λ= 65.3, log(λ )=4.179, d=57
95% CI for log rate = 4.179 1.96 (0.132) = 3.919 to 4.438
95% CI for rate = exp( 3.919) to exp(4.438)
= 50.36 to 84.65 per 1000 child years
132.057
11)(log d
SE
Comparing Two Incidence Rates
Assume data from
a cohort study:
Exposed Unexposed Total
Cases I1 I0 I
Person.-
time
PT1 PT0 PT
We get two estimates, one each for exposed and non-exposed
IR0=I0/PT0 and IR1=I1/PT1.
As with risks, we can compare them using the difference or ratio
Incidence rate difference: IRD = IR1 - IR0.
Incidence rate ratio: IRR = IR1 / IR0. (more commonly used)
10
11)SE(ln
IIIRR
Note similarity to
Poisson
2014-04-25
5
Comparing Two Incidence Densities: Example
Postmenopausal Hormones and Coronary Heart Disease Cohort Study:
Stampfer et al., NEJM (1985).
Involving female nurses:
Hormone use
Yes No Total
CHD 30 60 90
Person-years 54308.7 51477.5 105786.2
IR1=30/54308.7=0.00055; IR0=60/51477.5=0.00116
IRD = IR1 - IR0 = -0.00061
IRR = IR1 / IR0 = 0.474
Comparing two incidence rates (IRR) continued
Hormone use
Yes No Total
CHD 30 60 90
Person-
years
54308.7 51477.5 105786.2
224.060
1
30
1)SE(ln IRR
95% CI for ln IRR: ln(0.474) 1.96 (0.224) = (-1.178, -0.315).
95% CI for IRR: (e-1.178, e-0.315) = (0.308, 0.729).
CI allows us to reject the null hypothesis of no difference.
2014-04-25
1
12 Standardisation
Incidence rates: Remarks
If applied to the whole cohort/population, sometimes called crude rate.
However, sex, age, race etc. can have substantial influence on the
incidence of disease.
Comparing crude rates for two populations, which differ for example in
age, can be misleading (confounding!).
Therefore, usually standardized rates are compared.
e.g., for cancer, age- and sex-standardized rates are used.
2014-04-25
2
Standardized Rates
We will introduce standardization using age.
We will assume that our population is stratified by age (i.e., subdivided
into age-groups).
need to define age-groups (e.g., 0-4, 5-9,…).
compute age-specific incidence rates (IR).
Person-time and no. of cases for each age-group are required.
There are two methods of standardization:
Direct;
Indirect.
Standardization
Direct method
Age-specific rates of study population are applied to the age-
distribution of a “standard population”.
Indirect method
Age-specific rates from a reference population are applied to the
study population to obtain expected numbers of events.
2014-04-25
3
Direct Standardization
So we can report the rate that would have occurred if our population had the same age (and sex) distribution as the standard population
This provides a measure of disease rates in the different populations to be compared, adjusted for age
The idea is to apply the age-specific rates from our population to a ”standard population”
Standard world population
Standard European population
Standard US population
Standard world population (from Statistical Methods for Registries, Chapter 11 of IARC
publication 128)
2014-04-25
4
Data files of standard populations (World,
European, US, Canadian) available from SEER:
http://seer.cancer.gov/stdpopulations/
(use 1,000,000 rather than 100,000)
Direct standardisation of stomach cancer 1993-1997 in males
in Denmark (Chapter 8, Cancer Incidence in Five Continents Vol.
VIII,, IARC Publications No. 155
2014-04-25
5
IARC publications available on-line
Previous table from
http://www.iarc.fr/en/publications/pdfs-online/epi/sp155/ci5v8-
chap8.pdf
Another good Reference for standardisation:
http://www.iarc.fr/en/publications/pdfs-online/epi/sp95/sp95-
chap11.pdf
Comments on direct standardized rate (DSR)
If there is no confounding, crude rate is adequate.
DSR by itself is not meaningful – it makes sense only when comparing two or
more populations.
If possible, compare age-specific rates.
The rates should exhibit more or less similar trends (also in the standard).
DSR depends on the choice of the standard population.
The age-distribution of the latter should not be radically different from the compared
populations.
There are several standard populations (e.g., for the world, European, US etc.).
2014-04-25
6
New WHO standard world population : Implications?
(www.who.int/healthinfo/paper31.pdf )
Indirect Standardization
Direct standardization requires age-specific rates for both (all)
compared populations.
If these are not available, or they are imprecise, the indirect
method is preferred.
Both should lead to similar conclusions; if they do not, the
reason should be investigated.
2014-04-25
7
Indirect Standardization:
Compare the observed number of events in our population to what would be expected if rates another “reference population” applied
The ratio of observed to expected is called the
Incidence Rate Ratio IRR or
Standardized Incidence Rate SIR
If the ratio is of observed and expected deaths, it is called
Mortality Rate Ratio MRR or Standardized Mortality Rate SMR
Indirect Standardization: example
MRR=458/220.9= 2.07
2014-04-25
8
Indirect Standardization: examples from
your own work e.g. with register data
Good examples of Direct and Indirect standardization
25.2, 25.3 (text book)
2014-04-25
9
Comparison of Directly Standardized Rates:
If we have two standardized rates, we may want to compare them. Assume we have DSR1 and DSR2. It is usual to compute the ratio DSR1/DSR2, And obtain a confidence interval for either of these two measures (using statistical software)
Comparison of Indirectly Standardized Rates
D
2014-04-25
10
Comments on standardization of rates
Standardization is a simple way to remove effect of confounding.
We used age in all examples, but can be extended to more than one
confounder.
Same idea as for relative risk, where we construct estimates of common
(i.e. adjusted) OR (like ORMH).
Importance of standardized rates
From :
Taylor P. Standardized mortality ratios, Int Jour Epi, 2013; 42: 1882-1890
2014-04-25
11
(continued) Taylor P. Standardized mortality ratios, Int Jour
Epi, 2013; 42: 1882-1890