risk factors and risk models: why we need them, and how we ... 1005. shahian. risk... · odds ratio...
TRANSCRIPT
Risk Factors and Risk Models: Why We Need Them, and How
We Develop and Use ThemMarch 21, 2018
David M. Shahian, MDVice-President, Center for Quality and Safety, MGH
Professor of Surgery, Harvard Medical SchoolChair, STS Quality Measurement Task Force
October 24, 2019
No relevant disclosures
“…so many operations of such and such a nature, without reference to age, sex, or cause of the operation, followed by so many deaths, without reference to age, sex, or complications. Given these elements, divide the one by the other, and you get the mortality…
A statistical proceeding such as this can at best lead to loose approximations. It can convey but a very imperfect idea of the real state of the case. And one thing is quite certain, that it can lead to no practical result whatever, either as regards the true causes of the mortality, or how these might be mitigated.”
Nightingale 1863, cited by Spiegelhalter 1999
Nightingale on the need for risk adjustment
VA quality assurance legislationDecember 3,1985
The Chief Medical Director shall compare the mortality and morbidity rates … with the national mortality and morbidity standards …. and analyze any deviation between such rates and such standards in terms of (i) the characteristics of the respective patient populations; (ii) the level of risk for the procedure involved, based on (I) patient age; (II) the type and severity of the disease; (III) the effect of any complicating diseases; and (IV) the degree of difficulty of the procedure; and (iii) any other factor that the Chief Medical Director considers appropriate
“Apart from the proportion of a hospital’s
cases in 80 DRG’s, the predictive models
had no measure of case severity based on
diagnosis or procedure…Model bias also
favored lower-risk hospitals.”
Ann Thorac Surg 1988;45:348-349
“In March 12, 1986, the Department of Health and Human Services Health Care Financing Administration (HCFA) released a list of hospitals whose mortality rates for Medicare patients allegedly exceeded ”predicted mortality rates” for those hospitals, either overall or for nine specific diagnostic categories. …the implication was clearly made that these raw mortalityrates were equated with quality of care in the institutions in question…an incorrect perception of care in certain communities…. All of the risk factors that are predictive of operative mortality must be identified and subjected to appropriate statistical analysis before comparisons of mortality rates between institutions can be made”
Ann Thorac Surg 1989
“…prompted by the release by HCFA…of raw mortality data for Medicare patients undergoing coronary artery bypass grafting procedures without respect to any of the then-known risk factors …”
Risk models are a core elementof the STS quality program
Risk factor data fields
Risk model considerations
• How do we initially pick risk factors for the Database? Endpoint (e.g., mortality)for which we are adjusting should have at
least moderate incidence (e.g., >1%) Permissible risk factors for different uses Rare risk factors very difficult if not impossible to model Low risk factor missingness, < 5% (preferably < 1-2%) Bivariate associations interesting but not determinative Harrell’s 10:1 (endpoints:variables) rule helps avoid overfitting
• Model development and testing full versus parsimonious models forward or backward selection, stopping rules, penalties, clinically
“supervised” selection calibration, discrimination
• Model uses
Permissible risk factors
Profiling• Only include risk factors present before treatment begins• Do not include treatments or complications• SES currently being studied, may be applicable to some
outcomes such as readmission
Performance improvement (e.g., readmission prediction and mitigation) or shared decision making• Permissible and desirable to use all available data
Hg A1c MELD score 5 meter walk Aortic etiology Aortic stenosis gradient Mitral etiology
Some candidate variables with high missing rates
Logistic regression—logit and probability forms
log odds (logit) intercept coefficients
log[p/(1-p)] = b0 + b1x1 +….. + bkxkrisk factors
Probability form of the same equation1
intercept coefficients
p =
Risk models: why do we talk about risk in terms of coefficients, odds and odds ratios?
Odds: probability of an event occurring divided by probability of not occurring (p/1-p)
Odds of tails on a coin toss is 0.5 / 0.5 = 1
Odds of rolling a 2 with a single six-sided dice:1/6 ÷ 5/6 = 1 ÷ 5 = 0.2
Risk models: What are odds ratios?
Odds ratio (OR): odds of the event when the risk factor is present compared to when it is not present
Consider an endpoint (e.g., death) occurring 2% of the time when some specific risk factor (e.g., a recent heart attack) is absent• Odds = 2% ÷ 98% = 0.0204• If the odds ratio for that risk factor is 2, it means
that the odds of death doubles to 0.0408 when the risk factor is present
Derivation of odds ratios from logistic equations
log[p/(1-p)] = b0 + b1x1 +….. + bkxk
Conveniently, odds ratios are mathematically directly related to the coefficients (b) in the logistic model
Odds Ratio (OR) = exp (b) = eb
and e0 = 1
Severe chronic lung disease
Coefficient = 0.85513
Odds ratio = e0.85513 = 2.35
What information does the risk factor odds ratio provide?
• Whether it has a positive or negative effect on the outcome:OR > 1 (beta coefficient +): when risk factor is present, outcome more likelyOR < 1 (beta coefficient -): when risk factor is present, outcome less likely
• How strong is its independent effect in comparison with all the other predictors when they are all considered simultaneously
Testing risk models Calibration (good fit: observed versus expected)
Discrimination (classification—fatality or survivor)
Calibration assesses prediction accuracy—of the days we predict 40% probability of rain, it should rain 40% of the time
Discrimination is whether or not it will rain (0 or 1, measured over the range of threshold probabilities—i.e., “Classify a day as rainy if the probability is greater than x %”)
Can have high discrimination but poor calibration, and vice versa
Graphical assessment of calibration
ROC curve area, c-index
Grunkemeier et al, ATS 2001
Multiple uses
Identification patient features that impact outcomes Compare risk factor profiles across hospitals Assist in the selection of the best procedure for a
particular patient Shared decision making with patient Guide risk mitigation and improvement activities Target specific patients for special attention, such as
aggressive post-discharge monitoring Performance measurement
Risk-adjusted outcomes Risk adjustment essential for outcomes measures
• Fair and accurate performance assessment—accounts for higher risk patients, mitigates risk aversion
• Face validity and provider acceptance, irrespective of statistical considerations
From a purely statistical perspective, risk adjustment appropriate if:• Presenting patient characteristics associated with outcomes• Prevalence of patient characteristics varies among providers
Percentiles of Risk Factor Distribution
1st 5th 10th 25th 50th 75th 90th 95th 99th
Isolated CABG (2009)
Age ≥80 0.0% 2.7% 3.7% 5.6% 8.1% 10.9% 13.7% 15.4% 19.5%
Preop Dialysis 0.0% 0.0% 0.0% 1.2% 2.1% 3.7% 5.6% 7.1% 11.2%
Creatinine ≥ 2 0.0% 0.0% 0.5% 1.3% 2.1% 3.2% 4.4% 5.4% 7.7%
CPR/salvage 0.0% 0.0% 0.0% 0.0% 0.0% 0.7% 1.5% 2.2% 3.7%
Cardiogenic Shock 0.0% 0.0% 0.0% 0.7% 1.5% 2.9% 4.5% 5.7% 9.6%
PVD 2.7% 6.1% 7.9% 10.9% 14.4% 17.9% 22.3% 26.0% 32.6%
PCI ≤ 6 Hours 0.0% 0.0% 0.0% 0.0% 0.7% 1.5% 2.6% 3.3% 6.0%
MI ≤6 Hours 0.0% 0.0% 0.0% 0.4% 1.3% 2.4% 4.1% 5.6% 9.0%Reoperation 0.0% 0.0% 0.4% 1.7% 3.2% 4.8% 6.8% 8.3% 12.3%
Shahian et al, Ann Thorac Surg 2013;96:718-26
CABG risk factor prevalence—national STS data
Massachusetts CABG programsRisk factor prevalence FY 2014
• 44 Veterans Affairs Medical Centers• 87,078 major non-cardiac operations• October 1, 1991- December 31, 1993
• 93% of hospitals changed rank after risk adjustment (some up, some down)
50% by more than 525% by more than 10
Risk adjustment makes a difference
Appropriate uses of risk-adjusted results
YES--Comparing a hospital’s performance for its specific case mix with what would have been expected based on the national benchmark population
NO—Directly comparing two hospitals with each other
Why?• Patient risk versus case-mix• Indirect versus direct standardization
Epidemiology: Direct Standardization
Apply study hospital’s stratum-specific rates to reference population
Direct hospital to hospital comparisons often possible
Typically used in epidemiologic studies focusing on a limited number of strata (e.g., age, sex, etc.)
Profiling: Indirect Standardization
Often too many variables (strata) to use direct standardization
Some study hospital patient strata with zero observations
Apply reference population rates to the study hospital’s patients
Profiling: Indirect Standardization Provider’s results for their specific patients compared with
what would have been expected for the same patients based on the performance of all providers who contributed to the benchmark population
Mortality lower than expected (O/E < 1), greater than expected (O/E > 1), or as-expected
O/E ratio < 1 at a hospital treating mainly low risk patients not directly comparable with O/E ratio at a hospital treating mainly high risk patients
A desirable O/E ratio does not necessarily extrapolate to a different mix of patients (i.e., much higher risk)
Provider-level “expected” outcomes
Individual patient-level probabilities for specific outcomes are summed for all patients of a given provider
This yields the total expected number of those outcomes
Patient Probability of death1 0.0312 0.0233 0.0544 0.016
….. ……100 0.035
Total N =100 Sum of probabilities of death = 4.3
Provider-level observed outcomes and O/E ratios
Expected total deaths compared with the observed deaths to estimate observed to expected ratio (O/E) • >1 indicates worse than expected• < indicates better than expected• Always look at the confidence intervals, which
reflect our certainty about the point estimate!
Patient Probability of death Actual outcome
1 0.031 Alive2 0.023 Alive3 0.054 Dead4 0.016 Alive
….. …… ….100 0.035 Dead
Total N =100 Sum of probabilities of death = 4.3
Sum of observed deaths = 5
Provider-level observed outcomes and O/E ratios
= O/E ratio = 5/4.3 = 1.16Observed Expected
Risk-adjusted outcomes
The O/E ratio can be multiplied by the average population mortality to give the risk-adjusted mortality
If O/E 1.16 and population average mortality 2.1%, then risk-adjusted mortality rate = 2.44%
• Indirectly standardized, risk-adjusted outcomes basedonly on the patients a particular hospital treated
• Should not be used for direct hospital to hospital comparisons
Do risk models protect providerswho care for the sickest patients?
Most risk models over-predict risk among highest risk patients
>10% Predicted Risk ofmortality
Overall
Center with highest risk patients consistently had lowest O/E ratios
Risk adjustment
Essential for most outcomes measures Need to collect desired factors with low missing % Intended use determines permissible risk factors Models should be transparent and peer reviewed In most instances, direct hospital to hospital
comparisons not appropriate, even with risk-adjusted results
Watch for extreme differences in case mix
Thank You!