adverse selection: the dog that didn’t bite
TRANSCRIPT
Adverse Selection: The Dog that Didn’t Bite
July 2010
Abstract
We assess the welfare impact of health status based selection in health insurance plans using detailed panel data on health plan choices and complete health care utilization. Our estimates suggest that adverse selection plays an important role in explaining cost differentials across plans and much of the selection occurs along difficult to contract upon dimensions. The distortionary consequences of the asymmetric information are modest because individuals are very premium inelastic in our data. Our findings show that the presence of significant adverse selection need not cause meaningful welfare loss.
1
1. Introduction
The vast majority of under-65 year old Americans, accounting for 160 million enrollees
with an average annual family premium of $12,658 in 2008, obtain health insurance through an
employer.1 Because employers play such a large role in health insurance markets, the welfare of
health insurance outcomes depends in large part upon the ability of employers to offer the
appropriate health plan menu at efficient prices. Given the size of the market, small deviations
from the optimal premium have the potential to generate large welfare losses. Economists have
long been concerned that the presence of adverse selection in health insurance markets makes it
difficult for employers (and market mechanisms more generally) to price these products
optimally.
Cutler and Reber (1998) and Einav, Finkelstein and Cullen (2010) show that the optimal
sorting of enrollees into health plans requires the premium differentials to equal the differential
cost of the marginal enrollee across plans. In most employer-sponsored settings, premiums will
equal (or be a function of) the average cost of the enrollees in the plan and not the cost of
marginal enrollee. Sub-optimally set premiums result in welfare loss as some enrollees fail to
enroll in the plan that yields the highest net surplus. The magnitude of the selection induced
welfare loss depends upon the number of enrollees who sort into the `wrong plan’, which
depends upon the size of the premium distortion and the premium elasticity of enrollees, and the
net loss in welfare that results from enrollees selecting the `wrong’ plan. To accurately estimate
the welfare loss from selection, therefore, requires accurate estimates of the premium distortions,
the elasticity of demand for health plans, as well as the net loss in welfare from the misallocation
of enrollees to plans.
1 Source: Kaiser Family Foundation (http://ehbs.kff.org/pdf/2009/7981.pdf)
2
In this paper, we estimate the impact of health plan selection on welfare in the context of
the employee health plan offerings of a large employer.2 We possess detailed information on
both the health insurance choices and the complete health care utilization experience of a large
number of individuals over multiple years. Our data allow us to formulate rich measures of the
employee’s health status that are based upon previous diagnoses. These data provide sufficient
detail to cleanly model the importance of selection in determining the cost differentials across
health plans. It is well known that identifying adverse selection separately from moral hazard can
be challenging (Chiappori and Salanie, 2000). Furthermore, the information available in the
claims data allows us to decompose health plan selection into easily observable (to the employer)
demographic variables and more difficult to observe health status dimensions.
In order to estimate welfare impact of selection, we estimate models of the health plan
demand and costs. To model demand, we construct a model of health insurance choice in which
individuals assess the quality of the health plan, their estimated health status, premiums, and
expected level of expenses when selecting a plan. We exploit the detailed data and formulate
measures of the health status and expected out-of-pocket expenditures that reflect actual enrollee
medical care utilization between different plans.
We estimate utility parameters using an autoregressive, multinomial probit choice model
(Geweke, Keane and Runkle, 1997). The probit model allows for flexibility in the correlation
structure of the choice-specific errors. Our implementation also allows the unobserved choice-
specific error term to be autocorrelated, and the autocorrelation parameter can vary across health
plans. Work by Handel (2010) suggests that persistence in health plan choice is important, and
individuals in our data tend to remain in the same health plan for multiple years. Thus,
2 Here we use the term `selection’ to refer to both adverse and advantageous selection. In our setting, we have multiple plans and thus some plans will experience adverse selection while others will therefore experience advantageous selection.
3
incorporating autocorrelated unobserved heterogeneity into the model is likely important. This
model removes many of the unappealing implications (and potential biases) of the literature’s
workhorse empirical model, the iid, multinomial (or nested) logit. Statistical inference is
generated using Bayesian methods (Gelfand and Smith, 1990) as convergence for these types of
models using classical approaches is often problematic. Importantly, Monte Carlos analysis
indicates that in the presence of autocorrelated unobservables, standard static discrete choice
methods generate biased coefficients estimates.
Health expenditure data are censored and skewed and it is well known that accounting for
these features is important for unbiased inference. To estimate the cost side, we estimate the
parameters of a set of health expenditure equations, allowing for individual- and plan-specific
heterogeneity, censored and skewed dependent variable outcomes. In keeping with the spirit of
the choice model estimation, we use a Bayesian framework to generate statistical inference. The
model allows for individual-specific effects and nonlinear plan-specific relationships between
costs and health status. The estimates from the expenditure equations enable us to decompose
cost heterogeneity across plans into differences that are a consequence of selection and
differences that are related to plan design.
The estimates from the choice model indicate that enrollees select into plans based on
both the relatively easy-to-observe demographic variables and the much more difficult–to-
observe health status dimension. That is, much of the information that individuals use to make
health plan choice is private, unlikely to be contractible and is correlated with their expected
cost. The impact of selection on realized average health plan expenditures quite large – upwards
of 50% of the plan cost differentials are driven by selection. Consistent with our Monte Carlos
analysis, in our setting, the use of the cross-sectional multinomial logit choice model leads to
erroneous conclusions over several important dimensions of health plan choice. Unobserved
4
heterogeneity in plan preference is extremely persistent and failure to account for this inertia
leads to biased estimates of premium elasticities and other important factors that influence health
plan choice.
Using the demand and cost side estimates, we perform several premium counterfactuals
in order to quantify the impact of selection on welfare. In these simulations we adjust premiums
according to commonly used risk adjustment rules and we also calculate an optimal premium
structure. While there is significant health status based health plan selection in our setting, the
distortionary consequences of asymmetric information are modest. The welfare gain from using
optimal premium structures is minimal and the reason selection does not greatly affect welfare in
our context is straightforward: plan choices are relatively insensitive to large changes in
premiums. Counter to the conclusions of Cutler and Reber (1998) and roughly consistent with
the work of Einav, Finkelstein and Cullen (2010), we find that large distortions in premiums
induced by selection do not meaningfully distort the plan choices of enrollees. Roughly,
individuals sort into the same plans at the optimal premiums and under the premiums we
observed, even though those premiums are very different from the actual premiums in the data.
The remainder of the paper has the following structure. We provide a brief overview of
the relevant literature in Section 2. The data on which the analysis rests is described in Section 3,
and the model is laid out in Section 4. The results of the model estimation are presented in
Section 5, and the results of the employer pricing policy experiments shown in Section 6. Section
7 concludes. Details of the Bayesian inference used to estimate the parameters in the model are
available in appendices from the authors.
2. Background
At least since the work of Rothschild and Stiglitz (1976) economists have known that
adverse selection can have deleterious impacts on health insurance market outcomes. It is
5
therefore not surprising that a large literature has arisen measuring health status selection in
health plans.3 The typical research in this line of analysis studies differences in demographic or
cost experience between some form of managed care and a less restrictive health insurance
product (e.g. a preferred provider organization or fee-for-service plan).4 While there are a
number of studies that find no selection, the typical study finds that the more restrictive the
provider network or the more managed the care, the more favorable is the health plan risk
selection. Sicker enrollees have a greater willingness to pay for broad provider access and less
insurer involvement in their care.5, 6
Cutler and Reber (1998) study the health plan choices of Harvard employees after the
implementation of a fixed employer premium contribution policy. They show that under such a
premium setting strategy, the premium differentials between plans is likely to be too high and
result in welfare loss. Specifically, they note that the optimal premium differences between plans
should be based solely upon the cost differential of treating the marginal enrollee. However, in
most settings the premium differentials will reflect the average plan-specific cost differential as
well as the average health status of the enrollees. If sicker enrollees prefer plans that offer a
broader choice of providers and choice is more costly to offer, the premium differentials will be
3 Cutler and Zeckhauser (2000) provide an excellent review of this research. More recent examples of adverse selection in health plans studies are: Atherly, Dowd, Feldman (2004), Barrett and Conlon (2003), Tchernis, et al. (2003), Gray and Selden (2002), Riphahn, Wambach and Million (2002), Feldman and Dowd (2000) and Altman, Cutler and Zeckhauser (2003). 4 There is an important body of work that examines adverse selection in the non-health insurance context. An incomplete list of papers in this literature include Cawley and Philipson (1999), Chiappori and Salanie (2000), Finkelstein and Poterba (2004) and Einov, Finkelstein and Schrimpf (2007). 5 While much of the literature on adverse selection does not have access to information that might be considered ‘private’ when modeling selection, several paper incorporate measures of health status into their analysis. Notably, Strombom, Buchmueller and Feldstein (2002) acquire data on hospitalizations and cancer diagnoses around the time of plan enrollment and include this information as regressors in a choice model. Others researchers use chronic illness indicators or paid claims information to construct health status measures (e.g. Parente, Feldman and Christianson, 2004; Atherly, Dowd and Feldman 2004; Harris, Schultz and Feldman, 2002; Royalty and Solomon, 1999). 6 Parente, Feldman and Christianson (2004) use data from a similar setting to ours, supplemented with an employee survey to examine the profile of those who enroll in a Consumer Directed Health Plan (CDHP). Using a MNL approach, they find the CDHP plan enrolled wealthier employees and the CDHP was not more attractive to the young and healthy.
6
too high relative to the optimum. Cutler and Reber estimate the parameters and utility functions
using multinomial logit methods and estimates of costs using other data sources to conclude that
adverse selection resulted in a welfare loss of 4% of gross premiums. However, it is uncommon
for employers to risk adjust premiums (Kennen et al., 2001). Thus, their findings introduce a
puzzle that our paper addresses. If the welfare loss from adverse selection in health insurance is
so large, why do not more employers risk-adjust their premiums?
Cardon and Hendel (2001) use data from the 1987 National Medical Expenditure Survey
aggregating many different heath plans into three choice categories to quantify the influence of
information asymmetries on health plan selection. They estimate a structural model of health
plan choice and health care expenditures.7 Conditioning on an information set that includes age,
gender, geographic location, and race, they find little evidence of adverse selection.
More recently, several researchers have sought to leverage detailed claims data to
measure the welfare loss from adverse selection in employer-sponsored health insurance. Einav,
Finkelstein and Cullen (2010) note that the demand and cost curves can be separately identified
from premium variation alone. This observation along with linearity and premium exogeneity
assumptions makes estimating welfare loss a simple OLS problem. Using cross-sectional data
from a large employer and focusing on two plans, they estimate that the welfare loss from
adverse selection of $9.55 per employee per year. Bundorf, Levin and Mohoney (2008) use data
from a single health insurance broker to analyze the welfare gains from individually risk-
adjusting the out-of-pocket premium to enrollees. The underlying idea is that risk-adjusting the
out-of-pocket premium better sorts enrollees to plans, thereby increasing net surplus. Handel
(2010) uses medical claims data from a large employer to measure health plan switching costs.
He finds that significant switching costs are large and lead to significant inertia in health plan
7 See also, Marquis and Holmer (1996).
7
choice. In Handel’s setting the reduction of switching costs leads to welfare loss as it increases
adverse selection which leads to significant premium distortions. Finally, Bajari, et al. (2010) use
a two-step method to semiparametrically isolate moral hazard from adverse selection and find
evidence of both.8
3. Data and Institutional Setting
The data used here are detailed employee health care claims and health plan enrollment
history for a very large, self-insured employer. Linked, but de-identified, enrollment history is
available for 2002-2005 and claims history is available for 2002-2004, with six months of claims
run-out paid in 2005. As the employer is self-insured, all covered medical care rendered to an
employee will have an associated claim included in our data. A claim contains all information
needed to process an insurance payment and thus includes valuable information such as primary
and secondary diagnosis codes, procedure codes and identification of provider. The claim also
contains information on the amount paid by the insurer and by the enrollee. For each enrollee we
aggregate all their health care claims throughout the year to formulate a precise measure of both
the total claims paid for the enrollee’s care, and the share of these expenditures incurred by the
enrollee as out-of-pocket costs for covered medical care.
The employer offers health insurance coverage for individuals, spouses/domestic partners
and families. Employees select their health plan during the open enrollment period in November.
Information about the plan options, benefits and provider network structures are mailed directly
to each employee just prior to open enrollment. Using these documents, it is easy for the enrollee
8 There is a literature on modeling health plan choices for Medicare and employer-based plans that is also relevant to our work. An incomplete list of the papers in this literature includes Feldman, et al. (1989), Dowd and Feldman (1994-1995), Marquis and Holmer (1996), Cutler and Reber (1998), Royalty and Solomon (1999), Buchmueller (2000), Scanlon, et al. (2002), Strombom, Buchmueller and Feldstein (2002), Dowd, Feldman and Coulam (2003) and Atherly, Dowd and Feldman (2004). All rely on the iid, logit family of choice models to estimate the parameters of interest. These papers estimate premium elasticities for health insurance that range from the moderately elastic to the very inelastic.
8
to compare out-of-pocket premiums and benefits structures across the plans. The Human
Resources department also sends out several emails reminding employees to select a health plan
during open enrollment. Information about the spouse/domestic partner employment-based
benefits choices are not available – we do not know what the exact choice set is in this case, as
the partner may have insurance benefits at another firm. For this reason our analysis focuses
solely on individuals continuously enrolled in single coverage over the three years. While single
coverage does not equate to single marital status, the approximation likely introduces little error.
In addition, employees outside the metropolitan service area were eliminated from the analysis
sample because they have a different health insurance choice set. This leaves 3,578 employees
for the estimation.9
Employee demographics information is gathered from several sources. Age and gender
are taken directly from the encrypted plan enrollment files. Employee earnings were matched
from a separate accounting file. This demographic information is merged with health plan
enrollment files, along with health status and expenditure data culled from the claims files. Plan
characteristics (provider panel structure, copayments, out-of-pocket limitations, employee
contribution levels) are drawn from the employer’s open enrollment materials.
The employer offered a total of four plans in 2002 through 2005. The employer contracts
with the plans to access their provider network and specifies the benefit design. The plans charge
the employer a fee for accessing their network and processing the claims while the employer
pays the medical care claims net of enrollee out-of-pocket payments. The plans include one
closed-panel health maintenance organization (HMO), two point-of-service (POS) plans, and a
Consumer-Driven Health Plan (CDHP).
9 The relative plan shares of the entire sample before imposing our inclusion restrictions closely match the relative shares of our analysis sample suggesting that our inclusion criteria is not inducing a sample selection bias.
9
The HMO is a restricted provider network plan in which enrollees incur modest co-pays
for any within-network utilization, however the plan does not cover any out-of-network medical
care. The organization offered two POS plans with differing provider networks both across and
within the plans. One of the POS plans (POS 1) has three tiers, each with different premium
levels; the tiers corresponding to successively broader provider networks. Tier 2 includes all the
providers in Tier 1 as well as offering enrollees access to additional, potentially higher cost
providers. Likewise, Tier 3 includes all the providers in Tier 2 in addition to other higher-cost
providers.10 The other POS plan (POS 2) has only one option. The POS plans charge higher in-
network co-pays than the HMO but allow out-of-network care (with a higher out-of-network co-
pay). All of these plans had the same out-of-pocket limits of $1,500 and the same prescription
drug carve-out plan.
The CDHP is a recently developed health insurance product that combines a catastrophic
indemnity plan matched with an employer-funded medical spending account. In the CDHP a
single enrollee is given an account of $750 (2003 and 2004) or $600 (2005) to draw upon for
medical care. Once the account is exhausted, the enrollee pays the balance of the $1500
deductible out of pocket. After the enrollee utilizes more than $1,500 of medical care, there is no
additional copayment for in-network health care. Pharmacy coverage is integrated into this plan,
unlike the carve-out in the traditional plans. In sum, there are a total of six health plan choices
available to the employee: HMO, POS 1/tier 1, POS 1/tier 2, POS 1/tier 3, POS 2, CDHP.
Table 1 presents the annualized employee premiums and enrollment shares for the plans
in our data. The premiums vary across plans and time. The organization spends considerable
resources attempting to inform employees of their health plan options, thus enrollees are well
informed of the premiums and benefit differences across the plans when making their plan 10 While the Tiers were advertised as differing in the prices paid to providers, as we discuss in Section 5 we found little cost differential between the tiers in POS 1.
10
selections. During the open enrollment period, brochures and emails are distributed to all
employees detailing the premiums and changes in premiums from the previous year, as well as
the plan benefit structures and provider panels. In addition, the detailed plan information is
available on the organization’s human resources web site.
The temporal variation in out-of-pocket premiums is driven by two changes in the
employer’s premium-setting strategy. First, the employer used a defined contribution strategy
and the formula used to construct the premiums changed over time. For 2002-2003 the employee
share for single coverage was set to zero for the base plan. Employee share for other plans was
simply the excess of other plan total premium over base plan total premium. For 2004-2005 the
employee share was set at 10% of the base plan premium. Employee share for other plans was
10% of this base amount plus the excess of other plan total premium over base plan total
premium. These changes were driven by the organization’s drive to push more of the cost of
health care onto employees. The second change was that the employer moved from a strategy of
no risk adjustment in 2002 to one of rudimentary risk adjustment by the end of our data.
Importantly, we have discussed, in great detail, the premium setting strategy with the human
resource personnel and they assured us that the changes in the relative premiums did not reflect
specific changes in the plan characteristics or employee preferences but changes in the premium-
setting strategy of the organization. Thus, to the best of our understanding, time varying
differences in premiums are exogenous.
The most popular plan is the HMO, and it is also the cheapest plan. Another noteworthy
pattern in Table 1 is that while the premiums of the plans shifted considerably, the shares did not.
This pattern suggests that controlling for autocorrelation in enrollee preferences is likely to be
important.
11
Measuring Health Status
An important feature of this study is that we are able to use past health care utilization
information to formulate health status measures that are incorporated into the choice model. In
order to construct this measure, we need to map the thousands of ICD-9CM codes (and millions
of potential combinations of codes) into a parsimonious representation of current health status.
We use algorithms developed by the Johns Hopkins University ACG Case-Mix System (v. 6),
developed by the Health Services Research and Development Center.
The ACG Case-Mix System employs a combination of clinical assessments and grouping
of diagnoses and procedures along with extensive regression fitting to take the thousands of ICD-
9CM codes that are embedded in the claims to construct simple measures of health status. The
predictive modeling feature of the ACG software produces a concurrent weight (CW), a
summary measure of the current individual health status and resource utilization. The CW is
constructed so that the national average is 1.0 with higher values denoting poorer health and
likely higher expenditures.
A potential concern with our measure of health status is that it is based on the previous
year’s health care utilization which, in turn, is potentially a function of plan characteristics and
benefit structures. That is, our measure of health status may be an endogenous function of plan
choice. The influence of plan choice in affecting the CW is mitigated by the fact that the CW is
based on diagnosis code history, rather than the health care utilization, per se. Thus, impact of
the plan on CW can only occur through differences in accessing care that affect the diagnoses
recorded in their claims history. To investigate this possibility, we examine the impact of plan
choice on CW for those individuals that switched plans. We regress CW on plan and individual
fixed effects. Since we are controlling for time-invariant, individual differences in CW, if the
plan choice does not affect CW, it suggests (but is not dispositive) that our measure of health
12
status is not a function of health plan choice. In this analysis, plan fixed effects are insignificant.
This is true for both each plan individually and for the joint test of their significance. Thus the
evidence we can bring to bear suggests that CW is not a function of plan choice.
Table 2 provides summary statistics of our estimation sample. The organization has a
high percentage of female employees and the average health status (as measured by CW) of the
employees is poorer than the national average. Interestingly, there are important differences in
the mean demographic and health status measures across the plans, however the health status
measures display much greater across-plan variance than the demographic variables. This
suggests that relying solely on demographic variables to measure variation in expected cost may
underestimate the true magnitude of adverse selection.
The simple summary statistics in Table 2 show that health plans differ in both
demographics and in measured health status, and this is prima facie evidence of selection. The
HMO attracts a younger, more male, lower-paid and healthier population than the other plans.
The plan that garners the least favorable selection is the POS 2 plan—the mean age and CW is
significantly higher than the HMO. The CDHP enrolls an older population with a higher salary,
and a higher CW than the HMO but a lower CW than the other plans.
4. Empirical Approach
To measure the welfare consequences of we first must accomplish two tasks: 1) estimate
the expected total health claims and patient out-of-pocket expenditures that individuals would
incur if they were to enroll in any of the available plans; and 2) estimate the parameters of an
indirect utility function. Below we describe our approach to estimating the cost functions and
health plan choice demand parameters.
13
Medical Care Cost Model
In order to estimate total health care expenditures and patient out-of-pocket (OOP)
expenses as a function of observable individual and plan characteristics, we face three important
modeling challenges. First, we need to account for unobserved individual heterogeneity. Second,
much of the data is censored at zero. Third, OOP and total claims expenditure are determined
jointly with plan-specific rules placing nonlinear restrictions on the OOP realizations. To account
for these features of the data, we use a two-part, three-equation censored regression model
building upon the work of Cowles, Carlin and Connet (1996). The first equation in the model is
an indicator of whether the patient receives care. The thresholds are stochastic and are a function
of observables, a time-invariant unobservable individual effect and an iid error. Conditional on
the threshold being breached, the other two equations of the model predict total claims
expenditure, denoted Clmit, and total out-of-pocket expenditure, OOPit.
More specifically, the threshold variable is a zero-one indicator of non-zero claims,
and is a function of the latent variable in the following way:
Positive values of claims and out-of-pocket expenditures are observed only when this threshold
is met. Letting the asterisk denote latent values, the observed Clmit is given by
,
and OOPit is modeled by capping OOP expenditures at the plan’s out-of-pocket limit, denoted
, where Ij denotes the set of plan j’s enrollees:
14
We allow for individual-specific, unobserved, time-invariant shocks that differ across
equations. The system of latent variables is simply:
,
where are the time-invariant, individual-specific intercepts. In the language of
frequentist econometrics the ’s are similar to random effects. The iid error vector is assumed to
be normally distributed: for . Note that the covariance
matrix, , is plan-specific, as we expect different correlations between the threshold, claims and
OOP expense across the plans.
In order to flexibly model health plan costs, the vector of explanatory variables, ,
includes the demographic and health status variables, and interactions of those variables.
Specifically, includes age, age squared, female indicator, age interacted with female, CW
(health status) and a female indicator interacted with age and CW. We estimate plan-specific
parameters for all of the explanatory variables. In this way, we allow for the possibility that some
plans may be more efficient at treating enrollees with relatively good health status while others
may be more efficient in providing care for sicker enrollees.
Bayesian inference is used to formulate our estimates of the parameters of interest. The
aim in Bayesian estimation is to construct the posterior distribution of the parameters given
assumptions on the prior distribution of the parameters and the data. We simulate the posterior
15
distribution of the parameters using Monte Carlo Markov Chain (MCMC) methods. Specifically,
we use the Gibbs sampler (Gelfand and Smith, 1990) and data augmentation techniques (Tanner
and Wong, 1987). Chib (1992) first proposed a Bayesian approach for estimating the tobit
model. The details of the estimation algorithm are described in an appendix available from the
corresponding author.
Our estimates will be unbiased as long as residual, unobserved health status is
uncorrelated with plan choice. We believe this is a reasonable assumption in our setting as we
include in the set of regressors a health status measure based on very detailed information on
prior diagnoses and procedures. The health status measure is based on much of the same
information the enrollee has access to when selecting a health plan.11 While we believe that that
the institutional setting and the quality of our data make it unlikely that, conditional on our
measure of health status, enrollees are cognizant of health status information that affects their
choice of plan, it is clearly still a possibility.12 In order to assess the plausibility of our
assumption, we perform two direct tests for the presence of unobservable health status
information that is correlated with plan choice.
For the first test, we reformulate and generalize our model allowing for linkages between
the cost and health plan choice equations. That is, we allow the plan choice to be endogenous.
Specifically, the error from the choice equation is allowed to impact directly on plan cost
estimates.13 The exclusion restriction that identifies the parameters is that changes in premiums,
conditional on covariates, are not correlated with the cost shocks. This framework is similar to
11 There is a sense in which we have better health status information than the enrollee as we are running their claims histories through a sophisticated algorithm constructed for the purposed of predicting future health care expenditures. 12 In addition, others have argued (and tested) that the type of risk adjustment we employ corrects for unobserved selection bias in an environment where unobservable health status is more likely to bias the results (Shea, et al., 2007). Pietz and Peterson (2007) find that other measures of self-reported health status beyond the diagnostic information in claims do not help predict mortality. 13 In this specification, we cannot include the measures of the variance of OOP expenditures in the plan choice equation.
16
Deb, Munkin and Trivedi (2006) and Geweke, Gowrisankaran and Town (2003). Again, the
inference here is generated using Bayesian methods. The posterior distribution of the parameters
on the utility errors meaningfully overlaps zero, implying that plan choice is not correlated with
unobserved health status information.14 Furthermore, the posterior distribution of the plan
coefficients is very similar to the corresponding distribution from our base specification. Again,
this suggests that unobserved choice shocks are not correlated with plan cost. In the analysis we
present below, we retain separate expenditure and choice models to allow the inclusion of an
estimate of the variance of out-of-pocket expenditures as a covariate in the choice model to
explore risk aversion.
In the second direct test, we estimate health care costs, conditional on the costs being
greater than zero using both individual fixed and random effects.15 We compare the parameter
estimates between the random effects and fixed effects models using a standard Hausman-Wu
test. If individuals use time-invariant information about their health status to inform their plan
choice then we should expect the coefficients estimates between the random effects and fixed
effects models to differ. The Hausman-Wu test fails to reject the hypothesis at the 10% level that
the coefficients are different between the models. That is, there is little evidence that our
estimates from the cost equation will be biased due to enrollees using time invariant information
in selecting a plan.16
Once the cost model is estimated, we construct estimates of the expected Clm and OOP
expenses for each individual (in logarithm scale) across all plans including the plan actually
selected by the individual. Because the Gibbs Sampler method results in a sample from the
posterior distribution of the parameters, including the latent expenditure variables, we can use
14 We do not present results from the joint model here but the results are available upon request from the authors. 15 Less than 10% of the enrollee/year observations have zero reported health care claims. 16 Documentation of additional, indirect, tests of endogeneity is available in Carlin and Town (2010).
17
this sample to calculate the sample mean and variance of out-of-pocket expenditures for each
individual in each plan, to be included in the choice utility equations.
Health Plan Choice Model
We model health plan choice using a multinomial probit with autocorrelated errors (AR-
MNP). In our model the unobservable components of utility are allowed to be freely correlated
across plans and autocorrelated over time. Most analyses of health plan choice rely upon the iid
logit model or the heteroskedastic but independent nested logit model. Our results suggest that
these assumptions may lead to biased inference. Only 7% of the enrollees switch plans in any
given year, suggesting that plan preferences are likely to be correlated over time; our framework
explicitly accounts for this autocorrelation. We are unaware of any analysis of health plan choice
that allows for both full freedom in cross-plan correlation, and correlation of unobserved
preferences across time.
In selecting their health plan, we assume enrollees observe their health status and
formulate consistent estimates of both the mean and variance of their out-of-pocket expenditures
across all plans. Given these estimates they select the plan that maximizes their utility. Let
be the indirect utility of the person for the plan at time t
. Individuals choose plan j if and only if for all . It is well known
that is not identifiable because level and scale are irrelevant to the maximization problem.
The Jth plan is used to normalize the level of utility so that . The CDHP is used
as the baseline choice because of its distinct plan design. We normalize for scale by setting the
variance of the unobserved preferences for the HMO to 1.
18
It is assumed that is linearly related to the characteristics of the individuals and the
choices ( ), so where is the unobservable component of utility. In matrix
form, we have the relationship . The errors are assumed to be normally
distributed: . In a slight modification of Geweke, Keane and Runkle (1997), the
“stickiness” of plan choice across time is captured via an AR(1) relationship in the error terms.
Specifically, , with iid . The variance matrix is a diagonal
matrix, whose diagonal elements are plan-specific variances. The AR(1) coefficient, ,
is a plan-specific parameter, based on the plan individual i is in at time , when the time t plan
election is made. Given the apparent persistence in plan market shares, we anticipate estimated
values of closer to one than to zero.17
Convergence is often elusive when estimating the parameters of a multinomial probit
model using a classical maximum likelihood approach. However, the parameters of this
autoregressive multinomial probit model can be readily estimated using Bayesian methods. As in
the cost model, we specify a diffuse prior distribution for the parameters and simulate the
posterior distribution using the Gibbs sampler and data augmentation techniques for the latent
utilities. An appendix available from the corresponding author describes the likelihood function
and Gibbs algorithm in detail.
The vector includes the following plan invariant variables: age, female indicator, age
interacted with female, and CW (the measure of health status in the year the enrollment election
is made). The parameters on these variables can vary by plan. The vector also includes the
17 It is an open question whether εit captures pure, exogenous preference differences for plans or if captures switching costs that accrue as patients develop relationships with providers. That is, it is an open question whether the errors capture state dependences (e.g switching costs) or correlated heterogeneity. In the results section we provide some evidence that suggests that correlated errors do not capture switching costs.
19
following variables that vary across plan and time: natural log of net salary,
, and the expected value of the natural log of out-of-pocket (OOP)
expenditure, the variance of the natural log of the OOP expenditure, and the office visit
copayment level for the plan.
Our model imposes fewer restrictions than traditionally used multinomial estimation
approaches. Because the elasticity estimates from our AR-MNP model differ from the MNL
estimates, it is instructive to decompose the source of those differences into the importance of
capturing heteroskedasticity and modeling persistence in health plan preferences.
Misspecification of the variance-covariance structure can lead to biased coefficient estimates in
discrete choice estimation, however in many cases that misspecification does not significantly
impact coefficient estimates. That will not be true in our case and thus it is instructive to isolate
the source of the misspecification bias. Specifically, we decompose differences into the
importance of accounting for serial correlation and cross-plan correlation in the residuals. For
comparison purposes we use frequentist methods to estimate a cross-sectional multinomial logit
model and a cross-sectional nested-logit model.18 We also use a Bayesian approach to estimate
an autoregressive, MNP model with a diagonal covariance matrix. The later model is the same
as our base model except the off-diagonals of the variance-covariance matrix are constrained to
zero, which maintains the autoregressive structure but constrains the cross-plan premium
elasticities.
Monte Carlo Analysis
Unlike linear regression framework, failure to account for autocorrelation in the discrete
choice setting can result in large biases in the coefficient estimates of interest. The intuition
18 All cross-sectional models are based on data with one year drawn randomly from the panel for each individual, to eliminate temporal correlation. A cross-sectional multinomial probit model did not converge.
20
underlying this bias is that the coefficients in the discrete choice framework are normalized by
the variance of the error. Static approaches will result in downwardly biased estimates of the
relevant variance and hence biased estimates of parameters of interest.
In this Monte Carlo we generate data on health plan choice for 1,000 individuals over 3
years. The data generating process is constructed to mimic the data we analyze here. We allow
for four plan choices and include covariates that correspond to age, gender, premium and out-of-
pocket co-pays. The realized values of these variables are all randomly generated. Data is
generated using three different auto-regressive parameters: 0, .80, and .95. The premium
elasticity of demand is equal across plans and differs with the AR parameters with the true
elasticity being .6, .4 and .3 for AR parameters of 0, .8 and .95, respectively. We estimate the
parameters using the MNL, multinomial probit, and a multi-period, multinomial probit
approaches. Frequentist methods are used to generate the parameter estimates for the MNL and
the multinomial probit while the Bayesian approach outlined above is used to estimate the multi-
period probit model. In this analysis we focus on the elasticity estimates implied by the estimated
model.
The results show that the static models generate elasticity estimates that are significantly
biased when the true AR parameters is large.19 When the AR parameter is .8, the average bias of
the elasticity estimates from both the MNL and the MNP are upwardly biased by approximately
50%. When the true AR parameter is .95, the average bias from both the static models jumps to
over 100%. Interestingly, the flexibility of the VCV of the MNP model does not help mitigate
the bias. In sum, the results from our simple Monte Carlo exercise suggest that in the presence of
significant persistence in unobserved heterogeneity, discrete choice models will generate large
upward biases in the coefficient estimates.
19 Details of the Monte Carlo estimates are available from the authors upon request.
21
5. Results
Cost Model
The Monte Carlo chains for the cost model converge well after the initial burn-in of 500
iterations.20 Given the space constraints, the cost model contains too many parameters to report
here. However, the posterior distributions for the parameters look sensible. Costs are increasing
in age, female and CW. The model fits the data well and closely replicates the realized
distribution of costs, primarily because the panel data allows the estimation of individual-specific
intercepts. The R2 is 0.56 for claims and 0.68 for out-of-pocket expenses. In Figure 1 we present
the actual and fitted distributions of total expenditures and out-of-pocket costs. The model under-
predicts the number of zeros but otherwise does capture the skewness of both the total cost and
OOP distributions.
The results of the cost model imply that there are significant cost differentials across the
plans. This is not surprising as the plans have different provider networks ranging from a
restrictive HMO network to the open access CDHP plan. The plans also impose different out-of-
pocket payment structures that also can affect marginal utilization. For example, the HMO
generally has constant co-payments for a doctor visit while the CDHP will have very low
effective out-of-pocket expenditures for both low and high utilization, and high out-of-pocket in
the doughnut hole. These differences in plan features likely translate into important differences
in utilization and prices paid to providers which, in turn, lead to plan-specific cost differences.
To quantify the cost differentials we perform the following experiments. We take the
entire sample population and estimate their mean expenditures (the total of expenditures paid
out-of-pocket and by the plan) as if they were all to enroll in each plan. We also take the
20 Details on the convergence of the MCMC chains and estimated parameters are available from the corresponding author.
22
enrollees of each plan and estimate the costs of the enrolling that population in the other plans.
This latter exercise provides a sense of the differential selection into each plan.
Table 3 presents the results of this exercise. These results show that health status based
selection is present in our data and economically important. The realized HMO costs are
predicted to be 11% lower than if they had to treat the entire enrollment. In contrast, the per-
capita costs for the POS 1 are 18% higher, the POS 2 plan is 43% higher and the CDHP plan is
13% higher than if they enrolled the entire population. The significant cost differentials across
the plans imply that the welfare loss from premium distortions are potentially large. It is
important to note that these cost figures are total cost differentials and do not decompose
selection into easily observable, and hence contractible, and non-contractible components.
The results presented in Table 3 also suggest that the relative costs of the plans differ by
health status. For example, the cost differential between the HMO and the POS 1 plans depends
upon the health status of the comparison populations. The cost of the POS 2 plan is 38% higher
than the HMO for those who enrolled in the HMO but is 29% higher for those who enrolled in
the POS 2 plan.
Health Plan Choice Model
The posterior distribution of the parameters for the choice model is presented in Table 4.
The estimation relies heavily on the data augmentation process of sampling the unobserved latent
utilities, thus the convergence of the Monte Carlo process is somewhat slower than in the cost
model, taking 3000-4000 iterations before the variance parameters appear to converge. We used
a 4500 iteration burn-in for our analysis. For comparison purposes, Table 4 also presents the
results of a multinomial logit model, estimated using maximum likelihood methods.
The results of the AR-MNP model indicate: 1) Selection is present both on easily
observed demographics of age and gender and on the more difficult to observe health status
23
dimension; 2) Enrollees are premium inelastic particularly for the HMO and POS 2 plans; 3)
There is significant autocorrelation in the unobservable dimensions of health plan choice and this
autocorrelation is unrelated to health status; and 4) Many of these conclusions would be reversed
if one relied on the estimates of the more traditional approaches to estimating plan choice (e.g.,
MNL model.)21
Our principal focus in this analysis is on the role of selection in health plan choice. Recall
that the coefficient estimates measure the influence of the variables relative to CDHP. The
estimates in Table 4 indicate that the HMO attracts a lower-cost pool than the CDHP (and most
other plans). The HMO enrolls a younger and more male population. Furthermore, the HMO
also attracts an enrollment that has a lower concurrent weight (CW) than the other plans.
The patterns of selection are more complicated for the POS plans. All else equal, relative
to the CDHP, the POS 1 plans attract a younger population, but enrollment in the third cost tier is
more popular with those who have higher CW. Said somewhat differently, on the basis of easily
observed demographics, the POS 1 plan looks to get a better selection than the CDHP, however
examining only the health status measures the POS 1 plan appears to get some measure of
adverse selection. The lesson we take from these estimates is that adverse selection may occur
along multiple dimensions and simple approaches to risk adjustment may be incomplete and
potentially magnify the impact of selection.22
Table 5 presents the distributions of the posterior for the autocorrelation parameters and
the variance-covariance terms from the AR-MNP. As expected, there is significant persistence in
unobserved preferences towards health plans. In Table 5a, the sample means of the posterior
21 While there are important differences in the models, classic measures indicate that the AR-MNP and MNL models have similar fit. An R2 calculation based on the squared differences between 0/1 indicators of plan election and the probability of electing each plan gives a statistic of 0.42 for both models. Not surprisingly, the AR-MNP model R2 jumps to 0.81 when the 2005 enrollment is predicted conditional on the 2004 actual enrollment. This conditional prediction is not relevant to the MNL model. 22 Finkelstein and Poterba (2004) draw a similar conclusion using data from annuities markets.
24
distribution for the autocorrelation parameters are all above 0.50. Interestingly, the errors
associated with the HMO plan have a significantly lower autocorrelation than the non-HMO
plans. The mean of the posterior distribution of the HMO autocorrelation parameter is 0.58 while
the mean of the posterior distribution for the other plans in excess of 0.90.
The variance-covariance terms are presented in the Table 5b, with the implied correlation
coefficients shown in Table 5c. These estimates look quite sensible. Many of the cross health
plan covariances are significantly different from zero, with 4 of the implied correlation
coefficients in excess of 0.20. This finding, in conjunction with the AR(1) estimates,
convincingly rejects the MNL assumption of iid, homoskedastic errors. The error terms from the
POS 1 plans are all positively correlated with the plans which are closest substitutes (Tier 1 /
Tier 2 and Tier 2 / Tier 3). It is also noteworthy that there is a high correlation between the errors
in the HMO and POS 2, suggesting that they are closer substitutes than one might infer given
that they have very different provider networks.
We now turn our attention to the estimates of the premium sensitivity of health plan
enrollees. In Table 4 the parameter estimate on ln(Salaryi - Premiumj) is positive and practically
none of the posterior sample crosses zero. Increases in premiums led to decreases in enrollment.
This is true for both AR-MNP and MNL models. Table 6 quantifies the magnitudes of the
parameter estimates by presenting the own- and cross-premium elasticity estimates from our base
specification.23 For comparison purposes, Table 6 also presents the elasticity estimates from
several other estimation strategies: Classical MNL, classical nested MNL (the nests being the
POS 1 plans and the other plans), and a Bayesian AR-MNP with a diagonal covariance matrix.
Attempts to model a static MNP model had poor convergence and generated small, nonsensical
positive premium elasticities, thus the results are not presented here.
23 We combine all the POS 1 tiers in this elasticity analysis.
25
The implied price elasticities (from the enrollee’s perspective) from our base model are
low and somewhat lower than previous estimates.24 The estimated average enrollee price
elasticity is -.06. An examination of the plan-specific elasticities reveals that the low estimated
average elasticities are driven by a strong enrollee loyalty towards the HMO and POS 2 plans.
The own price elasticities for these plans are approximately -0.01 and -0.07, respectively, while
the plan-specific elasticities for the other plans are approximately -0.10. The plan-perspective
elasticities (not shown) are much larger than the enrollee perspective elasticities ranging from
-0.06 for the HMO to -1.04 for the CDHP. The cross-premium elasticity estimates reveal some
interesting asymmetric substitution patterns. The POS 1 plan is the closest substitute for all the
other plans. The CDHP is the closest substitute to the POS 1 plan.
The MNL model generates much larger price elasticities than those implied by our
primary model. This result is consistent with the Monte Carlo experiment presented in the
previous section. The mean price elasticity across plans and enrollees from this model is -.14
with non-HMO plan elasticities ranging from -.26 to -.36. Our results reject the underlying
assumptions of the MNL model in this population; and, in so far as the importance of persistence
in enrollee unobserved heterogeneity generalizes to other settings, it suggests that the widespread
use of the iid, MNL model may be significantly inflating health plan elasticity estimates.
Estimated elasticities from the nested MNL are approximately half as large as the MNL
suggesting that allowing for simple forms of heteroskedasticity in the plan choice framework
reduces but does not eliminate bias. The implied own-price elasticities estimates from the AR-
MNP models with the constrained variance matrix imply elasticity estimates that are closer to
our primary model than the logit models. In the discrete choice context, the magnitude of the bias
24 For example, Royalty and Solomon (1999) estimate health plan elasticities for Stanford University employees between -.2 and -.3. Using a very different sample of Medicare beneficiaries, Atherly, Dowd, Feldman (2004) and estimate an out-of-pocket premium elasticity for enrollment in Medicare HMOs of -.13.
26
is increasing in the autoregressive parameter, and our estimated AR coefficient is generally quite
large. We have performed Monte Carlo analyses that also find that the coefficient bias is, in fact,
increasing in the AR parameter.
Not surprisingly, the cross-price elasticities from the constrained model differ from our
base specification. The fact that the AR models generate similar own-price elasticities that are
significantly smaller in magnitude than all the static models suggests that the bias in the own
price estimates is a consequence of a failure to account for persistence in unobserved plan choice
preferences.
The estimated cross-price elasticities (and their economic implications) implied by the
MNL and nested MNL models are very different from those implied by the AR-MNP,
highlighting again that the functional form assumptions of these MNL can drive the estimates.
For example, the MNL estimates suggest that the HMO is the closest substitute for all of the
plans because it has the largest share. The AR-MNP estimates imply that the HMO, the only
option with a closed provider panel, is not the closest substitute for any of the other plans.
While not shown in detail here, we further explore the nature of health plan choice by
constructing elasticity of demand estimates by age, gender and health status. There is a literature
that argues that the healthy are more premium sensitive than the sick and these differential
elasticities affect the dynamic stability of health plan premium setting. In general, our results are
not consistent with this view. In the AR-MNP, there are no differences in the elasticities by
gender and modest differences by age. The mean elasticity for the non-HMO plans for enrollees
30 years and younger is -0.12 while for those enrollees who are 45 years of age and older the
corresponding elasticity is -0.09. We also find there are virtually no differences in the elasticities
across health status quartiles. The lowest CW score quartile has an elasticity of -0.10 for the non-
HMO plans while the highest risk quartile has an elasticity of -0.09.
27
Our estimates suggest that unobserved factors affecting enrollees’ plan choice are highly
persistent. There are two classic explanations for this persistence. First, persistence could be a
function of unobserved preference heterogeneity – enrollees have strong preferences for plans
and those preferences do not change over time. Second, persistence could be endogenously
generated via switching costs. Once an individual enrolls in a plan they develop plan-specific
capital and provider-specific relationships that increase the utility of the plan relative to the
alternatives. Handel (2010) finds evidence of large switching costs in his setting.
As noted by Heckman (1981), distinguishing unobserved heterogeneity from switching
costs can be difficult. However, the richness of our data allows us to construct a reasonable,
indirect test of whether the causes of plan “stickiness” differ by health status. The importance of
switching costs should differ by health status – sicker enrollees have more interactions with their
providers and with the health plan and those interactions carry greater health consequences for
sicker enrollees. Thus, if switching costs are important, sicker enrollees should display more
persistence in their health plan choice than their healthier counterparts. The logic underlying this
test is similar to Strombom, Buchmueller and Feldstein (2002) who use a MNL framework and
find that plan enrollees with a previous admission or cancer diagnoses are less likely to switch
plans.
In order to investigate whether enrollees in our data displayed similar patterns in plan
choice we divide the sample into four quartiles based on health status (CW) and allowed the AR
parameters to vary across the quartiles. In this model, the AR(1) coefficients were held constant
across plans.
Table 7 presents the coefficient estimates from this exercise. The results indicate that
there are statistically significant differences between the healthy and the sick in the
autoregressive coefficient, but the magnitudes are not economically important. The implied
28
premium elasticities between the healthiest and sickest cohorts are essentially identical. Our
estimates do not provide much support for the notion that the healthy are more premium
sensitive and experience less health plan inertia than the sick. That, in turn, suggests that
persistence in health plan choice is a consequence of unobserved heterogeneity and not switching
costs.
The results from the cost and health plan choice models suggest that there is 1) health
plan selection based on easily-observed and hard-to-observe differences in health status; and 2)
employees are relatively premium insensitive when selecting a plan. Thus, the importance of
selection on welfare is unclear. Clearly, the presence of differential selection opens up the
possibility that it may result in large premium distortions as suggested by Culter and Reber
(1998). However, the low premium response of enrollees mitigates the impact of these premium
distortions on plan selection and total welfare. To investigate the direction and magnitude of the
welfare consequences requires simulating the impact of alternative premiums setting strategies
on enrollee wellbeing and that is the subject of the next section.
6. Welfare Analysis of Alternative Premium Setting Strategies
The influential Jackson Hole group sought to bring the beneficial effects of market forces
to the health care sector, and one important feature of its managed competition strategy was to
force employees to face the cost differential between different health plans (Enthoven and
Kronick, 1989a and 1989b). However, as noted by Newhouse (1996), Cutler and Reber (1998),
and Cutler and Zeckhauser (2000), unless the premium setting strategy accounts for health status
based selection, managed competition can lead to inefficient premiums. Cutler and Reber (1998)
argue that, in fact, Harvard University’s implementation of this managed competition approach
to premium setting led to significant selection induced welfare loss.
29
In our context, the welfare harm arises because the individuals sort into plans where the
relative (to other plans) cost of their enrollment is greater than the relative utility of that plan. In
order to measure the welfare harm associated with this suboptimal premium setting, we calculate
the impact of alternative premium setting strategies on the compensating variation of employees,
treating the self-insured organization’s health care budget as fixed.25 The optimal set of
premiums would maximize the utility of enrollees given this fixed budget.
As we have estimated the structural indirect utility parameters, it is straightforward to
perform counterfactual premium experiments via simulation. In each of these experiments we
calculate the expected utility from alternative premium strategies and then find the change in
salary for each individual in our data that would make them indifferent between the new
premium vector and the old premium vector.
More formally, letting Pij denote the probability of individual i of selecting plan j, and
letting Wij denote the estimated expected utility of individual i enrolling in plan j conditional on
choosing plan j, we find the compensating variation, CV, such that:
.
Here, Oldprem and Newprem are the base level premium and the new premium in the
experiment, respectively. In the risk adjustment experiments, we use the 2005 data and
employer’s actual 2005 premium level as the comparison point. We explored the welfare impact
25 If the cost differentials between plans are solely a consequence of differential prices paid to providers and not moral hazard then our approach would still capture the welfare loss as the cost differentials still lead to distorted plan choices given those costs.
30
of shifting to four possible premium-setting scenarios.26 The first three of these scenarios are
strategies that are used by employers, with the first two being the most common. The third
strategy is a modestly sophisticated risk adjustment approach that requires the analysis of
previous claims data. The fourth strategy is the optimal strategy modified from Cutler and Reber
(1998). In each of the scenarios below, is the expected average cost of plan j
conditional on the resulting employee premiums (and the characteristics of the plan j’s
enrollees). In addition, in each of the four scenarios, employee premiums were set so that the
average employer contribution remained at $1,726, the actual 2005 average annual employer
contribution.
1. Fixed percent of premium: where is chosen to
meet the target employer contribution. 2. Fixed dollar contribution with no risk adjustment:
where F is set equal to the target employer contribution. 3. Fixed dollar contribution adjusted to the average population risk:
where is the average concurrent weight
across all employees and is the average concurrent weight of plan j enrollees. Here
the nominal F is adjusted so that the actual average employer contribution matches the target.
4. Optimal Premiums: The differential between the premium for plan j and the HMO is set equal to the weighted difference in plan costs across employees, where the weight is the marginal probability of enrollment for each individual:
The HMO premium, on which all other plan premiums are based is then selected so that the average employer contribution matches the target. This is a generalization of the optimal premium setting strategy as characterized by Cutler and Reber (1998), to a setting with more than two plan options in a random utility environment. This is not a first best outcome, but rather, the maximized utility subject to the constraints of a fixed employer contribution and that enrollees face the same premium schedule.
26An obvious strategy to explore is the Nash premium outcome. However, because our estimated plan perspective elasticities are low at the current premiums, the Nash equilibrium would result in equilibrium premiums an order of magnitude larger than our the current premiums and thus seems to be of little practical relevance.
31
The fixed percent of premiums strategy can be viewed as a crude mechanism to risk
adjust premiums as individuals do not fully incur the cost differential of enrolling in high cost
plans. The fixed dollar contribution is a strategy that promotes the efficient allocation of
enrollees across plans; the high cost plans are not subsidized for higher health care risk, and
enrollees join high cost plans if the utility of those plans exceeds the increased cost. The risk
adjusted, fixed dollar premium is the strategy promoted by the early advocates of managed
competition and attempts to maintain the efficiency aspects of the fixed dollar approach while
mitigating the impact of adverse selection. The optimal premium difference between plans is
given by the difference in the marginal cost of the plans for the marginal enrollee. This is the
approach we take in implementing the optimal premium in this experiment.
As mentioned above, by 2005 this employer partially risk adjusted the premiums using an
ad hoc approach. So the baseline premiums should not be viewed as representing a state “no risk
adjustment,” but rather are crudely risk adjusted using an ad hoc risk approach.
The results of this simulation are presented in Table 8. The premiums based on fixed
employer contribution, with and without risk adjustment, are very different from the baseline
premiums levels. The HMO becomes significantly cheaper and the CDHP plan is much more
expensive. All of the premium setting strategies yield welfare improvements. However, despite
the large changes in premiums, the magnitudes of the increases are very modest. The best
performing strategies are the nearly identical third and fourth scenarios, which result in a
compensating variation decrease of approximately $13, or 0.5% of average total health care
expenditures. It is significant that the optimal strategy can be achieved using a fairly simple risk
adjustment methodology when historical health claims data are available.
In order to get a sense of the importance of the underlying model in affecting welfare
conclusions, we also calculate the welfare using the estimates from the Multinomial Logit model.
32
The implied estimated improvement in welfare is significantly higher for the MNL. The welfare
gain from the optimal premium is $275 or 10% of total expenditures – more than 20 times the
estimates from our base model. So not only does it appear that the MNL approach (and its static
cousins) yield biased estimates of the elasticities, it also affects welfare inferences.
Cutler and Reber (1998) find that moving to a fixed contribution setting from one that
subsidized high cost plans at Harvard University increased health status based selection and
results in welfare loss of between 2 to 4 percent of baseline spending. To compare our results to
Cutler and Reber’s, simply difference the compensating variation values of scenarios (2) and (1)
in Table 8. Our results indicate that a change in premium setting strategy such as the one that
occurred at Harvard would actually induce a small welfare gain of $0.84. Our results differ from
Cutler and Reber’s, in part, because of the difference in the estimated premium elasticities.
Cutler and Reber estimate the average out-of-pocket premium elasticity between -0.3 and -0.6 –
a figure that is five to ten times larger than our estimates. The lower the price sensitivity, the
lower the welfare loss due to adverse selection as premium distortions do translate into
distortions in the distribution of enrollees across health plans. Our welfare loss magnitudes are
similar to those in Einav, Finkelstein and Cullen (2010). They estimate the welfare loss from
adverse selection to be $9.55 per year for enrollees in a large multinational firm.
In Table 9 we analyze the distribution of the welfare impact of the different premium
setting strategies across employees. In general, the low salary, healthier and younger employees
see an increase in their welfare from risk-adjusting premiums, while older, sicker employees and
those with higher salaries see a decline in welfare. This finding highlights that employers may be
hesitant to risk-adjust premiums not only because the average welfare gain may be small, but
also because they have other goals besides maximizing total employee welfare subject to an
expenditure constraint. Executives and administrators who make health benefit decisions are
33
likely to be older and better paid and thus would likely be made worse off under a risk
adjustment scheme. In addition, the firm may seek to transfer welfare from the healthy to the sick
and, in our context, risk-adjusting premiums would undermine that goal.
In sum, in our setting, there is significant health status based selection into health plans.
The magnitude of selection can be seen by examining the large differences between the risk-
adjusted premiums and the actual premiums set by the employer. However, the welfare impact of
this selection is small. This finding highlights that most economists’ intuitive reaction that
selection must lead to market distortions is not necessarily correct. The welfare loss will be a
potentially complex function of both the degree of selection, the price elasticity of demand and
the cost-differentials across plans.27
7. Conclusion
Using flexible models of health insurance choice and medical care expenditures that
remove many of the unappealing assumptions of traditional approaches, we estimate the welfare
impact of cost-based health plan selection. Our estimates suggest that adverse selection is present
and economically important – there are significant mean cost differences across plans as a
consequence of differential enrollee health status. Importantly, this selection does not translate
into significant welfare loss induced by pricing distortions and the implementation of risk-
adjusted premiums only improves welfare marginally. This finding highlights the tension
between health insurance competition, cost-based selection and efficiency. Welfare loss due to
selection is low because premium elasticities are low. However, since estimated premium
elasticities are low it suggests that market-based health plan competition will not result in
premiums near marginal cost. The standard prescription to resolve these conflicting incentives
27 Newhouse (1996) points out that increasing competition in health insurance markets can result in larger welfare loss do to selection. Our results are consistent with that observation, as competition should lead to larger premium elasticities and thus larger distortions.
34
induced by health plan competition is to risk-adjust premiums. While this strategy can certainly
solve the selection problem, and in our experiment led to a nearly-optimal premium
configuration, risk adjustment has historically been difficult to implement and is the exception in
employment-based health insurance (Kennen et al., 2001). Our results suggest why this is the
case.
35
References
Akerlof, G. (1970) “The Market for `Lemons’: Quality Uncertainty and the Market Mechanism,” Quarterly Journal of Economics, Vol. 84, No. 3, pp. 488-500.
Altman, D., D Cutler and R. Zeckhauser, (2003) “Enrollee Mix, Treatment Intensity, and Cost in
Competing Indemnity and HMO Plans,” Journal of Health Economics, v 22 , no 1, pp 23-45. Atherly, A., B. Dowd and R. Feldman (2004) “The Effect of Benefits, Premiums and Health
Risk on Health Plan Choice in the Medicare Program,” Health Services Research, v 39, no 4, Part 1, pp 847-64.
Bajari, P., H. Hong, A. Khwaja and T. Marsh (2010) “A Semiparametric Analysis of Adverse
Selection and Moral Hazard in Health Insurance Contracts,” University of Minnesota Mimeograph.
Barrett, G., and R. Conlon. (2003) “Adverse Selection and the Decline in Private Health
Insurance Coverage in Australia: 1989-95,” Economic Record, v79, no 246, pp 279-96. Buchmueller, T. (2000) “The Health Plan Choices of Retirees Under Managed Competition,”
Health Services Research, v 35, no 5, Part I, pp 949-976. Cardon, J. H., and I. Hendel (2001) “Asymmetric information in health insurance: evidence from
the National Medical Expenditure Survey,” RAND Journal of Econometrics, v 32, no 3, Autumn 2001, pp 408-427.
Carlin, C. and R. Town (2010) “How Does Managed Care Do It Redux?” University of
Minnesota Mimeograph. Cawley, J. and T. Philipson (1999) “An Empirical Examination of Information Barriers to Trade
in Insurance,” American Economic Review, v 89, pp. 827-46. Chiappori, P.A. and B. Salanie, (2000) “Testing for Asymmetric Information in Insurance
Markets,” Journal of Political Economy, v 108, pp. 56-78. Chib, S. (1992). “Bayes inference in the Tobit censored regression model,” Journal of
Econometrics 51 (1-2), 79–99. Cowles, M. K., B. P. Carlin and J. E. Connett. (1996) “Bayesian Tobit Modeling of Longitudinal
Ordinal Clinical Trial Compliance Data with Nonignorable Missingness,” Journal of the American Statistical Association, v 91, no 433, 86-98.
Cutler, D. and S. Reber (1998) “Paying for Health Insurance: The Trade-off Between
Competition and Adverse Selection,” Quarterly Journal of Economics, v 113, no 2, pp. 433-466.
36
Cutler, D. and R. Zeckhauser (2000) “Anatomy of Health Insurance,” in Handbook of Health Economics, Volume 1A, Culyer, A. and Newhouse, J., editors, Amsterdam: Elsevier
Deb, P., M. K. Munkin and P. K. Trivedi (2006). “Private Insurance, Selection, and Health Care
Use: A Bayesian Analysis of a Roy-Type Model,” Journal of Business & Economic Statistics, v 24, no 4, pp. 403-415.
Dowd, B. and R. Feldman. (1994-5) “Premium Elasticities of Health Plan Choice,” Inquiry, v 31,
no 4, pp 438-44. Dowd, B., R. Feldman and R. Coulam. (2003) “The Effect of Health Plan Characteristics on
Medicare+Choice Enrollment,” Health Services Research, v38, no 1 Pt 1, pp 113-135. Einov, L. A. Finkelstein and M. Cullen (2010) “Estimating welfare in Insurance Markets using
Prices,” Quarterly Journal of Economics, Forthcoming. Einov, L. A. Finkelstein and P. Schrimpf (2007) “The Welfare Cost of Asymmetric Information:
Evidence from The UK Annuities Market,” NBER Working Paper #13228. Enthoven, A., and R. Kronick, (1989a) “A Consumer-Choice Health Plan for the 1990s, Part 1,”
New England Journal of Medicine, v 320, no 1, pp 29-37. Enthoven, A., and R. Kronick, (1989b) “A Consumer-Choice Health Plan for the 1990s, Part 2,”
New England Journal of Medicine, v 320, no 2, pp 94-101. Feldman, R., and B. Dowd. (2000) “Risk Segmentation: Goal or Problem?” Journal of Health
Economics, v 19, no 4, pp 499-512.
Feldman, R., M. Finch, B. Dowd, S. Cassou. (1989) “The Demand for Employment-Based Health Insurance Plans,” Journal of Human Resources, v 24, no 1, pp 115-42.
Finkelstein, A. and J. Poterba. (2004) “Adverse Selection in Insurance Markets: Policyholder Evidence from the U.K. Annuity Market,” Journal of Political Economy, v . 112, pp.183–208
Gelfand, A. and A. Smith, (1990) “Sampling Based Approaches to Calculating Marginal Densities,” Journal of the American Statistical Association, v 85, no 410, pp 398-409.
Geweke, J. F., M. P. Keane and D. E Runkle. (1997) “Statistical inference in the multinomial
multiperiod probit model,” Journal of Econometrics, v 80, pp 125-165. Geweke, J. F., G. Gowrisankaran and R. J. Town, (2003) “Bayesian inference for hospital quality
in a selection model.” Econometrica, v 71, pp 1215 – 1238. Gray, B., and T. Selden. (2002) “Adverse Selection and the Capped Premium Subsidy in the
Federal Employees Health Benefits Program,” Journal of Risk and Insurance, v 69, no 2, pp 209-224.
37
Handel, B. (2010) “Adverse Selection and Switching Costs in Health Insurance Markets: When Nudging Hurts,” Northwestern University PhD thesis.
Harris, K., J. Schultz and R. Feldman. (2002) “Measuring Consumer Perceptions of Quality
Differences Among Competing Health Plans,” Journal of Health Economics, v 21, no 1, pp. 1-17.
Heckman, J. (1981) “Heterogeneity and State Dependence,” in Studies in Labor Markets, S. Roden, ed., Chicago: University of Chicago Press.
Keenan, P., M. Beeuwkes Buntin, T. McGuire and J. Newhouse. “The Prevalence of Formal Risk Adjustment in Health Plan Purchasing,” Inquiry, v 38, no 3, Fall 2001, pp 245-259.
Marquis, M. S., and M. R. Holmer. (1996) “Alternative Models of Choice Under Uncertainty and Demand for Health Insurance,” The Review of Economics and Statistics, v 78, no 3, pp 421-427.
Newhouse, J. (1996) “Reimbursing Health Plans and Health Providers: Efficiency in Production
Versus Selection,” Journal of Economic Literature, v. 34, no. 3, 1236-63. Parente, S., R. Feldman and J. Christianson, (2004) “Employee Choice of a Consumer-Driven
Health Insurance in a Multiplan, Multiproduct Setting,” Health Services Research, vol 39, no 4, Supplement Part 2, pp 1091-1111.
Pietz, K. and Petersen, L. (2007) “Comparing Self-Reported Health Status and Diagnosis-Based
Risk Adjustment to Predict 1- and 2 to 5 Year Mortality,” Health Services Research, 42(2):629-643.
Riphahn, R., A. Wambach and A. Million. (2002) “Incentive Effects in the Demand for Health
Care: A Bivariate Panel Count Data Estimation,” Journal of Applied Econometrics, v 18, no 4, pp 387-405.
Rothschild, M., and J. Stiglitz, (1976) “Equilibrium in Competitive Insurance Markets: An Essay on the Economics of Imperfect Information,” Quarterly Journal of Economics, v 90, no 4, pp 630-649.
Royalty, A., N. Solomon, (1999) “Health Plan Choice: Price Elasticities in a Managed
Competition Setting,” Journal of Human Resources, v 34, no 1, pp. 1-41. Scanlon, D., M. Chernew, C. McLaughlin, G. Solon (2002) “The Impact of Health Plan Report
Cards on Managed Care Enrollment,” Journal of Health Economics, v 21, no 1, pp. 19-41. Shea, D., J. Terza, B. Stuart, and Briesacher, B. (2007) “Estimating the Effects of Prescription
Drug Coverage for Medicare Beneficiaries,” Health Services Research, 42, no 3, pp. 933–949.
38
Strombom, B. A., T. C. Buchmueller and P. J. Feldstein (2002) “Switching Costs, Price Sensitivity and Health Plan Choice,” Journal of Health Economics, v 21, no 1, pp. 89-116.
Tanner, M. A., and W. H. Wong. (1987) “The Calculation of Posterior Distributions by Data
Augmentation,” Journal of the American Statistical Association, v 82, no 398, pp. 528-540.
Tchernis, R., S. Normand, J. Pakes, P. Gaccinoe, and J. Newhouse, (2003) “Health and Health Insurance: Analysis of Plan Switching Behavior,” Harvard Medical School Mimeo.
39
Table 1 Annual premiums and health plan shares 2002-2005
2002 2003 2004 2005
Share
Premium ($)
Share
Premium ($)
Share
Premium ($)
Share
Premium ($)
HMO .622 0.00 .688 0.00 .688 382 .686 408
Tier 1 .077 0.00 .044 21 .040 741 .047 770
Tier 2 .096 242 .082 247 .080 1,064 .072 1,074 POS 1
Tier 3 .073 522 .095 512 .050 1,474 .031 1,721
POS 2 .090 1,346 .060 1,258 .064 1,463 .086 783
CDHP .041 329 .061 194 .074 670 .073 793
Table 2 Demographics and health status by plan (standard deviations in parentheses)
POS
All Enrollees
HMO Tier 1 Tier 2 Tier 3
POS2
CDHP
Age 46.0 (11.4)
44.3 (11.4)
47.5 (9.8)
48.7 (10.2)
48.5 (10.0)
52.1 (10.3)
50.4 (10.5)
Percent Female
62.5 (48.4)
59.2 (49.1)
76.0 (42.7)
71.6 (45.1)
64.2 (48.0)
71.9 (44.9)
63.8 (48.1)
Salary ($) 48,810 (25,127)
45,249 (21,920)
51,489 (25,197)
49,149 (21,950)
53,772 (29,127)
61,283 (32,761)
65,096 (33,757)
Concurrent Weight
1.78 (3.30)
1.47 (2.76)
2.59 (4.42)
2.22 (3.26)
2.41 (2.76)
2.85 (3.96)
2.14 (3.85)
40
Table 3 Estimated per enrollee cost of alternative enrollments
Estimated Annual Per-Capita Cost ($) if Enrolled
in Plan HMO POS 1 POS 2 CDHP
HMO 1,727 2,471 2,377 2,928 POS 1 2,323 3,227 3,077 3,898 POS 2 2,929 3,897 3,771 5,026
Currently Enrolled in Plan
CDHP 2,178 2,970 2,956 3,716 All in One Plan 1,941 2,731 2,630 3,289
41
Table 4 Posterior means and coefficient point estimates for choice models (standard deviations of posterior in parentheses) [standard errors in brackets]
Parameter Bayesian AR(1)
Probit Classical MNL HMO Intercept 1.7306 *** 5.9475 +++ (0.1405) [0.6413] POS 1, Tier 1 Intercept 0.4449 *** 2.3547 +++ (0.1163) [0.8138] POS 1, Tier 2 Intercept 0.2987 ** 2.6903 +++ (0.1211) [0.7172] POS 1, Tier 3 Intercept 0.2838 *** 2.6077 +++ (0.1128) [0.7722] POS 2 Intercept 0.1782 1.5955 + (0.1235) [0.9356] HMO Age -0.0168 *** -0.0670 +++ (0.0025) [0.0104] POS 1, Tier 1 Age -0.0083 *** -0.0452 +++ (0.0022) [0.0147] POS 1, Tier 2 Age -0.0054 *** -0.0361 +++ (0.0021) [0.0123] POS 1, Tier 3 Age -0.0037 * -0.0309 ++ (0.0020) [0.0129] POS 2 Age 0.0002 -0.0082 (0.0022) [0.0159] HMO Female -0.4624 *** -1.4261 ++ (0.1447) [0.6507] POS 1, Tier 1 Female -0.1887 -0.5932 (0.1216) [0.9150] POS 1, Tier 2 Female -0.1677 -0.9128 (0.1169) [0.8141] POS 1, Tier 3 Female -0.1045 -0.3820 (0.1207) [0.8550] POS 2 Female -0.2095 -0.5777 (0.1388) [0.9895] HMO log(Health Status) -0.0186 *** 0.0269 ++ (0.0073) [0.0125] POS 1, Tier 1 log(Health Status) 0.0063 0.0223 (0.0070) [0.0180] POS 1, Tier 2 log(Health Status) 0.0065 0.0252 (0.0069) [0.0157] POS 1, Tier 3 log(Health Status) 0.0131 ** 0.0073 (0.0069) [0.0165] POS 2 log(Health Status) 0.0138 * 0.0189 (0.0074) [0.0186]
42
Table 4, continued Posterior means and coefficient point estimates for choice models (standard deviations of posterior in parentheses) [standard errors in brackets]
Parameter Bayesian AR(1)
Probit Classical MNL HMO Female x Age 0.0079 *** -0.0761 (0.0030) [0.0483] POS 1, Tier 1 Female x Age 0.0051 ** 0.0976 (0.0026) [0.0735] POS 1, Tier 2 Female x Age 0.0049 * 0.1524 ++ (0.0024) [0.0600] POS 1, Tier 3 Female x Age 0.0019 0.1681 ++ (0.0025) [0.0670] POS 2 Female x Age 0.0054 ** 0.1707 ++ (0.0028) [0.0713] log(Salaryi-Premiumj) 1.5253 ** 19.2587 +++ (0.6302) [3.7340] E(logOOP) -0.0242 ** -0.0699 (0.0119) [0.1128] Var(logOOP) -0.0078 0.0454 (0.0064) [0.0601] Office Visit Copay -0.0130 *** -0.0378 +++ (0.0016) [0.0115]
* 90% Bayesian confidence interval excludes zero. + 90% classical confidence interval excludes zero. ** 95% Bayesian confidence interval excludes zero. ++ 95% classical confidence interval excludes zero. *** 99% Bayesian confidence interval excludes zero. +++ 99% classical confidence interval excludes zero.
43
Table 5 Estimated parameters of variance matrix
Table 5a – AR(1) parameters AR(1) Model
HMO POS 1, Tier 1 POS 1, Tier 2 POS 1, Tier 3 POS 2 CDHP Rho 0.6065 *** 0.9895 *** 0.9790 *** 0.9551 *** 0.9966 *** 0.9966 *** (0.0271) (0.0094) (0.0149) (0.0232) (0.0033) (0.0033) Tau2 0.0891 *** 0.1100 *** 0.1104 *** 0.0989 *** 0.1193 *** (0.0095) (0.0133) (0.0135) (0.0116) (0.0144)
Table 5b – Across-plan variance/covariance matrix Across-Plan Variance-Covariance Matrix (std dev) HMO POS 1, Tier 1 POS 1, Tier 2 POS 1, Tier 3 POS 2 HMO 1.0000 *** (0.0000) POS 1, Tier 1 0.2188 *** 0.1916 *** (0.0407) (0.0341) POS 1, Tier 2 0.0420 0.0302 * 0.1619 *** (0.0439) (0.0161) (0.0329) POS 1, Tier 3 0.0677 ** 0.0073 0.0650 *** 0.1862 *** (0.0325) (0.0140) (0.0267) (0.0439) POS 2 0.2520 *** 0.0517 *** -0.0150 0.0216 0.2595 *** (0.0414) (0.0194) (0.0180) (0.0188) (0.0499)
Table 5c – Across-plan implied correlation coefficients Implied Across-Plan Correlation Coefficients HMO POS 1, Tier 1 POS 1, Tier 2 POS 1, Tier 3 POS 1, Tier 1 0.4998 POS 1, Tier 2 0.1045 0.1713 POS 1, Tier 3 0.1569 0.0388 0.3744 POS 2 0.4948 0.2318 -0.0732 0.0983
* 90% Bayesian confidence interval excludes zero. ** 95% Bayesian confidence interval excludes zero. *** 99% Bayesian confidence interval excludes zero.
44
Table 6 Own- and Cross-Price Elasticities
AR(1) Multinomial Probit Elasticities: Unconstrained Covariance Matrix (Base Model)
HMO POS 1 POS 2 CDHP HMO -0.007 0.014 0.017 0.012 POS 1 0.017 -0.104 0.048 0.140 POS 2 0.003 0.008 -0.068 0.017 CDHP 0.002 0.018 0.015 -0.103
Multinomial Logit HMO POS 1 POS 2 CDHP HMO -0.054 0.120 0.113 0.124 POS 1 0.078 -0.417 0.095 0.087 POS 2 0.023 0.030 -0.293 0.030 CDHP 0.024 0.026 0.028 -0.306
Nested Logit HMO POS 1 POS 2 CDHP HMO -0.052 0.106 0.095 0.110 POS 1 0.081 -0.344 0.068 0.093 POS 2 0.006 0.006 -0.250 0.008 CDHP 0.032 0.035 0.034 -0.265 AR(1) Multinomial Probit Elasticities: Diagonal
Covariance Matrix HMO POS 1 POS 2 CDHP HMO -0.011 0.030 0.027 0.047 POS 1 0.017 -0.147 0.046 0.134 POS 2 0.004 0.011 -0.106 0.029 CDHP 0.006 0.027 0.025 -0.196
Note: Cell entry i,j, where i indexes row and j column, give the percentage change in market share of i with a 1% increase in the price of j.
45
Table 7 AR(1) coefficients by risk group
AR Coefficient Sample Mean (std dev) of Posterior
Healthiest Quartile 0.8564 *** (0.0189) 2nd Quartile 0.8442 *** (0.0177) 3rd Quartile 0.9226 *** (0.0221) Sickest Quartile 0.9152 *** (0.0190)
*** 99% Bayesian confidence interval excludes zero.
46
Table 8 Employer pricing scenarios – Impact on welfare and claims cost Annual Employee Contribution
Actual % of
Premium
Fixed $, no Risk
Adj
Fixed $, w/ Risk
Adj Optimal Premium
HMO $408.20 $466.18 $112.06 $250.12 $230.36 Tier 1 $769.60 $809.64 $1,470.30 $1,223.04 $1,262.30 Tier 2 $1,073.80 $815.36 $1,491.10 $1,237.60 $1,248.26 POS 1 Tier 3 $1,721.20 $823.42 $1,518.14 $1,150.76 $1,311.96
POS 2 $782.60 $809.38 $1,466.92 $768.30 $899.86 CDHP $793.00 $1,019.20 $2,301.26 $2,015.00 $1,965.60 Compensating Variation -- ($7.68) ($8.52) ($12.74) ($12.81) std dev 13.5 67.4 35.8 41.5
Break-Down of Average Annual Per-Capita Costs 1 a. Employer Contribution $1,726 $1,726 $1,726 $1,726 $1,726 b. Employee Contribution $591 $589 $563 $568 $568 c. Plan Cost (1a+1b) $2,316 $2,314 $2,289 $2,294 $2,294 2. Ave Employee OOP Cost $368 $366 $359 $361 $361 3. Total Claims Cost (1c+2) $2,684 $2,680 $2,648 $2,655 $2,655
47
Table 9 Employer pricing scenarios – Distribution of welfare
Count Ave Age
% Female
Ave cwt
Average Income
Average Compensating
Variation Current Premiums 3578 47.0 62% 1.857 $49,299 --- % of Premium Benefit/Same 2565 49.1 72% 2.408 $51,547 ($14.24) Harmed 1013 41.8 38% 0.461 $43,608 $8.93 Total 3578 47.0 62% 1.857 $49,299 ($7.68) Fixed $, no Risk Adjustment Benefit/Same 1811 38.5 52% 1.147 $44,347 ($61.18) Harmed 1767 55.7 73% 2.584 $54,375 $45.45 Total 3578 47.0 62% 1.857 $49,299 ($8.52) Fixed $, with Risk Adjustment Benefit/Same 2230 41.1 60% 1.452 $45,975 ($33.39) Harmed 1348 56.8 67% 2.527 $54,798 $21.41 Total 3578 47.0 62% 1.857 $49,299 ($12.74) Optimal Premium Benefit/Same 2104 39.8 58% 1.297 $45,507 ($35.95) Harmed 1474 57.4 70% 2.655 $54,712 $20.21 Total 3578 47.0 62% 1.857 $49,299 ($12.81)
48
Figure 1 Comparison of actual and predicted claims and out-of-pocket payments