price, quality, and variety: measuring the gains from trade in

Price, Quality, and Variety: Measuring the Gainsfrom Trade in Differentiated Products

Gloria Sheu∗

US Department of Justice

September 2011

Abstract

The empirical trade literature has found that the ability to import differentiated prod-ucts has significant positive welfare effects. These gains have been established usingprice indices that measure the value to consumers of changes in the set of goods avail-able over time. In this paper, I explore how these price indices are shaped by theirunderlying structural assumptions. I draw parallels with methods used in industrialorganization, where a number of researchers have also studied differentiated products,albeit in domestic markets. I show that a standard trade model, the nested constantelasticity of substitution (NCES) framework, produces the same market demand sys-tem and price index as a standard industrial organization model, the nested logit (NL).This finding allows me to connect commonly used NCES and NL variants into onecoherent family. Furthermore, I show how standard NCES empirical techniques, whichrely on data that aggregates over individual products, relate to the product-level datamethods more commonly used in logit settings. I then apply these methods to a dataset on Indian imports of computer printers, highlighting how these approaches differ ina concrete example. In this application, I find that using aggregated data understatesthe gains in the price index by 31 to 43 percent relative to using product-level data.Furthermore, loosening the assumed substitution structure between goods raises theprice index by over 60 percent.

∗The views expressed here are not purported to reflect those of the US Department of Justice. Iwould like to thank Pol Antras, Elhanan Helpman, Julie Mortimer, and Ariel Pakes for their guidanceand support on this project. This paper has also benefitted from discussions with Deepa Dhume,Oleg Itskhoki, Greg Lewis, Marc Melitz, David Mericle, Nathan Miller, Eduardo Morales, and MarcRemer. I am greatly indebted to George Gibson of the Xerox Corporation for giving me access tothe printer data set and for helping me understand the Indian printer market. Mary Carlin at theXerox Corporate Library also provided key assistance in obtaining the printer data. Petia Topalovagraciously made the Indian tariff data available. All errors are my own. First version: November 2009.Email: [email protected]

1 Introduction

As emphasized in the seminal work of Krugman (1979), an important channel by which

countries can gain from trade is through increased access to differentiated products.

Imported goods widen the choice set available to consumers by providing a different

combination of price, quality, and variety than domestic goods alone. Although the

benefits of cheaper imports have long been recognized, consumers also place value on

the varying qualities (that is, the unique mix of non-price characteristics) provided by

foreign differentiated goods. Furthermore, access to imported products may increase the

number of goods available for a given distribution of price and quality, thus increasing

variety. All three of these forces contribute to the gains from trade.

After establishing these effects theoretically, the next step has been to measure them

empirically. Feenstra (1994) facilitated these efforts by introducing a simple procedure

for computing the price index from a differentiated products demand system. This

price index measures how much prices on one set of products would have to fall (or

rise) in order to give consumers the same welfare as that from a different set. The

Feenstra (1994) method has been widely adopted, and much of the resulting literature

underscores the importance of accounting for differentiated products when measuring

the gains from trade.

Now that these papers have confirmed the existence of welfare gains, what more

can be learned? Can the standard theoretical and empirical techniques be refined in

order to better understand the causes of these welfare effects? In considering these

questions, it is important to examine the empirical industrial organization literature

on differentiated products. Like the aforementioned trade literature, there is a rich

industrial organization tradition studying how consumers are affected by changes in

the price, quality, and variety of differentiated products. The basic research question

is the same, just applied to domestic empirical examples instead of international ones.

Therefore, examining the relationship between the methods used in these literatures

shows how the results in trade are shaped by the techniques used to construct them.

The purpose of this paper is to reach a more fundamental understanding of the

international trade approach towards differentiated products by comparing it to tech-

niques that are usually confined to industrial organization. My analysis has two key

components. First I show that the seemingly disparate theoretical frameworks used

in trade and industrial organization are actually tightly linked. More specifically, the

workhorse models of international trade, the constant elasticity of substitution (CES)

1

and the nested constant elasticity of substitution (NCES) setups, produce the same

market demand functions and price indices as two of the most common models in

industrial organization, the multinomial logit (MNL) and the nested logit (NL), re-

spectively. As a consequence, the differences between trade and industrial organization

empirical findings based on these models are driven more by differences in data and

empirical techniques than by differences in theory. Furthermore, this result means that

the random coefficients framework, another common model in industrial organization,

is also connected to the NCES as an extension of the NL that I call the nested random

coefficients logit (NRCL). Therefore, the extra impact of random coefficients can be

assessed through a comparison between the NL and the NRCL.

Second, I present an empirical application that shows how the trade and industrial

organization approaches can differ in practice. Using a product-level data set on im-

ports of computer printers into India, I estimate several industrial-organization-style

logit models. Furthermore, by aggregating this data, I am also able to implement stan-

dard trade methods. This exercise produces three main findings. First, by obscuring

improvements in quality and variety amongst the underlying goods, I find that the

aggregated data commonly used in trade understate the gains in the price index by 31

to 43 percent. Second, the MNL, with its restrictive substitution structure, overstates

gains in the price index by 65 to 68 percent relative to the NL and NRCL. Third, the

addition of random coefficients reveals important heterogeneity across types of con-

sumers. The NRCL price index for certain subgroups of consumers is 53 percent lower

than the market-level NL index.

There are two main distinctions between how trade and industrial organization ap-

proach the study of differentiated products. First, there is a difference in terms of data.

The standard in the trade literature is to use data collected from customs authorities.

Customs information is often the only data available that exhaustively covers the im-

ports of an entire nation. As a result, these data are the preferred choice for trade

papers, where the emphasis is on studying the economy-wide effects of international

trade. However, a drawback of this data is that it aggregates individual goods into

“Harmonized System” (HS) codes. A product is defined as an HS code/supplier coun-

try pair. When reporting imports of computer printers, for instance, a good could be

defined as broadly as “inkjet printers from the United States.”

In contrast, the industrial organization literature usually relies on detailed product-

level data. Continuing with the printer example, a typical data set would report the

quantities sold and prices of individual models. The “HP Deskjet 630C” and the “HP

2

Deskjet 1220C” would be separate observations, recognizing the fact that the latter

model prints at twice the speed as the former. Not only does this level of analysis

match the level of differentiation relevant to consumers, it also allows the researcher

to supplement the sales data with product characteristics information. For instance,

one could obtain proxies for the quality of each printer model by incorporating data on

features like print speed or paper capacity. As a result, this data provide more direct

information on price, quality, and variety when compared to customs data. However,

the downside is that such detailed data are only available for selected sectors and

countries.

The second difference between the trade and industrial organization approaches is

in terms of structural modeling. In order to assess the effects on consumers of changes

in products, the researcher assumes a demand system. A number of theoretical trade

papers use the CES demand system, which is derived from the utility maximization

problem of a representative consumer.1 Because of its simplicity, the CES model is

both algebraically elegant and easy to implement empirically. However, the CES also

severely restricts the substitution structure between products. In order to address this

problem, many empirical researchers have adopted an NCES specification, which allows

the substitution parameter within a sector to differ from that between sectors.

Meanwhile, the industrial organization literature has largely relied on the MNL

demand system.2 Unlike the CES, this framework assumes a distribution of heteroge-

neous consumers, each purchasing one type of good. But like the CES, the MNL places

strong restrictions on substitution patterns. In response, researchers have developed a

number of modifications, the most common being nesting (as in the NL) and random

coefficients.3 I combine these two extensions into a unified model, the NRCL.

My results build upon the aforementioned empirical trade and industrial organi-

zation literatures studying the welfare effects of changes in differentiated products.

Although the many contributions in these literatures are too numerous to fully summa-

rize here, key papers in trade include Broda and Weinstein (2006), Broda et al. (2006),

and Goldberg et al. (2010), along with the aforementioned work in Feenstra (1994).

On the industrial organization side, significant developments include Berry, Levinsohn,

1This functional form is sometimes referred to as the Spence-Dixit-Stiglitz demand system afterthe work in Spence (1976) and Dixit and Stiglitz (1977).

2This trend follows from the influential work in McFadden (1974).3The latter type of model is sometimes referred to as a mixed logit. Train (2009) defines the mixed

logit model based on the form of the resulting demand system. He then notes that one way of derivinga mixed logit demand system is through a random coefficients specification.

3

and Pakes (1995) (“BLP”), Goldberg (1995), Berry et al. (1999), Petrin (2002), and

Berry et al. (2004). Khandelwal (2010) is one of the few hybrid papers that uses an

industrial organization model (the NL) with a trade data set. Here I show that the

theory behind this approach is equivalent to the NCES setup.

The closest paper to this one is Blonigen and Soderbery (2010), which uses product-

level data on the US automobile market to study how aggregated HS trade data biases

the NCES price index. They find that using aggregated trade data can greatly under-

state improvements in the price index. I find a parallel result in my data and then

expand upon this analysis by exploring how CES-style models behave relative to logit

models.

In the next section, I describe the CES-based models and the Feenstra (1994) method

for estimating their price indices. Section 3 lays out the basics of the logit models and

relates them back to the CES-based frameworks, followed by a description of empirical

industrial organization techniques for dealing with product-level data. I present results

from the computer printers example in Section 4, including a discussion of the differences

between the CES-based and logit results. Section 5 concludes.

2 CES-Based Frameworks

The NCES model is popular in empirical trade applications because it yields a price

index that combines changes in price, quality, and variety into one easily interpretable

number. Furthermore, the NCES includes the basic CES framework, the workhorse

model of international trade theory, as a special case.

In order to calculate the NCES price index, Feenstra (1994) provides a simple proce-

dure designed for the typical trade data set. This methodology has become the standard

for measuring the gains from imports of differentiated products. These techniques con-

trast with those favored in industrial organization, where estimation using product-level

data is more common.

2.1 The Consumer’s Problem

Assume there is a representative consumer that has a utility function given by

Ut =

(∑g∈G

Mγ−1γ

gt

) γγ−1

, where γ > 1. (1)

4

Here g indexes different groups of products from the set G. The time period is indexed

by t. The quantity consumed of each group is denoted by Mgt, and γ is the elasticity

of substitution between these groups.

Within each product group g the consumer has an inner nested utility function of

the form

Mgt =

∑j∈Jgt

b1σjtm

σ−1σ

jt

σσ−1

, where σ > 1. (2)

Individual products are indexed by j from the set Jgt. The bjt denotes the quality of

good j, and mjt denotes the quantity consumed of good j. The elasticity of substitution

between products within a group is σ.4 Assuming the nesting structure is reasonable,

one would expect σ > γ, meaning that products within a group are closer substitutes

than those in separate groups.

Each time period the consumer’s problem is to maximize current period utility

subject to a budget constraint. This is an entirely static model, with no borrowing or

saving. The consumer can solve the utility maximization problem in two stages. First,

the consumer maximizes Mgt subject to the constraint∑

j∈Jgt pjtmjt = Ygt, where Ygt

is the total money spent on group g. Then the consumer decides on the allocation

of expenditure across groups. This exercise results in the expression for the share of

expenditure allocated to product j within Ygt,

sjt|g =bjtp

1−σjt∑

j∈Jgt bjtp1−σjt

. (3)

In turn, the share of expenditure devoted to group g out of total expenditure is

sgt =

(∑j∈Jgt bjtp

1−σjt

) 1−γ1−σ

∑g∈G

(∑j∈Jgt bjtp

1−σjt

) 1−γ1−σ

. (4)

Multiplying these two expressions gives the share of expenditure allocated to product

4An extension of this model is to allow the elasticity of substitution to vary by group, giving σg. Ido not pursue this variant for simplicity, but its mechanics are similar to those discussed in the maintext.

5

j out of the money spent on all product groups,

sjt =bjtp

1−σjt(∑

j∈Jgt bjtp1−σjt

) γ−σ1−σ ∑

g∈G

(∑j∈Jgt bjtp

1−σjt

) 1−γ1−σ

. (5)

In the special case where σ = γ, the NCES reduces to a simpler framework known

as the CES model. Once the elasticities of substitution in the outer and inner utility

functions are equal, the nesting ceases to have any effect. It is as if all the products are

located in a single group.5 The resulting expenditure shares are given by

sjt =bjtp

1−γjt∑

j∈Jt bjtp1−γjt

. (6)

The CES is one of the most popular differentiated products demand systems in the

theoretical trade literature.6

2.2 The IIA Problem

When quantifying the gains from changes in differentiated products, it is extremely

important to accurately measure the substitutability between goods. For instance, if

one incorrectly finds that a new product is a poor substitute for existing products,

one will mistakenly conclude that this new product greatly increased welfare. The

CES, although useful in a number of theoretical applications, has a highly restrictive

substitution structure. The industrial organization literature commonly refers to this

issue as the “independence of irrelevant alternatives” problem. It is the struggle to solve

this problem that led to the adoption of the NCES in the empirical trade literature.

In order to illustrate the IIA property, note that according to the CES expenditure

share equation (6) the ratio of the quantity demanded for a pair of products 1 and 2 is

m1t

m2t

=b1tp

−γ1t

b2tp−γ2t

,

which does not depend on the other products available. Thus, if a third good is in-

5That is, utility reduces to Ut =

(∑j∈Jt b

1γ

jtmγ−1γ

jt

) γγ−1

.

6The CES and Cobb-Douglas (which the CES reduces to when γ = 1) models have been featuredin standard trade textbooks such as Helpman and Krugman (1985) and in key theoretical papers suchas Dornbusch et al. (1977), Eaton and Kortum (2002), and Melitz (2003).

6

troduced that is identical to good 1 and very different from good 2, the demand ratio

between 1 and 2 will remain constant. One would expect sales of good 1 to fall relative

to good 2, but the CES model does not allow this to occur.

Another way of expressing this problem is using cross-price elasticities, which in the

case of the CES have the following form:

∂mjt

∂pkt

pktmjt

= (γ − 1)skt ∀j 6= k.7 (7)

These elasticities have the property that

∂mjt

∂pkt

pktmjt

=∂mlt

∂pkt

pktmlt

∀j, k, l such that j, l 6= k. (8)

Hausman (1997) argues that because of this property, the CES will overvalue new goods.

If one thinks of a new good being introduced as its price falling from the reservation

level, expression (8) says that demand must flow symmetrically towards the new product

from all other products. This may not be a realistic assumption for many sectors. For

instance, it is unlikely that expenditure will flow equally from an old laser printer and

from an old injet printer to a newly introduced laser printer.

If σ 6= γ, the NCES model can partially alleviate the IIA problem. This effect is

apparent in the NCES cross-price elasticities,

∂mjt

∂pkt

pktmjt

=

{(γ − 1)skt + (σ − γ)skt|g if j and k are in the same group

(γ − 1)skt otherwise(9)

for all j 6= k. Since it is likely that σ > γ, the (σ − γ)skt|g term should be positive and

thus increase the cross-price elasticity between goods of the same type. This adds a level

of realism compared to the CES. However, the IIA problem remains when comparing

goods within the same group.

2.3 The Price Index

A key reason for the popularity of CES-based models is that they yield a simple price

index for measuring the relative benefits of two sets of goods. In order to understand

this index, imagine that the representative consumer is comparing two possible bundles

7This elasticity can be derived by noting that mjt =sjtYpjt

, where Y denotes income.

7

of goods, the bundle available in time t and that available in time t+ 1. These bundles

may vary in terms of the combination of price, quality, and variety they offer. How

could one derive a metric that reflects the difference in value the consumer assigns to

these bundles?

One common way to build this metric is to look for a factor, τNCESt+1 , by which the

prices of all goods in period t would have to fall (or rise) in order to give the same

utility as the set of goods available in t+ 1. This exercise results in the expression

τNCESt+1 =

(∑g∈G

(∑j∈Jgt+1

bjt+1p1−σjt+1

) 1−γ1−σ) 1

1−γ

(∑g∈G

(∑j∈Jgt bjtp

1−σjt

) 1−γ1−σ) 1

1−γ, (10)

which is the standard measure of the gains from imports of differentiated products in

the trade literature.8 In the special case of the CES model, this index reduces to

τCESt+1 =

(∑j∈Jt+1

bjt+1p1−γjt+1

) 11−γ

(∑j∈Jt bjtp

1−γjt

) 11−γ

. (11)

The nested structure of the NCES model means that τNCESt+1 is actually just a ge-

ometric average of CES price indices calculated within each group of products.9 That

is,

τNCESt+1 =∏g∈G

(τCESgt+1

)ωgt+1, (12)

where

τCESgt+1 =

(∑j∈Jgt+1

bjt+1p1−σjt+1

) 11−σ

(∑j∈Jgt bjtp

1−σjt

) 11−σ

and ωgt+1 =(sgt+1 − sgt)/(ln(sgt+1)− ln(sgt))∑g∈G(sgt+1 − sgt)/(ln(sgt+1)− ln(sgt))

.

8This result follows from the form of the indirect utility function, which is

Y/

(∑g∈G

(∑j∈Jgt bjtp

1−σjt

) 1−γ1−σ) 1

1−γ

. Here Y denotes income.

9This result follows from the proof in the appendix of Feenstra (1994).

8

2.4 Estimation

The next task is to estimate this price index. Most empirical trade papers analyze

welfare effects at an economy-wide level. Given this focus, these authors need a method

that can be applied to customs data and requires as few parameters as possible. The

procedure pioneered by Feenstra (1994) satisfies both of these criteria.

In this section I replace the product subscript j with c for “category.” This is

because the standard method uses trade data, where a good is actually an HS category

imported from a certain country. This aggregates over individual products.

Note that once I switch to defining a product using these categories, the bct param-

eters no longer have a pure quality interpretation. Feenstra (1994) shows that these

terms not only reflect the quality but also the number of underlying products in each

category. If the number of sub-products in a product category increases, this in turn

increases its associated bct. Thus, bct captures both quality and variety.

Define Cg = (Cgt+1 ∩ Cgt) as the “common goods” set available for a group in two

different time periods. As shown by Feenstra (1994), so long as one assumes that the

products in Cg have constant bct parameters between t and t+ 1, the following holds:

τNCESt+1 =∏g∈G

∏c∈Cg

(pct+1

pct

)ωcgt+1

ωgt+1 ∏g∈G

(λgt+1

λgt

)ωgt+1σ−1

. (13)

where

λgt =

∑c∈Cg pctmct∑c∈Cgt pctmct

and

ωcgt+1 =(sct+1(Cg)− sct(Cg))/(ln(sct+1(Cg))− ln(sct(Cg)))∑c∈Cg(sct+1(Cg)− sct(Cg))/(ln(sct+1(Cg))− ln(sct(Cg)))

Here sct(Cg) denotes the share of expenditure accounted for by category c out of the

categories in the set Cg.

In the special case of the CES model, this index is calculated without regards to

any groups. There is one common goods set C = (Ct+1 ∩ Ct), giving

τCESt+1 =∏c∈C

(pct+1

pct

)ωct+1(λt+1

λt

) 1γ−1

(14)

9

where

λt =

∑c∈C pctmct∑c∈Ct pctmct

and ωct+1 =(sct+1(C)− sct(C))/(ln(sct+1(C))− ln(sct(C)))∑c∈C(sct+1(C)− sct(C))/(ln(sct+1(C))− ln(sct(C)))

.

Note that sct(C) denotes the expenditure share of product c in time t amongst the

goods in C.

The advantage of this methodology is that the price index can be computed using

only one parameter, the within-group elasticity of substitution, and widely available

trade data. Feenstra (1994) and Broda and Weinstein (2006) show how to estimate

this elasticity of substitution for different industries. Many other authors assume an

elasticity based on estimates in the literature.

Although this procedure is useful, it depends on the assumption that the common

goods do not experience any change in their bct terms. Therefore, both the quality and

variety of the underlying goods in each product category are assumed to be constant.

If quality or variety improve within the common goods, this will not be reflected in the

price index.

In order to solve this problem, one could switch to product-level data and compute

the price index over short enough time periods so that the common goods set is non-

empty. This is the approach studied by Blonigen and Soderbery (2010). Of course, this

method is not feasible for most industries in most countries, where only aggregated trade

data is available. As a result, in practice most researchers assume that HS code/country

pairs appearing in both periods have constant bct parameters.10

2.5 The Price Index Decomposition

A key advantage of the Feenstra (1994) approach is that it decomposes the price index

into two parts. The∏

g∈G

(∏c∈Cg

(pct+1

pct

)ωcgt+1)ωgt+1

(which I call the common goods

term) captures gains from price amongst common products, while the∏

g∈G

(λgt+1

λgt

)ωgt+1σ−1

(which I call the changing goods term) captures gains from new and disappearing

products. If there has been a large relative increase in spending on new products

in period t + 1, the changing goods term will fall, indicating that the price index has

decreased. If however, these new products are highly substitutable with common goods,

the elasticity of substitution σ will be large, and this term will approach 1.

10An exception is the original Feenstra (1994) paper, where he reports sensitivity results for severaldifferent common goods sets.

10

There are two differences in the NCES decomposition compared to the CES one.

First, in the NCES case, both the common goods and the changing goods terms are

calculated within each group before being aggregated. This reflects the fact that, assum-

ing the chosen grouping is sensible, a change in a product should affect the competing

products in the same group more than those in other groups.

Second, the NCES decomposition uses σ instead of γ for the elasticity of substitution

in the changing goods term. Note that σ is likely to be greater than γ because it

reflects substitution within groups of similar products instead of substitution across all

products. Therefore, the changing goods term is less likely to find large gains from

new products or large losses from disappearing products. Taken together, these two

differences tend to dampen movements in the NCES index relative to those in the CES

index.

Unfortunately, this method does not decompose the distribution of gains into those

due to price, quality, and variety. The common goods term calculates the gain from

price holding quality and variety constant in an artificial set of goods. Because the

common goods set excludes products in time t that did not appear in time t + 1 and

products in time t+ 1 that did not appear in time t, this set is not equal to the actual

choice set in either period.

Furthermore, the changing goods term embeds price, quality, and variety effects,

making it difficult to distinguish them. A new product, for example, may have a large

expenditure share because it is cheap or because it is of high quality.

3 Logit-Based Frameworks

In measuring the gains from trade in differentiated products, the trade literature has

struggled with two issues: (1) how to allow for realistic substitution patterns and (2)

how to estimate price indices without product-level data. The industrial organization

literature on differentiated products has faced these same challenges. In tackling the

substitution problem, industrial organization has turned to two strategies, the first be-

ing nesting products, the second being random coefficients. These methods can be com-

bined in a unified framework that I call the “nested random coefficients logit” (NRCL).

In addressing the data aggregation problem, industrial organization has bypassed this

issue by focusing on sectors for which product-level data is available. Because these

trade and industrial organization approaches are aimed at solving the same basic prob-

11

lems, comparing them highlights the costs and benefits of each.

At first glance this comparison appears difficult because of the stark differences in

the standard trade and industrial organization modeling frameworks. The empirical

industrial organization literature favors demand systems based on the MNL discrete

choice setup. Unlike in the CES framework, here there is a population of heterogeneous

consumers, each with unique preferences. Each buyer only purchases a quantity of one

product instead of consuming some of every product.

Nevertheless, Anderson et al. (1992) show that the MNL model produces the same

market demand system and price index as the CES. Building upon this insight, I show

that the NCES and NRCL are also tightly linked. In fact, I find that a special case of the

NRCL model, the NL, generates the exact same price index as the NCES. This result

allows me to separate the comparison based on structural theory differences (which arise

between the MNL, NL, and NRCL) from that based on data aggregation differences

(which arise between the CES and MNL or the NCES and NL).

3.1 The Consumers’ Problem

Assume that there are different types of consumers, with each type indexed by r.

Further assume, as in the NCES model, that goods are separated into groups indexed

by g ∈ G. In addition, each good within a group g is indexed by j in the set Jrgt.11 The

utility for consumer i of type r buying good j in group g is

urijt = ln(arjtmrijt) + ζrigt + εrijt. (15)

Here arjt is a good-specific measure of quality similar to bjt. The mrijt is the quantity of

good j that consumer i chooses to buy.12

Meanwhile the ζrigt is a random draw from a logit distribution with scale parameter

µr1, and the εrijt is a random draw from a logit distribution with scale parameter µr2.13

11The set of goods available Jrgt may vary by consumer, meaning that some goods can be consumed inzero quantities by all consumers of a certain type. For example, a home office buyer would not considerpurchasing a large enterprize printer, so it should not be in that consumer’s choice set. Another wayto think of this is to assume that the utility from goods outside of Jrgt is zero.

12In most applications of logit models, a consumer can only buy discrete units (usually one unit) ofa good. However, I change this assumption in order to match the CES model, where consumers canbuy a continuous amount of a good.

13A random variable x is distributed logit if it has a cumulative distribution function of

exp[− exp

(− xµ + %

)]where µ is the scale parameter and % is Euler’s constant (≈ 0.577). This is

often referred to as a “Type I Extreme Value” distribution.

12

Thus, each consumer has a series of independently and identically distributed (iid)

random draws, one for each product j ∈ Jrgt and one for each group g ∈ G.

This utility specification combines two common industrial organization methods for

dealing with the IIA problem. First, products are divided into groups, which allows the

substitution between goods in the same group to differ from that for goods in separate

groups. This method is known as nesting, as in the NCES model. Second, there are

some parameters (arjt, µr1, and µr2) that vary according to a probability distribution

across consumers. As a result, aggregate substitution patterns depend on the mix

of substitution responses found in the population. This method is known as random

coefficients.

Each time period consumer i’s problem is to maximize current period utility subject

to a budget constraint. As with the NCES framework, this is an entirely static problem,

with no borrowing or saving. The budget constraint is given by pjtmrijt = yr where yr

is the consumer’s income. Substituting this constraint into the utility function gives an

indirect utility of

vrijt = ln(arjt)− ln(pjt) + ln(yr) + ζrigt + εrijt. (16)

As in the NCES model, the consumers’ problem can be tackled in steps, starting

with the demand for goods conditional on being within a certain product group. When

focusing on one group, the ζrigt term drops out, reducing the choice problem to

max{

ln(ar1t)− ln(p1t) + εri1t, . . . , ln(arJrgtt)− ln(pJrgtt) + εriJrgtt

}, (17)

where, in a slight abuse of notation, I have used Jrgt to refer to both the goods set and

its cardinality. Integrating over the logit random shocks gives,

probrjt|g =arjt

1µr2 p

−1µr2jt∑

j∈Jrgtarjt

1µr2 p

−1µr2jt

,

which is the conditional probability that any type r consumer will choose good j.14

Turning to the choice of which product group to buy from, the consumer chooses

14This expression follows from the form of the logit distribution. Specifically, when given theproblem max{d1 + ε1, . . . , dJ + εJ}, the probability that option j will be the maximum is given by

exp(dj/µ)/∑Jj=1 exp(dj/µ). Here the εj are iid logit random variables with scale parameter µ.

13

the group with the maximum expected indirect utility,

max

µr2 ln

∑j∈Jr1t

arjt1µr2 p

−1µr2jt

+ ζri1t, . . . , µr2 ln

∑j∈JrGt

arjt1µr2 p

−1µr2jt

+ ζriGt

. (18)

Again I have used G to refer both to the set and to its cardinality.15 Maximization

results in a group probability of

probrgt =

(∑j∈Jrgt

arjt1µr2 p

−1µr2jt

)µr2µr1

∑g∈G

(∑j∈Jrt

arjt1µr2 p

−1µr2jt

)µr2µr1

.

I now derive the connection between this model and the CES-based frameworks.

This is an extension of the proof in Anderson et al. (1992), which shows the relationship

between the CES and MNL. Let arjt1µr2 = brjt,

−1µr2

= 1−σr, and 1µr1

= γr−1. Then convert

probrjt|g and probrgt to (expected) expenditure shares by multiplying and dividing by the

consumer’s income. The resulting expenditure shares are

srjt|g =brjtp

1−σrjt∑

j∈Jrgtbrjtp

1−σrjt

(19)

and

srgt =

(∑j∈Jrgt

brjtp1−σrjt

) 1−γr1−σr

∑g∈G

(∑j∈Jrgt

brjtp1−σrjt

) 1−γr1−σr

. (20)

Multiplying these two shares gives

srjt =brjtp

1−σrjt(∑

j∈Jrgtbrjtp

1−σrjt

) γr−σr1−σr ∑

g∈G

(∑j∈Jrgt

brjtp1−σrjt

) 1−γr1−σr

. (21)

Note that if all types of consumers have identical preferences, meaning that brjt = bjt,

σr = σ, and γr = γ for all r, these formulas collapse down to those in the NCES model.

15Expression (18) follows from the form of the expected maximum of a series of iid logit random

variables. That is, E[max{d1 + ε1, . . . , dJ + εJ}] = µ ln(∑Jj=1 exp(dj/µ)), where the εj are iid logit

random variables with scale parameter µ.

14

In industrial organization terminology, this case is known as the NL model. Therefore,

the nested method is the same in both trade and industrial organization. If additionally

σ = γ, the model collapses to the basic MNL framework. In that case the expenditure

shares are identical to those in the CES model. Therefore, the difference between the

NCES and the NL or between the CES and the MNL is due to the empirical techniques

usually chosen to estimate them as opposed to differences in the underlying theory.

The market-level share is found by integrating srjt across the distribution of consumer

types. For example, if the distribution is discrete the share is then

sjt =∑r∈R

f rt srjt, (22)

where f rt is the fraction of expenditure accounted for by type r consumers in time t,

and R is the set of all consumer types.16

3.2 Addressing the IIA Problem

The NRCL tackles the IIA problem in two ways. First, this model takes the nesting

approach just as in the NCES framework. Second, this model averages across the

heterogeneous preferences of different types of consumers. The latter method breaks

the IIA property between products in the same nest.

The effect of these two approaches can be seen in the cross-price elasticities, which

have the following form:

∂mjt

∂pkt

pktmjt

=

1sjt

∑r∈R f

rt [(γr − 1)srjts

rkt + (σr − γr)srjtsrkt|g] if j and k are

in the same group1sjt

∑r∈R f

rt (γr − 1)srjts

rkt otherwise

(23)

for all j 6= k. One would expect σr > γr for all r ∈ R because the nests gather together

similar products. Therefore, just as in the NCES model, the (σr − γr) term should

increase the cross-price elasticity between goods in the same groups.

16I focus on the discrete distribution case because this allows me to express the NRCL results asweighted averages of the NL formulas. This makes the relationship between the NRCL and the othermodels particularly transparent. The discrete specification is familiar in the marketing literature (seeKamakura and Russell (1989) for example), while the specification using a normal distribution has beenpopularized by Berry, Levinsohn, and Pakes (1995). The normal distribution does not give closed-formexpressions for market demand (because the gaussian integral must be computed numerically), whichmakes it cumbersome for my purposes here.

15

The random coefficients have an added effect. If, for example, type a consumers

have high preferences for laser printers, and both goods j and k are laser printers,

the expenditure shares for type a consumers will be large. In turn, this will place

more weight on type a’s elasticities of substitution, γa and σa, in the aggregate cross-

price elasticity. Similarly, those types that dislike goods j and k will tend to have less

influence on the cross-price elasticity because their expenditure shares are smaller.

3.3 The Price Index

Although the basic logic of the price index remains the same as in the NCES, in this

model one needs to integrate welfare across consumer types. In the case of a discrete

distribution for consumer types, this aggregation weights each type’s utility by their

share in expenditure in time t+1. Then imagine taking away the set of goods available

to these consumers in period t+ 1 and replacing it with the set from time t. The price

index is the factor by which the prices on the time t goods would have to fall to equalize

the weighted sum of indirect utilities.

The expected indirect utility for a consumer of type r can be calculated by finding

the expected maximum of the choice problem in equation (18). The condition for the

price index is then

∑r∈R

f rt+1

γr − 1ln

∑g∈G

∑j∈Jrgt

brjt(τNRCLt+1 pjt)

1−σryσr−1

1−γr1−σr

=∑r∈R

f rt+1

γr − 1ln

∑g∈G

∑j∈Jrgt+1

brjt+1p1−σrjt+1 y

σr−1

1−γr1−σr

.

Solving for the price index itself gives

τNRCLt+1 =∏r∈R

(∑

g∈G

(∑j∈Jrgt+1

brjt+1p1−σrjt+1

) 1−γr1−σr

) 11−γr

(∑g∈G

(∑j∈Jrgt

brjtp1−σrjt

) 1−γr1−σr

) 11−γr

frt+1

. (24)

Note that this expression is the geometric average of individual NL (or NCES) price

16

indices, τ rNLt+1 , for each type of consumer, where

τ rNLt+1 =

(∑g∈G

(∑j∈Jrgt+1

brjt+1p1−σrjt+1

) 1−γr1−σr

) 11−γr

(∑g∈G

(∑j∈Jrgt

brjtp1−σrjt

) 1−γr1−σr

) 11−γr

. (25)

In turn, as in the NCES model, each τ rNLt+1 is a geometric average of individual MNL

(or CES) price indices,

τ rNLt+1 =∏g∈G

(τ rMNLgt+1 )ω

rgt+1 (26)

where

τ rMNLgt+1 =

(∑j∈Jrgt+1

brjt+1p1−σrjt+1

) 11−σr

(∑j∈Jrgt

brjtp1−σrjt

) 11−σr

and

ωrgt+1 =(srgt+1 − srgt/(ln(srgt+1)− ln(srgt))∑g∈G(srgt+1 − srgt)/(ln(srgt+1)− ln(srgt))

.

3.4 Estimation

Given the strong theoretical similarities between the trade and industrial organiza-

tion methods, much of the difference between these literatures stems from the use of

different empirical techniques. Industrial organization usually focuses on industries for

which detailed product-level data, including information on product characteristics, are

available. As a result, quality parameters for each product can be estimated and the

“common goods assumption” can be dispensed with.

Choose one good that appears in every time period to be the “outside good.” Assign

this product the zero index and assume that br0t = 1 for all consumer types and time

periods.17 This gives

sr0t =p1−γr

0t∑g∈G

(∑j∈Jrgt

brjtp1−σrjt

) 1−γr1−σr

.

Then take logs of this equation and subtract it from the log of srjt in equation (21).

17This can be thought of as re-scaling the qualities of all other goods to be in units relative to thequality of good 0.

17

After some minor algebraic manipulations following Berry (1994), this results in

ln(srjt)− ln(sr0t) =γr − 1

σr − 1ln(brjt)− (γr − 1)[ln(pjt)− ln(p0t)] +

σr − γr

σr − 1ln(srjt|g). (27)

This equation is the basis for estimating the model.

Standard product-level data sets will report expenditure shares and prices. The

only question is how to capture quality in equation (27). This is where product charac-

teristics information is useful. In industries commonly studied in empirical industrial

organization, such as automobiles and electronics, product characteristics are a rea-

sonable proxy for quality. In the case of printers, characteristics such as print speed,

color capability, or paper capacity are easily observed and closely related to quality.

Therefore, the standard practice has been to use data on such features to parameterize

quality in the demand model.

Collect the characteristics for each printer j in a vector denoted by xjt. Then assume

the following:γr − 1

σr − 1ln(brjt) = (xjt − x0t)β

r + erjt. (28)

The βr is a vector of parameters to be estimated and erjt is an error term that allows

for quality unobserved by the econometrician. This gives

ln(srjt)− ln(sr0t) = (xjt−x0t)βr− (γr−1)[ln(pjt)− ln(p0t)]+

σr − γr

σr − 1ln(srjt|g)+erjt. (29)

Equation (29) is an estimating equation that can be run on product-level purchase data

that is categorized by consumer types. Once the parameters have been estimated for

each consumer type, one can calculate the price index by plugging directly into equation

(24), bypassing the common goods assumption entirely. Note that if βr = β, γr = γ,

and σr = σ for all r ∈ R, this equation then reduces to the NL model. If in addition

the ln(srjt|g) term is dropped, this equation then reduces to the MNL model.

Estimation of equation (29) could proceed by ordinary least squares, but this is not

the usual practice. One would expect the coefficient on price to have a positive bias

(making it smaller in absolute value) because printer vendors will tend to set higher

prices for models that have high unobserved quality. In addition, it is likely that ln(sjt|g)

is endogenous, as increased unobserved quality can drive higher within group sales. This

would also induce a positive bias on the ln(sjt|g) coefficient. Therefore, instruments are

18

found for both the log price and the log share variables.18

One could take a different approach if data broken up by consumer type is un-

available. That is, one could specify a distribution for consumer types, be it discrete

or continuous, and estimate its parameters by matching the market-level expenditure

shares (as in equation (22)) with those observed in the data. Because I have access

to data by consumer type for the computer printers example, I have elected to avoid

wading into these econometric intricacies.19 Introducing this level of complexity into

the model would make the estimation routine less transparent and hamper comparisons

between the NRCL and other logit models.20

3.5 Price Index Decompositions

Another advantage of the product-level estimation approach is that it allows for the

decomposition of changes in the price index into their price, quality, and variety com-

ponents. That is, here one can step beyond the “common goods” and “changing goods”

breakdown by incorporating the estimated qualities into the Feenstra (1994) methodol-

ogy. The resulting decomposition is useful in exploring the mechanisms at work behind

movements in the overall price index.

Imagine that there are Jrg goods in consumer type r’s group g choice set in period

t. Order these goods by some metric such as increasing quality, and then assign each

an index 1, . . . , Jrg .21 Next choose a size Jrg group of goods from time period t+ 1 that

is representative of the t + 1 distribution of price and quality.22 Order these goods

similarly, and again assign each an index 1, . . . , Jrg .23 Because this set is scaled to be of

size Jrg , it captures the price and quality distribution present in t+ 1 but holds variety

18Treating other product characteristics besides price as endogenous is much less common becausefinding an instrument that is uncorrelated with unobserved quality but correlated with observed qualityis difficult.

19For certain distributions, this procedure can be computationally intensive. In the case of thenormal distribution, integrating up to the market-level shares has to be done numerically, and thenthe parameters have to be fitted using a non-linear search. See Knittel and Metaxoglou (2008) andDube et al. (2009) for discussions on the challenges involved in this type of estimation.

20Readers who are interested in these other estimation methods should consult Berry, Levinsohn,and Pakes (1995), Nevo (2000), and Train (2009).

21In theory, the ordering of these goods does not matter, so long as one keeps track of which priceand quality goes with which good. However, the resulting decomposition can be sensitive to whichgoods are matched in periods t and t+ 1, so it is best to establish a consistent procedure.

22I discuss one method for choosing such a set in Section 4.23This procedure assumes that there are at least as many products available in time t + 1 as in t.

Otherwise, the size of the set Jrg would be defined as the number of goods in time t+ 1 and the priceindex would be interpreted as measuring the change relative to the set Jrgt+1 instead of relative to Jrgt.

19

constant at the period t level. Repeat this process for each g ∈ G and r ∈ R.

Then the price index decomposition is

τNRCLt+1 =∏r∈R

∏g∈G

Jrg∏j=1

(pjt+1

pjt

)ωrjgt+1Jrg∏j=1

(brjtbrjt+1

)ωrjgt+1σ−1

(λJrgt+1

λJrgt

) 1σ−1

ωrgt+1

frt+1

.

(30)

where

λJrgt =

∑Jrgj=1 pjtm

rjt∑

j∈Jrgtpjtmr

jt

, ωrjgt+1 =(srjt+1(Jrg )− srjt(Jrg ))/(ln(srjt+1(Jrg ))− ln(srjt(J

rg )))∑Jrg

j=1(srjt+1(Jrg )− srjt(Jrg ))/(ln(srjt+1(Jrg ))− ln(srjt(Jrg )))

.

and

ωrgt+1 =(srgt+1 − srgt/(ln(srgt+1)− ln(srgt))∑g∈G(srgt+1 − srgt)/(ln(srgt+1)− ln(srgt))

.

Here srjt(Jrg ) is the share of expenditure by type r consumers accounted for by good

j out of the goods indexed by 1, . . . , Jrg . Equation (30) has three components. The

first part is a geometric average of price ratios, pjt+1/pjt, which captures the changes

in price in the set {1, . . . , Jrg}. The second part is a geometric average of quality ratios,

brjt/brjt+1, which captures changes in quality in the set {1, . . . , Jrg}. The third part is an

expenditure share adjustment, which reflects how much expenditure has shifted to the

greater number of goods that are available outside of the set {1, . . . , Jrg}. This term

captures variety.

It is important to note how this full decomposition compares with the MNL and NL

special cases. In the case of the NL model,

τNLt+1 =∏g∈G

(Jg∏j=1

(pjt+1

pjt

)ωjgt+1

)ωgt+1 ∏g∈G

(Jg∏j=1

(bjtbjt+1

)ωjgt+1σ−1

)ωgt+1 ∏g∈G

(λJgt+1

λJgt

)ωgt+1σ−1

.

(31)

where

λJgt =

∑Jgj=1 pjtmjt∑j∈Jgt pjtmjt

and

ωjgt+1 =(sjt+1(Jg)− sjt(Jg))/(ln(sjt+1(Jg))− ln(sjt(Jg)))∑Jgj=1(sjt+1(Jg)− sjt(Jg))/(ln(sjt+1(Jg))− ln(sjt(Jg)))

.

Note that sjt(Jg) is the share of expenditure accounted for by good j amongst expen-

20

diture on the goods 1, . . . , Jg. In the case of the MNL model,

τMNLt+1 =

J∏j=1

(pjt+1

pjt

)ωjt+1 J∏j=1

(bjtbjt+1

)ωjt+1γ−1

(λJt+1

λJt

) 1γ−1

(32)

where

λJt =

∑Jj=1 pjtmjt∑j∈Jt pjtmjt

and ωjt+1 =(sjt+1(J)− sjt(J))/(ln(sjt+1(J))− ln(sjt(J)))∑Jj=1(sjt+1(J)− sjt(J))/(ln(sjt+1(J))− ln(sjt(J)))

.

Here sjt(J) denotes the expenditure share of good j out of the goods 1, . . . , J , in time

t.

The comparison between the NL decomposition and the MNL analog is similar

to the comparison between the CES and NCES decompositions. The price, quality,

and variety terms are calculated first within each group instead of immediately across

all products. This reflects the fact that according to the NL model, changes in all

three forces should have the strongest effect amongst goods that are in the same nest.

Furthermore, the NL uses σ instead of γ in the variety term. Since we expect that

σ > γ, this will tend to lower the variety term relative to that in the MNL.24

In moving from the NL to the NRCL, the random coefficients mean that there are

multiple consumer types that must be averaged over. Therefore the NRCL index allows

the effects of a change in price, quality, or variety to be asymmetric across consumers.

Whether this index is larger or smaller than the NL one depends on how the preferences

of individual consumer types compare to the average preferences of all consumers. The

NRCL may pick up gains (or losses) to minority groups of consumers that wash out in

the market-level data.

4 Empirical Example: Computer Printers

Given the similarities and differences between the models discussed above, it is impor-

tant to see how they compare in practice. To this end, I apply these methods to the

Indian import market for computer printers over the period 1996 to 2005.

24The quality term also includes σ, but this difference actually does not have an effect on therelationship between the NL and MNL indices. This is because the definition of bjt in the estimationequation causes the σ to cancel out.

21

4.1 Market Background

The information technology sector is one of the fastest growing import markets in India.

The quantity imported of computers and associated peripherals as classified in HS 8471

increased from 0.48 percent of the value of imports in 1996 to 1.40 percent in 2005.

Growth was extraordinarily strong in computer peripherals, HS 847160, rising from

0.08 percent of value in 1996 to 0.32 percent in 2005, a roughly 4 times increase in

share.25

Some of this growth has been spurred on by a liberalization of India’s import policies.

In 1997, India signed the WTO’s Information Technology Agreement. By doing so,

India agreed to lower tariffs on printers from 20 percent ad valorem to 0 percent by

2005. This goal was successfully achieved in the middle of 2005.

There are few local printer producers in India, and those firms that do operate almost

exclusively make dot matrix machines. The Department of Scientific and Industrial

Research, a government agency tasked with promoting technology development in India,

released a report in 1996 on the state of the Indian printer sector. They concluded that

India was unlikely to expand into laser printer manufacturing, even with the help of

foreign direct investment (DSIR (1996)). The report pointed to small local demand

and a poor technology infrastructure as major hurdles. This situation has only slightly

improved today. The firm WeP Peripherals announced the first Indian laser printer

factory in 2003, but their line remains small. No other Indian firm has established a

plant.

Instead of sourcing printers locally, most of the market is served by imports from a

number of multinational brands (such as Canon, Epson, HP, or Xerox). These foreign

companies prefer to do their manufacturing in China and Southeast Asia and then ship

into India. I do not know of any multinational brand or electronics outsourcing firm

which has a printer manufacturing facility in India.26 India, although a growing market,

is still too small to warrant major horizontal foreign direct investment.

Therefore, the Indian computer printer sector is a dynamic differentiated products

market that has seen a lot of growth fueled by imports. This makes it an ideal candidate

in which to study the effects on consumers of changes in differentiated goods.

25These numbers are computed using the UN Comtrade database.26HP has had plants in India since the late 1990s, but they mostly produce computers. Xerox has a

facility in Rampur, but it makes single-function copiers. There are some plants that make components.

22

4.2 Data Description

My data is based upon an extract from the IDC Hardcopy Peripherals Database (IDC

(2008)). This data set tracks sales of individual printer models, listing their name,

quantity sold, and average price for every quarter from the beginning of 1996 to the

second quarter of 2006.27 Average price includes the purchase price and shipping costs,

but not taxes. I have converted these prices into real figures using the Indian consumer

price index.28 Only new sales are reported, not sales of used or refurbished models.

IDC, a market research firm that focuses on technology products, collects this infor-

mation from retailers, distributors, and online vendors. They claim to track all models

of A2 through A4 size laser and inkjet printers.29 Some observations aggregate a base

configuration and other optional configurations into one model. However, this is not

a major occurrence amongst the machines offered in India because they are mostly

low-end models with few extra options.

I define a printer as a device that can print output from a computer, excluding

portable machines meant for travel. I focus on two technologies: inkjet and laser. These

machines may perform other functions, such as copying or scanning (“multi-function

peripherals” or MFPs). I exclude printers that use impact technologies, which are based

on older typewriter-like designs.30 The IDC data has limited coverage of impact models

and does not report purchases of these categorized by consumer types for most of the

models that are included. Furthermore, I am not able to collect characteristics data for

all of these printers because many were sold by small local firms that do not have their

back catalogs available in print or online. Regardless, laser and inkjet models account

for the vast majority of sales, particularly amongst imported models.31 It is also in

the laser and inkjet categories where most of the improvements in terms of quality and

variety have appeared, as impact is a dying technology. Note also that I limit my data

to non-Indian brands.

27I also have data through the first quarter of 2008, but these observations have to be droppedbecause there is no product that appears in all of these years to serve as the outside good in the logitmodels.

28The CPI is from http://labourbureau.nic.in/ and was downloaded in October 2009.29IDC does aggregate some models into an “other” category, but this never accounts for more than

2.4 percent of sales revenues in any quarter. I discard this category because it is not clear exactlyhow it is constructed. Prior to 2001, MFPs are not recorded in the database. However, when MFPtracking begins in 2001 Q1, these machines only comprise 4.5 percent of sales revenue, so it is unlikelythat they formed a large part of the market in the prior period.

30The main impact technology is dot matrix.31When the Indian customs authority began reporting printer import quantities by inkjet and laser

versus dot matrix in the spring of 2003, inkjet and laser accounted for over 90 percent.

23

Importantly for my purposes, IDC also separates the units sold of each inkjet and

laser model into those purchased by different subsets of consumers. These divisions

are home office, 1 to 9 employee establishments, 10 to 99 employee establishments,

100 to 499 employee establishments, and 500 or more employee establishments. Note

that home office buyers may include family businesses. In order to form the set R

of consumer types for the NRCL model, I aggregate this data into two categories:

home office or 1 to 9 employee establishments (which I call “small” consumers) and

10 or more employee establishments (which I call “large” consumers). I choose to

use only two consumer types in order to keep the comparisons between the random

and non-random coefficients specifications simple. One would expect that this group

of home office and small firms would exhibit distinct buying patterns relative to the

average behavior in the data. Although larger establishments would consider using a

department-sized laser printer, for example, most small buyers would not because of

the set-up costs, technological expertise, and physical space required. Those costs are

not justified for a small establishment that will not print high volumes.

In order to accurately measure the quality of these printers, I need to know some-

thing about their characteristics. IDC provides some basic information, categorizing

models into different bins based on technology (laser, inkjet), function (single, MFP),

color versus monotone printing, and page per minute (PPM) speed (1-10 PPM, 11-20

PPM, etc.). In order to enrich my analysis, I supplement this data with characteristics

collected from manufacturer’s websites and from printer specification sheets published

by the firm Buyers Laboratory.32

I can not find data for all the models reported in the IDC dataset, meaning that

some observations are dropped. After these exclusions, I am left with data for about

96 percent of sales revenue and 97 percent of laser and inkjet units sold in the original

IDC data set. This data cover 1198 unique models. Summary statistics are presented

in Table 1.

There is an important omitted variable from the characteristics listed in Table 1. In

particular, I do not have information on the maintenance costs of each printer model.

These are largely due to printer cartridges (though they also include other factors

like paper and electricity). However, upon examining the industry literature, I have

32If one or two characteristics for a model can not be found, they are imputed from similar modelsof the same brand, or, if those are not available, from similar models across all brands in a givenquarter. This affects 10 percent of observations in the final sample. I convert printing speeds listed incharacters per second to PPM by assuming a rate of 4000 characters per page.

24

found that few estimates of these costs are available. The statistics that do exist

indicate that running costs vary strongly with the technology of the printer (laser or

inkjet) and with whether or not the machine prints in color. Therefore, in estimating

the demand models, I use dummies for these characteristics to proxy for this omitted

variable. Unfortunately, this means that I cannot separately identify tastes for these

technologies from preferences for their maintenance costs.33

I supplement this printer data with information on Indian printer tariffs and ex-

change rates. Tariff data covering 1996 to 2001 are from Khandelwal and Topalova

(2010) for HS category 847160. I extend this data to 2006 using announcements of

changes to the tariff schedule published by the Indian Central Board of Excise and

Customs.34 I obtain information on the quarterly exchange rate between India and

the US, Japan, South Korean, China, and the European Union from Global Financial

Data.

4.3 Identification

In order to estimate the logit-based models, I need instruments for the log price and

log group share variables. When faced with this situation, industrial organization re-

searchers often struggle to find plausibly exogenous instruments that exhibit enough

time-series and cross-sectional variation to be useful. In my empirical application, I can

leverage the international trade aspect of my data to solve this problem.

As discussed above, the vast majority of sales in the Indian printer market (and all

sales in my data) are accounted for by foreign brands. These brands’ parent companies

are located in the US (such as Xerox and HP), Japan (such as Canon and Ricoh), South

Korea (Samsung), China (Lenovo), and the European Union (Oce). Therefore, any rev-

enues these corporations make by selling printers in India must ultimately be converted

from Indian rupees into their home currency in order to become part of their bottom

lines. As such, the exchange rate between their home currency and the Indian rupee

should affect the prices that are set and in turn affect expenditure shares. However,

given that the buyers of printers in this market are largely small Indian firms that only

33A related concern is that upfront printer prices may be uninformative if printer vendors are pursu-ing a strategy of lowering the prices of printers in order to make money on printer cartridges. Althoughsuch a strategy has been used in the US, it is much less prevalent in India because of the high pene-tration of third party and counterfeit cartridges. IDC estimates that over 50 percent of the cartridgessold in India are made by third parties.

34These announcements are available at http://www.cbec.gov.in/ and were accessed in July 2010.

25

operate domestically, the exchange rate should not affect demand independently.35

Thus, I use these exchange rates to build two instrumental variables. First, I take

the exchange rate for the headquarters currency of each brand. Second, I form the

average exchange rate of each model’s rival products in the same IDC product type.36

For example, imagine that there are three models in a certain category, one from Japan

and two from the US. Then the instrument for the Japanese model would be the US

exchange rate, while the instrument for each of the US models would be the average of

the US exchange rate and the Japanese exchange rate. This second instrument helps

me capture the variation in pricing competition across types of printers.

I also construct a third instrumental variable based on tariffs. As mentioned above,

India pursued a dramatic liberalization in the computer printer sector over the time

period I study, zeroing out most printer tariffs. This fall in taxes was mandated by a

WTO agreement covering a number of information technology products, and hence is

probably unrelated to unobservables in printer demand. Furthermore, tariffs are likely

to be correlated with printer prices and in turn with within group expenditure shares,

while not entering into demand separate from their effect on price.

4.4 Overview of Results

I begin by discussing how I estimate the logit-based models. Recall that I have three

logit variants: the MNL, the NL, and the NRCL. In the interest of simplicity, I choose

to define just two product groups in the nested models, one for inkjet and the other for

laser.

The estimating equations for all three models are combined in equation (29). Sim-

ply drop the consumer type distinction in order to reduce the NRCL to the NL and

further drop the log group share term in order to reduce the NL to the MNL model.

Because there are two types of consumers in the NRCL model, I have three equations to

estimate: one equation without consumer types, one equation for small consumers, and

one equation for large consumers. I merge all of these equations into one by stacking

35There are some drawbacks to this approach. Most prices for printers are probably set initially inIndian rupees, not set in foreign currencies and converted. If there are menu costs, this may mean thatprices are sticky in Indian rupees and hence less sensitive to exchange rate movements. In addition,the exchange rate may be affected by domestic policy controls or general equilibrium effects that are inturn related to local demand factors. Nevertheless, the exchange rate is one of the few variables thatexhibits strong variation across time periods while also having a reasonable probability of satisfyingthe exogeneity requirement.

36See the first column of Table 2 for a list of these categories.

26

the observations for all three and interacting the independent variables with a constant

and with dummies for whether an observation is for small buyers and for whether an

observation is for large buyers. I estimate this equation using both least squares and

instrumental variables methods.

I need to difference the data with respect to one printer model, the outside good.

The natural choice is a dot matrix printer, since that is the most common alternative

to the laser and inkjet models included in my main dataset. As previously mentioned, I

only have data on selected dot matrix models, but there is one candidate that appears

in all the years from 1996 to 2005, the Panasonic KX-P1150. This is the product that

I take as my outside good.37

The parameter estimates appear in Table 3. Although I estimate all the models

stacked into one equation, I separate the results into their three component equations

(MNL/NL, NRCL small, and NRCL large) to make the numbers easier to interpret.

In all regressions, instrumenting appears to remove a positive bias on the log price

and log group share coefficients. The price estimate becomes more negative and the

share coefficient becomes less positive. This result accords with the hypothesis that

the log price and the log share variables are positively correlated with unobserved

quality. Note also that instrumenting tends to raise the resulting estimates of σ and

γ, the elasticities of substitution. The first-stage F statistics are all above 50, and the

overidentification test statistics are not significant at conventional levels for any of the

models.38 In what follows, I use the IV results as my preferred specification.

The coefficients indicate that nearly all characteristics increase quality relative to

that for the Panasonic KX-P1150, which has relatively low characteristics. The only

exceptions is resolution, which may result from the fact that a unless they print photos

regularly, many buyers have little use for ultra-high resolution machines. The latter

point reveals a potential heterogeneity between consumer types that I return to in

discussing random coefficients.

The coefficient on the log group share is always highly significant, indicating that

a nested model is appropriate for this data set. This result is to be expected, because

37I do not have consumer-level data for this model, so I assume that 5 percent of reported salesare made up of small consumers. This estimate is based on IDC sales data for two other dot matrixmodels sold from 1998 to 2003.

38I use both the homoskedastic and heteroskedasticity robust overidentification tests suggested byWooldridge (2002). These F test and overidentification test results hold regardless of whether the threeequations (no consumer types, small consumers, and large consumers) are estimated stacked togetheror separately.

27

there is a natural dichotomy in printers between laser and inkjet models.

As for the random coefficients, F tests on the interactions between dummies for

consumer type and the independent variables indicate that these are jointly significant.

Therefore, the NRCL model is the best fit. Focusing on individual coefficients, I find

that small consumers’ coefficients on the color dummy, laser dummy, and network

interface dummy are significantly lower and the coefficient on resolution is significantly

higher compared to those in the non-random coefficient models. Meanwhile, the large

consumers’ coefficient on resolution is significantly lower and the coefficient on the laser

dummy is significantly higher.39 These results indicate that there is some variation in

preferences that the MNL and NL models mask.

A selection of own- and cross-price elasticities appear in Table 4. Each entry is

the percentage change in quantity sold of the row good in response to a percentage

change in price of the column good. In the top panel, which contains the results for the

MNL model, I find that the cross-price elasticity is identical down each column, which

occurs because of the IIA property. Once I shift to the NL model, these elasticities vary

depending on which group (inkjet or laser) the row good is in. Models that are in the

same group as the column good have much larger elasticities compared to those that

are in different groups. Finally, the NRCL elasticities also vary within groups, due to

the random coefficients.

The results for the 1996 versus 2005 price indices are presented in Table 5. These

annual figures average over the results for the first quarter of 1996 versus the first

quarter of 2005, the second quarter of 1996 versus the second quarter of 2005, and so on.

Following Broda and Weinstein (2006), I construct bootstrapped 95 percent confidence

intervals by sampling 100 times from the estimated joint normal distribution of the

regression parameters in Table 3 and calculating the price index for each sample.

In order to facilitate comparisons between the CES-based and logit-based models,

I set the elasticities of substitution in the CES and NCES calculations to the numbers

estimated for the NL and MNL models (Table 3, second column). Remember that

these parameters are the only ones needed in the CES and NCES calculations. The

other input is the aggregated price and expenditure share data. I form this data by

taking the share-weighted average price and the total expenditure for each product

category/country combination reported in Table 2.

In interpreting the numbers in Table 5, note that the index is calculated between

39These results are based on separate 5 percent t tests for each coefficient.

28

1996 and 2005. Therefore, the index is the factor by which one would have to multiply

the prices of all goods in 1996 in order to give the same welfare as the goods available in

2005. For instance, the 0.061 CES index means that 1996 prices would have to fall by

93.9 percent in order to make consumers as well off as in 2005. All five indices estimate

dramatic falls in the 1996 prices, ranging from about 84 percent to 96 percent. However,

there are some subtle differences. The aggregated indices tend to be higher than their

logit counterparts (compare the CES and MNL or the NCES and NL). Nesting tends to

raise the index (compare the CES and NCES or the MNL and NL). Including random

coefficients appears to slightly lower the index (compare the NL and NRCL).

Figure 1 shows how the 1996 versus 2005 price index developed over the intervening

years. For each year from 1997 to 2005, I calculate the price index with respect to

1996, thus showing how much of the 1996 to 2005 change had occurred by that year.

Broadly speaking, all of the indices for the five models move together. The descent in

the indices over the years is reasonably smooth, except for a large drop between 1997

and 1998. This occurs because 1997 was a year where the price indices exhibited losses

(particularly in the second quarter) when some firms withdrew products.

The decompositions into common and changing goods effects (for the CES models)

or price, quality, and variety effects (for the logit models) appear in Table 6.40 In

constructing the logit-based decompositions, I need to choose a subset of goods from

the 2005 set of products available that is the same size as the set available in 1996.

Keeping with the spirit of the price, quality, and variety breakdown, I build a subset

that approximates the joint distribution of price and quality available in 2005, but

scales it to have the same variety (number of products) as in 1996. In this way, the

price and quality terms in the decomposition will give a good approximation of the

relative changes in price and quality between 1996 and 2005.

For example, suppose that there are X products available in 1996 and Y > X

products available in 2005. I split the 2005 ranges of price and quality into 5 different

percentile bands, giving 25 price/quality bins. I then randomly choose models at a rate

of X/Y from each bin. This procedure gives me a set of 2005 goods that is the same

size as the set of 1996 goods, and then I calculate the decomposition using these sets.

In order to ensure that my findings are not driven by a particular sample, I repeat this

procedure 100 times for different samples, and average across the resulting terms to get

my final numbers.

40The common goods effects do not have confidence intervals because those terms do not use esti-mated parameters. They are constructed using only average price data.

29

In comparing the CES and NCES decompositions, the largest difference comes in

the changing goods term, which rises noticeably in the NCES case. As for the logit

decompositions, the price and quality terms are broadly similar across models, while

the variety term rises markedly in the NL and NRCL models. These differences hint at

the importance of the nested elasticity of substitution, σ.

Therefore, an initial scan of the results suggests some important differences between

the five price indices. I discuss these trends in more detail in the following sections.

4.5 Disaggregated Data: CES/NCES vs. MNL/NL

I begin by examining the effect that using product-level data has on the resulting price

indices by comparing the CES with the MNL and the NCES with the NL. Because the

price index formulas in these models are identical, any actual differences in the results

are due entirely to divergent empirical techniques. I find that the CES and NCES

methods cannot distinguish developments within product categories because of these

models’ reliance on aggregated data. As a consequence, the CES and NCES indices

tend to understate gains (or losses) from changes in products.

This pattern is already somewhat apparent in Figure 1. The CES-based indices

fall steadily while the logit indices bounce around. This difference occurs because the

product-level data allow the logit indices to capture subtle changes in the products

offered that the aggregated data miss.

On net, the MNL and NL indices find a larger gain over 1996 to 2005 when compared

to the CES and NCES indices, respectively. This distinction is statistically significant,

as bootstrapped 95 percent confidence intervals for the difference between the 1996/2005

MNL and CES indices and the 1996/2005 NL and NCES indices do not include zero.41

In terms of economic significance, the MNL index is about 42 percent lower than the

CES index, and the NL index is about 31 percent lower than the NCES index. This re-

sult is similar to that in Blonigen and Soderbery (2010), who find that using aggregated

trade data understates the improvements in the price index from increased variety in

the US automobile market.

Thus, it appears that using aggregated data tends to understate improvements in

the price index. One way to explore this point is to take a closer look at the common

41In order to build these confidence intervals, I sample from the estimated asymptotic distributionof the regression parameters in Table 3 to form 100 simulated parameter sets. Then I calculate theindices for each sample and take the difference between the MNL and CES indices and the NL andNCES indices.

30

goods terms. These terms are the price indices calculated using only goods categories

that appear in both time periods. By assumption, these categories are supposed to have

constant quality and variety. However, because printer firms are constantly tweaking

their offerings, this assumption is unlikely to hold. I can assess the effect this has on

the common goods term by calculating the price index for the common goods using

the product-level data underlying these categories. That is, I apply equation (11)

and equation (10) to the individual printer models that are within the common goods

categories.

I graph the resulting “common goods terms” in Figures 2 and 3. The terms con-

structed using product-level data are lower than those using aggregated data in all years

except 1997. Hence, the product-level data reveal that there were significant increases

in quality and variety (or decreases in the second quarter of 1997) within the common

goods. By assuming these changes away, the standard CES and NCES indices have

missed these movements in the price index.

4.6 Nesting Products: CES/MNL vs. NCES/NL

Another important point of comparison is between the nested and non-nested models.

Nesting allows the elasticity of substitution to be larger within groups (where goods

tend to be more similar) than between groups. As a result, I find that improvements

in the price index tend to be dampened in the nested frameworks.

Comparing the overall 1996/2005 indices in Table 5, the CES is about 62 percent

lower than the NCES and the MNL is about 68 percent lower than the NL. Based on

bootstrapped 95 percent confidence intervals, these differences are statistically signifi-

cant.

Nesting also has a noticable effect on the price index decompositions in Table 6. As

the elasticity of substitution within groups rises between the CES and NCES models, the

changing goods term becomes larger. Indeed, the CES term is about 44 percent lower

than the NCES one, and the bootstrapped 95 percent confidence interval indicates that

the difference is statistically significant. This reflects the fact that new and disappearing

goods have a smaller effect on utility when they are more substitutable with some

existing common goods.

Similarly, the variety terms in Table 6 also rise when nesting is implemented. The

MNL variety term is about 67 percent lower than that for the NL. The bootstrapped

31

95 percent confidence interval indicates that this difference is significant.42 The change

occurs because increasing variety has less of an effect on utility when the elasticity of

substitution between some goods increases due to nesting. Therefore, it is the variety

channel that is most affected by nesting.

Some of the most dramatic results from nesting appear in the cross-price elasticities.

As Table 4 shows, the NL cross-price elasticities between printers in the same product

group are on the order of 10 times larger than those in different groups. Meanwhile

the elasticities are exactly constant across printers in the MNL case. As a result,

improvements in the printers available are not as highly valued in the NL model, since

this framework recognizes that there are some good substitutes already on the market.

4.7 Random Coefficients: NL vs. NRCL

The final comparison is between the NL and the NRCL, which highlights the effect of

random coefficients. These coefficients allow the model to better reflect the spread of

tastes across consumers, which can be obscured in market-level data. Here I find that

small consumers experience larger gains than average due to improvements in printers.

Although the overall NRCL index is similar to the NL, this similarity masks hetero-

geneity between consumer types. See Figure 4, which graphs the indices for small and

large types (according to equation (25)) alongside the overall NRCL and NL indices.

The price index for small consumers is always lower than that for large consumers. How-

ever, large consumers account for about 80 percent of expenditures. As a result, the

NL model tracks the large index more closely, whereas the NRCL is pulled downwards

because it incorporates small buyer preferences.

In turn, the NRCL price index for 1996 versus 2005 is lower than the NL index,

although the effect is statistically insignificant. The statistically significant distinction

comes in examining the price index for small buyers, which is 53 percent lower than the

NL index. The NL only distinguishes average sales patterns, so it does not recognize

that certain new goods may have a greater effect on some buyers compared to others.

In this case the NL is influenced mainly by the behavior of large consumers and fails

to pick up improvements for small consumers. This difference has a small effect on the

overall price indices, but it is important when assessing the distributional consequences

of improvements in differentiated products.

42The differences across these models for the price and quality terms are also significant, thoughthey are much smaller in economic terms.

32

When compared to the results from disaggregating data and nesting products, the

effects of introducing heterogeneity in preference coefficients are less pronounced in the

market-level index. This is related to the nature of the computer printer example,

where products clearly vary a great deal at the model level and where there is an

obvious nesting structure (inkjet versus laser). Heterogeneity in coefficients is a less-

obvious concern. Indeed, Table 4 shows that nesting plays a far more important role

in addressing the IIA problem and as a result has the most dramatic effect on the

price indices. That being said, I do find that using random coefficients allows the

resulting price indices to better reflect the preferences of minority consumer types.

An NRCL specification with a richer distribution of consumer preferences (such as a

normal) may produce even more realistic results, although these would come at a higher

computational cost.

5 Conclusion

Although the approaches that international trade and industrial organization take with

regards to differentiated products appear quite different, I find that the underlying

theories are actually closely related. The CES and the MNL models yield the same

demand system and price index, and this fact greatly facilities the comparison between

a number of common variations on these frameworks. In effect, the NCES, NL, and

NRCL are all just ways of addressing the IIA problem. The only differences come in

how these models are used empirically (using aggregated versus product-level data) and

in how they tackle IIA (using nesting versus using random coefficients).

In the computer printers empirical application, I find that aggregated data meth-

ods understate the gains from differentiated products in the price index. This occurs

because these data mask product-level improvements in the goods available on the

market. Meanwhile, non-nested models exaggerate improvements in the price index be-

cause these frameworks underestimate the substitutability between products. Finally,

incorporating random coefficients improves the model’s ability to match the spread of

preferences in the population.

Although these results apply to this one empirical example, they also highlight two

general lessons that apply to all studies on the effects of changes in differentiated prod-

ucts. First, it is the nature of aggregated data to obscure some movements in quality

and variety at the underlying product level. This caveat must be kept in mind when

33

using the common goods assumption to calculate price indices. Second, the CES and

MNL models fall prey to the IIA problem, which greatly limits the substitution struc-

ture between goods and can distort the gains from differentiated products. It is the

constant search for ways in which to alleviate this problem that has produced innova-

tions in demand modeling, such as nesting products and using random coefficients. One

should be sure to choose a flexible option, given the strictures imposed by the current

research question and the data available.

An interesting avenue for future research would be to examine the ramifications

of other demand models on the price index. Several authors have proposed different

frameworks in order to address issues with the CES and MNL beyond just IIA. Feen-

stra (2009), for example, suggests using a translog specification in order to avoid the

constant markups that obtain in the CES under monopolistic competition. Ackerberg

and Rysman (2005) modify the MNL in order to address the overvaluing of variety

caused by each good having its own logit shock. Gowrisankaran and Rysman (2009)

extend the random coefficients logit so as to deal with durable goods. Exploring ad-

vancements such as these would complement the findings in this paper and provide

further insight into the manner in which both trade and industrial organization should

approach differentiated products.

34

References

Ackerberg, Daniel A. and Marc Rysman, “Unobserved Product Differentiation inDiscrete Choice Models: Estimating Price Elasticities and Welfare Effects,” RANDJournal of Economics, 2005, 36 (4), 771–778. [34]

Anderson, Simon P., Andre de Palma, and Jacques-Francois Thisse, DiscreteChoice Theory of Product Differentiation, MIT Press, 1992. [12, 14]

Berry, Steven, “Estimating Discrete-Choice Models of Product Differentiation,”RAND Journal of Economics, 1994, 25 (2), 242–262. [18]

Berry, Steven, James Levinsohn, and Ariel Pakes, “Automobile Prices in MarketEquilibrium,” Econometrica, 1995, 63 (4), 841–890. [3, 15, 19]

, “Voluntary Export Restraints on Automobiles: Evaluating a Trade Policy,” Amer-ican Economic Review, 1999, 89 (3), 400–430. [4]

, “Differentiated Products Demand Systems from a Combination of Micro and MacroData: The New Car Market,” Journal of Political Economy, 2004, 112 (1), 68–105.[4]

Blonigen, Bruce A. and Anson Soderbery, “Measuring the Benefits of ForeignProduct Variety with an Accurate Variety Set,” Journal of International Economics,2010, 82 (2), 168–180. [4, 10, 30]

Broda, Christian and David E. Weinstein, “Globalization and the Gains fromVariety,” Quarterly Journal of Economics, 2006, 121 (2), 541–585. [3, 10, 28]

Broda, Christian, Joshua Greenfield, and David E. Weinstein, “From Ground-nuts to Globalization: A Structural Estimate of Trade and Growth,” NBER WorkingPaper, September 2006, No. 12512. [3]

Dixit, Avinash K. and Joseph E. Stiglitz, “Monopolistic Competition and Opti-mum Product Diversity,” American Economic Review, 1977, 67 (3), 297–308. [3]

Dornbusch, Rudiger, Stanley Fischer, and Paul A. Samuelson, “ComparativeAdvantage, Trade, and Payments in a Ricardian Model with a Continuum of Goods,”American Economic Review, 1977, 67 (5), 823–839. [6]

DSIR, “Technology in Indian Laser Printers Industry,” Technology Status Report, Au-gust 1996. [22]

Dube, Jean-Pierre, Jeremy T. Fox, and Che-Lin Su, “Improving the Numeri-cal Performance of BLP Static and Dynamic Discrete Choice Random CoefficientsDemand Estimation,” mimeo, May 2009. [19]

35

Eaton, Jonathan and Samuel Kortum, “Technology, Geography, and Trade,”Econometrica, 2002, 70 (5), 1741–1779. [6]

Feenstra, Robert C., “New Product Varieties and the Measurement of InternationalPrices,” American Economic Review, 1994, 84 (1), 157–177. [1, 3, 4, 8, 9, 10, 19]

, “Measuring the Gains from Trade under Monopolistic Competition,” mimeo, June2009. [34]

Goldberg, Pinelopi Koujianou, “Product Differentiation and Oligopoly in Interna-tional Markets: The Case of the US Automobile Industry,” Econometrica, 1995, 63(4), 891–951. [4]

Goldberg, Pinelopi Koujianou, Amit Khandelwal, Nina Pavcnik, and PetiaTopalova, “Imported Intermediate Inputs and Domestic Product Growth: Evidencefrom India,” Quarterly Journal of Economics, 2010, 125 (4), 1727–1767. [3]

Gowrisankaran, Gautam and Marc Rysman, “Dynamics of Consumer Demandfor New Durable Goods,” mimeo, February 2009. [34]

Hausman, Jerry A., “Valuation of New Goods Under Perfect and Imperfect Com-petition,” in Timothy F. Bresnahan and Robert J. Gordon, eds., The Economics ofNew Goods, Chicago: University of Chicago Press, 1997. [7]

Helpman, Elhanan and Paul R. Krugman, Market Structure and Foreign Trade,MIT Press, 1985. [6]

IDC, “Worldwide Quarterly Hardcopy Peripherals Tracker,” electronic resource, ac-cessed April 2008. [23]

Kamakura, Wagner A. and Gary J. Russell, “A Probabilistic Choice Model forMarket Segmentation and Elasticity Structure,” Journal of Marketing Research, 1989,26 (4), 379–390. [15]

Khandelwal, Amit, “The Long and Short (of) Quality Ladders,” Review of EconomicStudies, 2010, 77 (4), 1450–1476. [4]

Khandelwal, Amit and Petia Topalova, “Trade Liberalization and Firm Produc-tivity: The Case of India,” forthcoming in The Review of Economics and Statistics,January 2010. [25]

Knittel, Christopher R. and Konstantinos Metaxoglou, “Estimation of Ran-dom Coefficient Demand Models: Challenges, Difficulties, and Warnings,” mimeo,October 2008. [19]

Krugman, Paul R., “Increasing Returns, Monopolistic Competition, and Interna-tional Trade,” Journal of International Economics, 1979, 9 (4), 469–479. [1]

36

McFadden, Daniel, “Conditional Logit Analysis of Qualitative Choice Behavior,” inPaul Zarembka, ed., Frontiers in Econometrics, New York: Academic Press, 1974.[3]

Melitz, Marc J., “The Impact of Trade on Intra-Industry Reallocations and AggregateIndustry Productivity,” Econometrica, 2003, 71 (6), 1695–1725. [6]

Nevo, Aviv, “A Practioner’s Guide to Estimation of Random-Coefficients Logit Mod-els of Demand,” Journal of Economics and Management Strategy, 2000, 9 (4), 513–548. [19]

Petrin, Amil, “Quantifying the Benefits of New Products: The Case of the Minivan,”Journal of Political Economy, 2002, 110 (4), 705–729. [4]

Spence, Michael, “Product Differentiation and Welfare,” American Economic Re-view: Papers and Proceedings, 1976, 66 (2), 407–414. [3]

Train, Kenneth, Discrete Choice Methods with Simulation, Cambridge UniversityPress, 2009. [3, 19]

Wooldridge, Jeffrey M., Econometric Analysis of Cross Section and Panel Data,MIT Press, 2002. [27]

37

Figure 1: All Price Indices, 1996-2005

Notes: These indices are calculated at the quarterly level (comparing the first quarter of 2005 to the first quarter of1996, for example), and then averaged over all four quarters. The index for each year takes 1996 as the base year.

38

Figure 2: CES Common Goods Terms, 1996-2005


Figure 3: NCES Common Goods Terms, 1996-2005


39

Figure 4: NL and NRCL Indices, 1996-2005


40

Table 1: Summary StatisticsVariable Mean Standard DeviationPrice (USD) 604.996 1084.060Units Sold 1059.538 4763.041Color Dummy 0.468 0.499BW PPM Speed 20.806 13.430RAM (MB) 49.764 98.272Resolution (DPI) 1336.798 771.611A3 Capable Dummy 0.355 0.478Footprint (in2) 416.175 381.911Ethernet Interface Dummy 0.343 0.475MFP Dummy 0.356 0.479Laser Dummy 0.663 0.473Number of Model-Quarters 6413Number of Unique Models 1189Notes: Data sources are in the text, Section 4. Price is in real 2001 IndianRs, then converted to USD at 1 Rs=47.12 USD. “BW PPM Speed” is themaximum number of pages per minute that can be printed in black and white.

41

Table 2: Product CategoriesProduct Type Japan US Korea EUMFP Color Inkjet 1-10 PPM X X XMFP Color Inkjet 11-20 PPM X X XMFP Color Inkjet 21 PPM or more X XMFP Color Laser 1-10 PPM X XMFP Color Laser 11-20 PPM X XMFP Color Laser 21-30 PPM X XMFP Color Laser 31-44 PPM X X XMFP Mono Inkjet All Speeds XMFP Mono Laser 1-20 PPM X X XMFP Mono Laser 21-30 PPM X X XMFP Mono Laser 31-44 PPM X XMFP Mono Laser 45-69 PPM X X XMFP Mono Laser 70-90 PPM X XPrinter Color Inkjet 1-10 PPM X XPrinter Color Inkjet 11-20 PPM X XPrinter Color Inkjet 21 PPM or more X XPrinter Color Laser 1-10 PPM X X XPrinter Color Laser 11-20 PPM X XPrinter Color Laser 21-30 PPM X XPrinter Color Laser 31-44 PPM XPrinter Mono Inkjet All Speeds X XPrinter Mono Laser 1-20 PPM X X XPrinter Mono Laser 21-30 PPM X X XPrinter Mono Laser 31-44 PPM X XPrinter Mono Laser 45-69 PPM X XPrinter Mono Laser 70-90 PPM XNotes: Product types are from the IDC taxonomy. “PPM” stands for pagesper minute.

42

Tab

le3:

Log

itR

egre

ssio

nR

esult

sN

oT

yp

esS

mall

Larg

eV

aria

ble

OL

SC

oeffi

cien

tIV

Coeffi

cien

tO

LS

Coeffi

cien

tIV

Coeffi

cien

tO

LS

Coeffi

cien

tIV

Coeffi

cien

tL

n(P

rice

)-0

.594

***

-1.5

21***

-0.6

55***

-1.4

20***

-0.5

41***

-1.4

80***

(0.0

19)

(0.0

68)

(0.0

30)

(0.0

85)

(0.0

19)

(0.0

72)

Ln

(Gro

up

Sh

are)

0.87

9***

0.8

10***

0.8

54***

0.7

38***

0.8

87***

0.8

30***

(0.0

06)

(0.0

35)

(0.0

09)

(0.0

48)

(0.0

06)

(0.0

38)

Col

orD

um

my

0.46

5***

1.4

13***

0.6

53***

1.1

53***

0.4

46***

1.4

13***

(0.0

44)

(0.0

88)

(0.0

97)

(0.1

31)

(0.0

44)

(0.0

95)

BW

PP

MS

pee

d0.

286*

**0.5

02***

0.4

22***

0.4

54***

0.2

76***

0.5

06***

(0.0

14)

(0.0

31)

(0.0

41)

(0.0

51)

(0.0

14)

(0.0

33)

RA

M0.

170*

**0.2

32***

0.1

62***

0.2

80***

0.1

77***

0.2

37***

(0.0

15)

(0.0

18)

(0.0

30)

(0.0

36)

(0.0

16)

(0.0

19)

Res

olu

tion

2.02

5***

-0.5

84**

2.8

40***

0.4

69

1.3

45***

-1.3

10***

(0.1

77)

(0.2

39)

(0.3

46)

(0.4

25)

(0.1

84)

(0.2

56)

A3

Du

mm

y0.

506*

**1.4

62***

0.6

84***

1.4

10***

0.4

30***

1.4

14***

(0.0

33)

(0.0

75)

(0.0

56)

(0.1

29)

(0.0

34)

(0.0

76)

Foot

pri

nt

0.08

90**

0.4

52***

0.1

19

0.7

17***

0.0

78*

0.4

36***

(0.0

40)

(0.0

76)

(0.0

91)

(0.2

50)

(0.0

40)

(0.0

77)

Eth

ern

etD

um

my

0.08

98**

*0.4

84***

-0.0

94

0.3

16***

0.0

75**

0.4

66***

(0.0

31)

(0.0

43)

(0.0

61)

(0.0

83)

(0.0

33)

(0.0

43)

MF

PD

um

my

0.65

8***

0.8

52***

0.6

87***

0.8

53***

0.6

39***

0.8

07***

(0.0

26)

(0.0

39)

(0.0

42)

(0.0

56)

(0.0

28)

(0.0

45)

Las

erD

um

my

1.56

3***

3.1

69***

0.2

53**

1.4

14***

2.1

52***

3.7

83***

(0.0

53)

(0.1

36)

(0.1

02)

(0.1

52)

(0.0

54)

(0.1

56)

Imp

lied

γ1.

594*

**2.5

21***

1.6

55***

2.4

20***

1.5

41***

2.4

80***

(0.0

19)

(0.0

68)

(0.0

30)

(0.0

85)

(0.0

19)

(0.0

72)

Imp

lied

σ5.

927*

**8.9

91***

5.4

87***

6.4

19***

5.7

99***

9.7

02***

(0.2

86)

(1.7

54)

(0.2

99)

1.1

74

(0.3

25)

(2.2

94)

Nu

mb

erof

Ob

serv

atio

ns

6413

6413

2852

2852

5944

5944

Note

s:*

ind

icate

s10%

sign

ifica

nce

,**

ind

icate

s5%

sign

ifica

nce

,an

d***

ind

icate

s1%

sign

ifica

nce

.H

eter

osk

edast

icit

yro

bu

stst

an

dard

erro

rsare

inp

are

nth

eses

.A

llre

gre

ssio

ns

incl

ude

aco

nst

ant.

All

vari

ab

les

are

diff

eren

ced

wit

hre

spec

tto

the

Pan

aso

nic

KX

-P1150.

Sm

all

con

sum

ers

are

hom

eoffi

ceb

uyer

sor

1to

9em

plo

yee

esta

blish

men

ts.

Larg

eco

nsu

mer

sare

10

or

more

emp

loyee

esta

blish

men

ts.

“B

WP

PM

Sp

eed

”is

the

maxim

um

nu

mb

erof

pages

per

min

ute

that

can

be

pri

nte

din

bla

ckan

dw

hit

e.

43

Tab

le4:

ASam

ple

ofO

wn-

and

Cro

ss-P

rice

Ela

stic

itie

sP

rod

uct

Gro

up

Ap

ple

Colo

rB

roth

erC

an

on

Ep

son

HP

Des

kje

tX

erox

Sty

leW

rite

r2400

HL

-631

BJ-2

00ex

EP

L-5

500

1600C

4505

MN

LM

od

elA

pp

leC

olor

Sty

leW

rite

r24

00In

kje

t-2

.5009

0.0

143

0.0

332

0.0

237

0.0

063

0.0

314

Bro

ther

HL

-631

Lase

r0.0

205

-2.5

071

0.0

332

0.0

237

0.0

063

0.0

314

Can

onB

J-2

00ex

Inkje

t0.0

205

0.0

143

-2.4

882

0.0

237

0.0

063

0.0

314

Ep

son

EP

L-5

500

Lase

r0.0

205

0.0

143

0.0

332

-2.4

977

0.0

063

0.0

314

HP

Des

kje

t16

00C

Inkje

t0.0

205

0.0

143

0.0

332

0.0

237

-2.5

150

0.0

314

Xer

ox45

05L

ase

r0.0

205

0.0

143

0.0

332

0.0

237

0.0

063

-2.4

900

NL

Mod

elA

pp

leC

olor

Sty

leW

rite

r24

00In

kje

t-8

.7649

0.0

143

0.3

665

0.0

237

0.0

699

0.0

314

Bro

ther

HL

-631

Lase

r0.0

205

-8.8

709

0.0

332

0.1

984

0.0

063

0.2

627

Can

onB

J-2

00ex

Inkje

t0.2

259

0.0

143

-8.6

243

0.0

237

0.0

699

0.0

314

Ep

son

EP

L-5

500

Lase

r0.0

205

0.1

199

0.0

332

-8.7

924

0.0

063

0.2

627

HP

Des

kje

t16

00C

Inkje

t0.2

259

0.0

143

0.3

665

0.0

237

-8.9

208

0.0

314

Xer

ox45

05L

ase

r0.0

205

0.1

199

0.0

332

0.1

984

0.0

063

-8.7

281

NR

CL

Mod

elA

pp

leC

olor

Sty

leW

rite

r24

00In

kje

t-7

.4814

0.0

123

0.3

108

0.0

179

0.0

618

0.0

251

Bro

ther

HL

-631

Lase

r0.0

176

-8.4

220

0.0

263

0.1

825

0.0

052

0.2

558

Can

onB

J-2

00ex

Inkje

t0.1

915

0.0

113

-6.8

084

0.0

165

0.0

570

0.0

232

Ep

son

EP

L-5

500

Lase

r0.0

154

0.1

098

0.0

231

-7.3

321

0.0

046

0.2

242

HP

Des

kje

t16

00C

Inkje

t0.1

995

0.0

118

0.2

988

0.0

172

-7.3

328

0.0

241

Xer

ox45

05L

ase

r0.0

163

0.1

163

0.0

244

0.1

694

0.0

049

-7.6

992

Note

s:T

hes

ecr

oss

-pri

ceel

ast

icit

ies

are

calc

ula

ted

usi

ng

the

form

ula

sd

iscu

ssed

inS

ecti

on

3.

Each

entr

yis

the

per

centa

ge

chan

ge

inqu

anti

tyso

ldof

the

row

good

inre

spon

seto

ap

erce

nta

ge

chan

ge

inp

rice

of

the

colu

mn

good

.

44

Table 5: Price Index Results, 1996 vs. 2005Model IndexCES 0.061

[0.056, 0.065]NCES 0.160

[0.148, 0.178]MNL 0.035

[0.028, 0.040]NL 0.110

[0.094, 0.123]NRCL 0.100

[0.090, 0.111]NRCL Small 0.051

[0.038, 0.062]NRCL Large 0.122

[0.105, 0.140]Notes: These price indices are calculated at the quar-terly level (comparing the first quarter of 2005 to thefirst quarter of 1996, for example), and then averagedover all four quarters. Bootstrapped 95 percent confi-dence intervals are in brackets.

Table 6: Price Index Decompositions, 1996 vs. 2005Model Common Goods Changing GoodsCES 0.137 0.446

[0.408, 0.475]NCES 0.201 0.800

[0.743, 0.888]Model Price Quality VarietyMNL 0.386 0.376 0.247

[0.376, 0.387] [0.347, 0.413] [0.210, 0.275]NL 0.411 0.352 0.757

[0.398, 0.413] [0.326, 0.388] [0.646, 0.863]NRCL 0.374 0.352 0.783

[0.356, 0.380] [0.319, 0.366] [0.714, 0.852]Notes: These decompositions are calculated at the quarterly level(comparing the first quarter of 2005 to the first quarter of 1996, forexample), and then averaged over all four quarters. Bootstrapped 95percent confidence intervals are in brackets.

45

price, quality, and variety: measuring the gains from trade in

Documents