august 10, 2004 - university of north carolina at chapel...

The Location Patterns of U.S. Industrial Research: Mimetic Isomorphism, and the Emergence of Geographic Charisma*

Stephen J. Appold Department of Sociology

National University of Singapore 10 Kent Ridge Crescent

Singapore 119260 6874-6393

Fax: 6777-9579 http://courses.nus.edu.sg/course/socsja/

[email protected]

August 10, 2004

* I would like to thank Jack Kasarda, Eric Leifer, Michael Luger, and the anonymous reviewers of this journal for comments on an earlier draft of this paper.

The Location Patterns of U.S. Industrial Research:

Mimetic Isomorphism, and the Emergence of Geographic Charisma The emergence of new industrial spaces over the past several decades that have radically altered the economic geography of the U.S. raises questions about the mechanisms responsible for their formation. Existing theories predict either continued concentration or spatial dispersion; none predict the rise of new geographic agglomerations of establishments. A behavioral theory of agglomeration formation explains the emergence of regionally-dispersed, local agglomerations by means of mimetic behavior. A method for detecting social influence in cross-sectional data is applied to data on the 1985 locations by county of over 10,000 privately-owned research laboratories to show that the theoretical model accurately reproduces a key aspect of the existing spatial pattern. The results suggest that laboratories are, to a significant extent, reacting to each other’s actions, creating symbolic, rather than functional, communities and that the locus of power determining local growth is diffused among location decision-makers. R&D; urban agglomeration; geographic charisma; learning theory

1

The Location Patterns of U.S. Industrial Research:

Mimetic Isomorphism, and the Emergence of Geographic Charisma

Between 1953 and 1998 (National Science Board, 2001) spending on research and

development in the United States increased almost eight-fold in constant dollars – approximately 1.8

times as quickly as the growth of the GDP. Scientific research and development clearly plays a key

role in the contemporary economy. Laboratories, the site of much of this formalized learning, are

sometimes believed to generate significant local economic multiplier effects by attracting support

services and high-skill manufacturing (Malecki, 1997). As one of the several privileged institutions in

an emerging knowledge-based economy (Bell, 1973: 116) research laboratories are critical

components of the emerging “new industrial spaces (Scott, 1988).”

––––––––––-

Map 1 about here

––––––––––-

Map 1 shows the 1985 location by county of the 10,393 private, industrial research

laboratories in the United States that were active in ten broadly defined, overlapping, yet not

exhaustive, research fields, aerospace, aviation, biological science, chemical research,

communications, computer science, electric and power, electronics, engineering and medical science.

Information on laboratories was collected from the 19th (1985) edition of Industrial Research Labs of

the United States, the most complete enumeration of private research labs available. Each of these

fields had a history that was at least decades old at the time. The year was chosen because it follows a

decade or so of rapid location decisions, approximately coincides with a brief pause in the sitings of

research laboratories nationally, and pre-dates institutional changes in industrial research, including

the demise of the central corporate laboratory.1

The mean number of industrial research laboratories per county was 3.328 but, as 71 percent

of the counties contained no labs, they were located in fewer than 900 of the approximately 3,065

counties in the continental United States. Approximately 10 percent of counties with any labs housed

a single lab; Los Angeles County had the greatest number, 481. In between the extremes, numerous

2

regionally-dispersed agglomerations of laboratories (19 counties contained more than 100 labs) were

sprinkled across the U.S. The existence of multiple, dispersed agglomerations is difficult to explain

with existing theories.

The standard factors in the economic geographer’s toolkit: reliance upon natural resources

and the need to interact efficiently with others (Fujita and Thisse, 2000; Mills and Hamilton, 1994)

give only modest help in explaining the pattern of regionally-dispersed agglomerations seen in the

map. Research scientists and engineers are not directly involved with the processing of goods and so

are rarely anchored to specific sites. Moreover, since the advent of the telegraph, the ability to

communicate effectively has been progressively de-coupled from physical proximity (Pred, 1967).

The real costs of transportation are diminishing, leading to supply and marketing channels that are

often global in scope (Frobel, Heinrichs, and Kreye, 1980; Gereffi and Korzeniewicz, 1994). High-

skill (Herzog, Schlottman and Johnson, 1986) and low-skill (Martin and Widgren, 1996) labor moves

in national and, as shown by the high levels of immigration, international markets. Technical

knowledge flows through extra-local “invisible colleges” (Crane, 1972) and strategic alliances

(Mowery, Oxley, and Silverman, 1996). Much of the available evidence on the geography of

corporate control and industrial supply (e.g., Castells, 1996) suggests that contemporary cities and the

regions that surround them are not tightly-interdependent economic units and research laboratories

may be more tightly agglomerated than functional interdependence requires (Pascale and McCall,

1980).

Just 25 years earlier than the data depicted in the map, Robert Lichtenberg (1960) termed the

greater New York region, “One Tenth of a Nation,” and Raymond Vernon (1960) proposed a product

cycle theory to explain that region’s continuing viability as the nation’s (and the world’s) innovation

center on the basis the unduplicable advantage of amenities, rapid communication, and a rich supply

of educated labor. Nevertheless, the progressive loosening over the past several decades of the

constraints that have historically limited economic activities to particular places has led to neither of

the most theoretically well-grounded predictions: continued metropolitanism – the increasing

incorporation and subordination of peripheral places into an expanding system of dominance centered

on a single core metropolitan center (Arthur, 1994) – or widespread deagglomeration – creating a

3

dispersed geography of “community without propinquity” (Webber, 1967). Instead of the patterns

predicted on the basis of well-developed behavioral models, multiple new dense agglomerations of

economic activities have developed. Several of these agglomerations based largely on research-

intensive industries, such as those centered in Silicon Valley, the Research Triangle Park, Austin TX,

and a host of less celebrated places, have drastically altered America's economic landscape. The

pattern in the map then presents a puzzle: these dense agglomerations have often been interpreted as a

need for specific resources (such as universities) or agglomeration economies. Yet, prior to the recent

maturity of these new agglomerations, such needs would have favored the Northeast and

disadvantaged the rest of the country.2

Although there has been extensive discussion of local operational interdependencies that may

advantage firms, these must be discounted as a cause of the emergence of new industrial places. As

just mentioned, local operational interdependencies favor older, established industrial places. Such

interdependencies were a crucial element of Vernon’s theory. The little evidence that exists suggests

that decades are required before such localization economies emerge in new locations – if they do at

all. Moreover, while there is an impressive and growing literature documenting the co-location of

critical economic activities, evidence for local connections is limited. Local interaction may develop

in some industrial agglomerations but not in others (Glasmeier, 1988; Saxenian, 1994). Even in the

agglomerations held to have the richest inter-connections, local interdependencies appear to be limited

(Oakey, 1984) and the effect of regional milieu on an organization’s research and innovation appears

to be minimal (Kleinknecht and Poot, 1992; Love and Roper, 1999, Smith, Broberg, and Overgaard,

2002). Industrial location seems similar to residential choice in this regard: sometimes neighborhood

ties develop and sometimes they do not. The critical task is explaining why contemporary

agglomerations, typified by propinquity without functional community, persist and indeed even

develop.

Recognizing the weak constraints on the location of many contemporary economic activities,

a number of theorists have suggested that the locations that are able to mobilize the resources firms

need, rather than those that are well-endowed, will attract firms and the ensuing employment growth

in the contemporary economy (Freeman and Soete, 1997). Local growth coalitions (Molotch, 1976)

4

may be willing to invest considerable energy in mobilizing resources because rentiers tied to

particular locations stand to profit from local employment and population growth. Entrepreneurial

localities (Eisinger, 1988) might then offer incentives to firms and commission elite actors to use their

personal influence to recruit firms. Research universities and government research institutes may act

as institutional anchors for modern local industrial development (Etzkowitz and Leydesdorff, 1997).

While some have argued that local politics is, of necessity, forced to concentrate on local

development (Peterson, 1980), the evidence for the efficacy of growth coalitions – most of which

comes from selected case studies of successful developments – is mixed. Pittsburgh has been

recognized as having a strong elite coalition (Chinitz, 1961), but regional growth has been below the

national average for many decades. A concentrated community power structure positively affects the

extraction of benefits from the Federal government (Hawley, 1963) but such benefits have little effect

on local employment growth (Kasarda and Irwin, 1991). Further, issuing Industrial Revenue Bonds to

reduce the costs of capital to firms and tax rate differentials have had no measurable effect on

employment growth (Carlino and Mills, 1987). Longitudinal investigations of the effects of research

parks (Appold, 2004) and enterprise zones (Bondonio and Engberg, 2000) have not found them to be

effective local development tools. It is tempting to assign causal priority to those most active in

working towards development but the bulk of the evidence suggests that the locus of power in

determining economic geography does not rest with local promoters (Logan, Whaley, and Crowder,

1997), whatever their motivation and institutional affiliation. Accordingly, some observers are

therefore skeptical of the existence of specifically regional innovation systems (Howells, 1999).

This paper explores one of the social mechanisms (Hedström and Swedberg, 1998)

responsible for the development of new industrial spaces, agglomerations of industrial research

laboratories in particular, in an era of eased long-distance communication and low direct dependence

upon natural resources. This is an era characterized by a high level of co-location but a much lower

level of operational connection. By bringing three disparate, but nonetheless related, literatures to

bear on the issue of location decision-making, the paper will examine the basis for continuing and

renewed spatial agglomeration. The economics of information specifies how uncertainty affects

decision-making. The institutional theory of organizations explains how population-level learning

5

through signaling leads to a degree of mimetic isomorphism. The study of cultural signs suggests

how firms being influenced by each others’ decisions can, in rare circumstances, lend geographic

charisma to particular places. The combination of behavioral uncertainty, social signaling, and

cultural imaging results in the emergence of new centers that may challenge the dominance of older,

established places and in a level of agglomeration not mandated by operational considerations. The

hypothesis is cultural in that the critical factor determining the level and location of agglomeration

development is symbolic representation, rather than collective functional interdependence. It is

ecological in that the determining power is distributed diffusely among a large number of actors.

The spatial distribution of establishments is, to a large extent, a consequence of firm location

decisions. The character of the location decision-making process, therefore, may determine the nature

of the resulting spatial structure. In many industries firms cannot optimally choose locations because

the required resources are either difficult to observe or not well understood. Siting decisions are

major capital investments that are infrequently made by most firms; as such, they give rise to much

uncertainty. Decision-makers, therefore, search for signals – or signs (Sebeok, 2001) – to help them

choose correctly. The most powerful signal, indicating that a location is an “appropriate” choice,

might be the presence of other, similar, firms. Economic theory suggests why actions, which are not

intended to communicate, are so effective at doing so. While location decisions are almost

universally legitimated with an operational justification (and such a benefit may sometimes

subsequently develop), decisions are often made on the basis of an unsubstantiated belief, indicating

an influential role for signals in determining the pattern and level of agglomeration when the

operational conditions for dispersion exist.

Detecting the effects of cultural signals is difficult (Schelling, 1978). Therefore, the pattern

of laboratory locations will be compared to simulations based on a theoretically-informed model of

behavior. That model is rooted both in the theory of decision-making and in the theory of spatial

point patterns (Diggle, 1983; Getis and Boots, 1978). Aggregating the outcomes of choice situations

results in a frequency distribution of choices among the possible options. The nature of this

distribution can reveal the degree of social influence. A family of statistical distributions, called

Polya-Eggenberger distributions (Johnson and Kotz, 1969) which includes the binomial distribution

6

as a special case, is useful in estimating a social influence parameter, indicating the degree of

deviation from the baseline model of no influence.

In the next section, I outline a simple theory of location decision-making over time that

incorporates a social influence process. I then derive a method for detecting social influence in cross-

sectional data. Using the method derived, I test the theory on the 1985 locations of private industrial

research laboratories, accounting for the role of universities and discussing the possibility of

unobserved heterogeneity. Following that, I suggest an ideal-typical development sequence that

places the social influence model in the context of other theories of urban development. I conclude by

outlining elaborations on the basic model and further steps for research.

Location decision-making

Location decisions are the result of the need for facilities. Establishing a new facility, such as

a research laboratory, entails a large capital investment. Accordingly, location decisions are among

the most important aspects of the strategic management of firms (Rumelt, Schendel, and Teece,

1994). To the extent that local characteristics yield advantages in the ease and costs of acquiring or

distributing goods, services, or labor and the presence of other firms results in either economies or

diseconomies of agglomeration, the productivity of establishments may vary by location (Hoover and

Giarratani, 1984). Therefore, mangers attempt to choose the locations that will yield the greatest

operating return. When information about the requirements of the production process and about the

characteristics of localities is nearly complete, site choice is a relatively straightforward, albeit

difficult, management decision. Land, resource, and transportation costs dominate other

considerations in the capital-intensive plants utilized in heavy industrial production. Not surprisingly,

they often have systematically-organized location searches and produce well-defined (“regular” or

spaced out) geographic patterns.

When costs are difficult to calculate and predict – as they are for research and other higher-

order information-processing activities – choosing a location becomes more perilous. Information

about how the potential operating environment affects those activities is inadequate because

environmental feedback is slow and arrives, with experience, only after a decision is made. Local

7

features, such as amenities, may be difficult to measure and their effects impossible to quantify.

Moreover, because location decisions occur irregularly, if at all, during the course of a managerial

career, organizational history does not provide trustworthy guidelines (March and Olsen, 1976).

Accordingly, managers in many firms, particularly those engaged in high technology activities

including research, have difficulty identifying their locational needs and those named could be

fulfilled by almost all metropolitan areas (Schmenner, 1982).

When weak or non-existent feedback results in uncertainty and blocks the ability to

distinguish between alternative locations, firms may rely on signals based on each other’s behavior.

Signals are “activities or attributes of individuals ... which, by design or accident, alter the beliefs of,

or convey information to, other individuals (Spence, 1974: 1).” Decision-makers will attempt to

observe signals when the actual characteristics of a market good, in this case a potential laboratory

location, cannot be readily observed. But, because misleading signals may be purposely sent when

information is known to be incomplete, the theory also suggests why local growth coalitions might

have difficulty attracting firms. First, the act of recruitment itself, particularly when paired with

incentives, signals a lack of desirability. Second, firms being recruited know that growth coalitions

have a motivation to misrepresent their localities in a positive light. Therefore, managers will tend to

look beyond the industrial recruiters and growth coalitions for credible information when the relevant

local attributes cannot be readily identified.

While economic theory focuses on two-person (buyer-seller) signaling, institutional theory

suggests that the actions of similar actors that have faced the same situation in the past provide

credible behavioral clues in the absence of direct information. The presence of other labs may serve

to “filter” good from bad choice options (Arrow, 1973). When decisions are made under uncertainty,

information about the state of the world can be seen as information about what ought to be done (Eco,

1976). In attempting to learn from the “best practice” of others, managers allow themselves to be

influenced by each others decisions (DiMaggio and Powell, 1983). Equivalent firms (that is, those

that have experienced similar situations but are not directly party to the decision) have no motivation

to misrepresent themselves. The commitment and expense their presence entails therefore creates an

effective signal. Mimetic isomorphism may also serve a symbolic role, enhancing legitimacy and

8

providing social validation for behavior, without necessarily increasing organizational efficiency

(Tolbert and Zucker, 1983), leading to a higher degree of agglomeration than that required by

functional interdependence. Direct contact is not necessary for the social influence to occur.

Managers may form beliefs and take actions based on what they observe. If geographic

location were unimportant to productivity, managers faced with the need to chose a site for a new

facility might reasonably infer that the existing establishments would be randomly distributed. If, on

the other hand, location were important, it should be reflected in the choices of the firms already in

operation. Therefore, without necessarily understanding why a particular site is more attractive than

another, a manager could nonetheless conclude that the number of successful laboratories that are

located at a particular site is an indicator of its productivity. Managers, in making location decisions,

may do as many researchers have done: they examine the behavior of others, search for correlates and,

in the event none are found, assume some unobserved environmental heterogeneity. That assumed

and unobserved advantage may be the basis of local geographic charisma.

Social processes and statistical distributions

The foregoing discussion implies that the spatial distribution of laboratories is the product of

a social process and that the character of that social process – successive location decisions made by

different actors under uncertainty with high costs to poor decisions – affects the nature of the spatial

distribution, resulting in more agglomeration than would otherwise be expected. Analytic techniques

that specify the relationship between social process and spatial outcome make it possible to identify

the responsible processes even when spatial patterns are observed at a single time point only (Getis

and Boots, 1978: 2-3). The nature of the social processes that create spatial distributions may be most

easily envisioned using urn models, where balls are used to represent the attractiveness of spatial units

(counties) and the degree of replacement after each decision captures the level of learning (influence),

if any, among actors. The resulting patterns can often be characterized using statistical distributions.

The level of influence can be estimated from the spatial data using a family of distributions related to

the binomial (and the nearly equivalent Poisson) distribution once relevant exogenous factors have

been properly accounted for.3

9

Social processes result in distinctive patterns of outcomes – sometimes termed “events” or

“occurrences.” Many unsystematic independent influences upon a quality or property results in a

normal distribution of observed outcomes around a central true value. Similarly, many unsystematic

independent influences on location decisions do not result in a uniform number of occurrences across

space; they create a pattern that approximates a binomial distribution. If the location decision-making

processes were perfectly modeled and all options could be perfectly ordered, there might be no need

to consider what is often called the conditional (error) distribution. In practice, the modeling of

decisions must reckon with the unsystematic misperceptions, optimization problems and measurement

errors that bounded rationality creates. All observed decision making is, therefore, stochastic and the

modeling of decisions is often based on random utility theory (Luce, 1959).

The distribution of “events” (lab locations) among choice options (counties) which are

essentially equivalent is key to distinguishing between independent decision-making and decisions

affected by social influence. The character of the conditional distribution often indicates whether or

not a given process is adequately specified. Heteroskedasticity or autocorrelation in the conditional

distribution frequently indicate non-independence; that is, the “errors” are not independent of either

some determining characteristic or, in the latter case, of each other. Similarly, social influence

involves cases where the decisions made are not independently of each other.

Urn model representations of decision-making

Probabilistic decision-making often can be represented in terms of intuitive thought

experiments about different procedures for drawing balls from an urn. The hypothetical urn contains

the complete set of options available to the decision-maker. These are represented as balls. The balls

have colors or numbers differentiating the various options. The number of balls of each type

combined with the total number of balls represents the probability of each choice possibility. The

number of the balls of each color or number drawn represents the frequency distribution of choices.

An urn model becomes a theory of social process whenever the essence of that process is captured by

the method of drawing balls from the urn (Coleman, 1964: 517). The urn model presented here has a

close correspondence to the behavioral process of location decision-making.

10

Independent decision-making – wherein actors are not affected by the actions of others –

results in a binomial probability distribution of choices, or “events” (laboratory locations), among

equivalent options (counties) around the mean number for that type of option in repeated trials

(location decisions). This corresponds to drawing balls from a uniform rectangular distribution of

options in an urn with replacement. For example, if there are five equivalent options, such as

identical coffee cups on a shelf, the binomial distribution describes the number of times a particular

cup is selected in a given number of repeated trips to the coffee machine. Each individual cup can be

expected to be chosen one-fifth of the time and the distribution around that expected value follows a

predictable pattern.

The simplest case, analogous to the visually distinguishable but functionally equivalent coffee

cups, corresponds to the “featureless plane” sometimes hypothesized by geographers. While not a

literal depiction of any geography, decision-makers could be indifferent among alternative

possibilities because each offers equivalent utility and would therefore be equally acceptable. Actors

may also be indifferent because they are unable to perceive differences which they believe to exist.

The effect is to create a homogeneous choice situation. Webber’s “community without propinquity”

argument, local amenities, and the idiosyncratic histories of particular places and people – when the

fact that someone’s mother lived near Palo Alto or that another person’s vacation home was in Florida

or Arizona determined the location of a company’s laboratory – would create such a random

distribution of laboratories.

The baseline model holds that decision-makers do not influence each other. If actors are not

affected by the actions of others, the probability distribution of choices does not change through time.

In fact, choices made at one time could affect the choices made at a later time, however. Making a

particular choice at one time may increase – as with positive reinforcement – or decrease – as with

negative reinforcement – the chances of that choice being made again in the future (Luce, 1959; Bush

and Mosteller, 1955). Location choices that are made sequentially by one or more decision-makers

can be seen as “chain learning” (Johnson and Kotz, 1977) or population-level learning (Miner and

Haunschild, 1995) situations where the hypothetical decision urn is passed from one actor (firm) to

the next.

11

An urn model of social influence also begins with a uniform rectangular distribution of

options but after each choice a number of balls of the same type as the one chosen are added to (or

subtracted from) the urn, modifying the chance that the same option will be chosen again in the future.

This corresponds to sampling with over- (under-)replacement. The number of balls added

(subtracted) essentially corresponds to a social influence parameter. If decision-makers are positively

influencing each other, the distribution of their choices will follow a Polya-Eggenberger – contagious

binomial – distribution. Among a set of equivalent options, some may be chosen more than would

otherwise be expected and many others may not be chosen at all. Each “event” will exert an attractive

force on other “events.” When a social influence process operates, choices will be a function of the

characteristics of the option and the force of influence.

Although not immediately relevant to the present case, actors could also affect the actions of

each other negatively. Actors could claim scarce resources and each unit of resources claimed by the

existing occupants is not available for use by newcomers. Since resources once claimed are

unavailable to others, this model corresponds to sampling without replacement. In this urn model,

once a ball is chosen it is not replaced. Under these conditions the number of times the same choice is

made follows a hypergeometric distribution. The early choices then tend to disperse subsequent

choices through the possibility space resulting in even or “regular” distributions (Pielou, 1977). For

example, housing units and jobs, once claimed, cannot be claimed by another. Returning to the coffee

cup example, a decision rule “always use a dirty cup in the sink, if one is available” would produce a

Polya-Eggenberger distribution of choices, while a decision rule “don’t clean a cup unless there is no

alternative” would result in a hypergeometric distribution of choices among the identical cups.

In the baseline model, options (counties) exert a fixed “pull” on decision-makers regardless of

how many actors have or have not chosen that option. There is no positive or negative social

influence. The number of individuals choosing a particular option, once the attractiveness of that

option is taken into account, is determined by the number of decisions made and the number of

equivalent options. In the second model, each option’s (county’s) “pull” is augmented by the number

of individuals who have already chosen a particular option up to any given point in time. The

working of social influence makes the conditional distribution less random than expected. In the case

12

of positive influence, the more “events” an option has attracted, the more it will continue to attract.

A social influence equation

Urn models provide an intuitive understanding of the influence process but do not provide an

independent method of estimating the level of social influence. Fitting the parameters of a family of

distributions to the pattern of choices among similar options allows for such estimates. Johnson and

Kotz (1969) show the Polya-Eggenberger family of statistical distributions contains special cases

associated with each of the possibilities named: negative influence (hypergeometric distribution), no

influence (the binomial distribution), and positive influence (the simple Polya-Eggenberger or

contagious binomial distribution). The generalized Polya-Eggenberger distribution has been

previously used to model psychological theories of learning (Bush and Mosteller, 1955) and to model

voting behavior in groups (Coleman, 1964).

Eq. 1.1:

Equation 1.1 shows the probability density function of the Polya-Eggenberger distribution. In

equation 1.1, “n” represents the total number of trials – the number of times a decision was made

regardless of the option chosen. In this case, this is the total number of laboratories nationally. Here,

“k” represents the number of times a particular option (county) was selected. The brackets, “[ ]”,

signify the ascending factorial.

Eq. 1.2:

The second form of the equation (Equation 1.2) clarifies the relationship between the Polya-

)+(k

n=k]=Pr[x [n]

k]-[n[k]

βαβα

⎟⎟⎠

⎞⎜⎜⎝

⎛

1)s)-(n+w+(bs)+w+w)(b+(b1)s)-k-(w(ns)+1)s)w(w-(k+(bs)+b(b

k

n=k]=Pr[x

K

KK⎟⎟⎠

⎞⎜⎜⎝

⎛

13

Eggenberger, binomial and hypergeometric distributions to urn model representations. The parameter

" is equivalent to “b/s;” ß is equivalent to “w/s.” The parameter “b” represents the attractive force of

one option (county) of a particular type; “w” represents the attractive force of all other options; and

“s” signifies the attractive (or repellent) force of each “event” (laboratory in a particular county). If

“s” is equal to -1, the equation produces the hypergeometric distribution. If no influence behavior is

taking place the parameter, “s” – the over-replacement factor, will equal zero and the equation will

simplify into a binomial distribution as illustrated in Equation 2:

Eq. 2:

Comparing predictions to data should reveal the relative merits of each model. The

parameters for the independent decision-making model and for the social influence model, " and ß,

can be estimated from data using the method of moments procedure from the mean (“0”) and variance

(“v(x)”) of the number of laboratories in each county. The equations of the estimators are shown

below. Their derivation from the moment equations is shown in an Appendix.

Eq. 3.1:

Eq. 3.2:

The estimated parameters for each model can then be used with the distribution equations illustrated

earlier (Eq. 1 and Eq. 2) to generate the full expected frequency distribution of labs among the

)w+(bwb

k

n=k]=Pr[x n

k-nk

⎟⎟⎠

⎞⎜⎜⎝

⎛

x + ) x - v(x) ( n) x + v(x) (x - xn = 2

22

α̂

ααβ ˆˆˆ - x n =

14

counties in each county category. The predicted marginal and cumulative distributions for each

model can then be compared with the actual distribution of laboratories using graphical methods and

Kolmogorov-Smirnov statistics (D'Agostino and Stephens, 1986).4

Similar to log-linear models which predict frequency counts or covariance structure models

which reproduce covariance matrices rather than individual outcomes, the models considered here

estimate probability distributions. Because the social influence occurs between options which have

been defined as equivalent, it is impossible to a priori determine which particular option (location)

will be favored.

Heterogenous options

These sequential decision making models are able to model the effect, or non-effect, actors

have on each other through time once the exogenous factors that determine location decisions are

adequately accounted for. Social processes, however, may be complex and the resulting structure

may need to be decomposed into a mixture of simpler processes and distributions (Everitt and Hand,

1981). The first statistical distribution would characterize the allocation of social events (in this case,

laboratory locations) across distinguishable categories of the possibility space (for example, counties

that offer different levels of urbanization economies) while the second statistical distribution would

characterize the allocation of “events” among equivalent options. Because estimates of the degree of

influence depend upon correctly specifying those factors, a strong null model specifying the factors

which should matter and which should not is needed. This model will be developed in the next

section.

Variations in the characteristics of options can also be represented in urn models. The

difference with the preceding discussion which assumed that all options (counties) were equivalent is

that the sampling space at T0 is not uniform rectangular. More desirable options, such as larger coffee

cups, are represented by a greater number of balls than those less desirable but “sampling” proceeds

with replacement, without replacement and with over-replacement, respectively. If homogeneous

subgroups of options are examined, the conditional frequency distributions under the various models

should resemble those in the case of homogeneous options.

15

Explaining the agglomeration of industrial research laboratories

Because the measurement of social influence in location decisions depends upon a strong

baseline model that controls for exogenous factors, the level of urbanization was measured by

classifying counties according to metropolitan status and size of the largest city (Beale and Fuguitt,

1978).5 The effective operation of laboratories might require the ready access to a supply of well-

educated labor, the nearby availability of a wide range of high-level services, and the flow of a wide

range of goods afforded by population. These factors might create economies of urbanization that

could be expected to affect the location decisions of firm managers. The level of residual

agglomeration was modeled; the role of universities as special signals and the possibility of

unobserved heterogeneity is considered afterward.

–––––––––––––––

Table 1 about here

–––––––––––––––

Table 1 shows that urbanization clearly had an effect on the spatial distribution of

laboratories. Almost 90 percent of all laboratories were located in metropolitan counties. Forty

percent of all labs located in the 49 counties which comprise the cores of the largest metropolitan

areas and over 60 percent of all labs in the U.S. were located in 181 core and outlying counties

comprising the largest metropolitan areas. Only 10 percent were located in rural counties. The

number of labs per million residents varied from a mean of 4 in sparsely populated counties that are

not adjacent to metropolitan areas to a mean of 47 in the outlying counties of large metropolitan areas

to an average of 70 in the core counties of large metropolitan areas, indicating a non-linear

relationship.

–––––––––––––––

Figure 1 about here

–––––––––––––––

The distribution of laboratories was not determined solely by urbanization, however. A

scattergram of the number of labs by county population (both logged) is heteroscedastic, revealing

16

little variation in the number of labs in sparsely populated areas and an increase in the average number

and in the variation in the number of labs as population rises. This suggests a limit to the number of

laboratories that can be supported at given population levels but also large differences in the degree to

which the local carrying capacity is approached. Figure 1 shows that a county with a population of,

say, 500,000 could have as few as no labs at all or over 100. On the other hand, some counties with

fewer than 8,000 inhabitants supported a lab. Variation in numbers of labs in areas with similar

population levels (3-200 labs) can be quite large compared to the variation between the extremes of

high and low population (0-481 labs). There was little agglomeration in the sparsely populated areas

where localization economies could substitute for urbanization economies. Yet it is precisely where

such agglomeration might be most valuable that it is least in evidence. (The same finding results from

using a number of measures and spatial corrections.)6

–––––––––––––––

Table 2 about here

–––––––––––––––

Table 2 shows the decomposition of agglomeration, measured by the Theil (1967)

information statistic7, into a component accounted for by urbanization (“between county categories”)

and a component that is not (“within county categories”). Urbanization (differences between county

categories) explained 67 percent of the total agglomeration of labs. However, looking at the 625

metropolitan counties only (which contain 90 percent of the labs), variation in urbanization, as

measured by the Beale categorization, explained only 23 percent of the agglomeration. Most of the

variation in numbers of laboratories is within categories of counties which are essentially equivalent.

Within each metropolitan category the distribution of labs is highly skewed with many labs locating in

some counties and few or none in many others.

Testing for social influence in location decision-making

The decomposition of the distribution of the spatial pattern of laboratories reveals a

substantial degree of unexplained geographic concentration which may be caused by social influence

during the location-decision making process. That thesis can be tested by estimating the parameters

17

of the theoretical model on the basis of the mean and variance of the empirical distribution, generating

the complete frequency distribution using those parameter estimates, and comparing the distribution

of laboratory counts among counties to that predicted by the theoretical model. I will also compare

the empirical distribution to that predicted by the independent-decision model and perform

specification tests.

–––––––––––––––

Table 3 about here

–––––––––––––––

Table 3 shows part of the data, predictions and goodness-of-fit calculations for one of the

category of counties: the outlying counties of large metropolitan areas. The actual marginal and

cumulative probability functions and those calculated from the mean and variance of the data using

the two models are shown. The difference between the actual cumulative probability function and

that calculated from the models forms the basis for the goodness-of-fit calculations shown in the last

two columns. The first row, for example, shows the proportion of outlying counties in large

metropolitan areas which had no labs and the proportion predicted by each of the models. The

independent decision-making model predicts that approximately half of the counties will have

numbers of laboratories that are below the mean for this county category of approximately 16. The

influence model predicts approximately three-fourths of the counties would have frequency counts

below the mean. That closely agrees with the data. The graphic results portrayed in Figure 2 suggest

that the Polya-Eggenberger distribution used in the influence model is fairly accurate in its predictions

of the proportion of counties with one, two, or more labs also. The independent decision-making

(binomial) model is not. That intuitive conclusion is supported by the Kolmogorov-Smirnov

goodness-of-fit statistics for each model. Although the significance level for the maximum deviation

from the predicted cumulative probability distribution is not exactly established, only about 20

percent of the empirical distributions generated by the theoretical model with the estimated

parameters would have smaller maximum deviations. The goodness-of-fit statistic for the binomial

model is an order of magnitude larger and the hypothesis that the data were generated by the

independent decision-making model would have to be rejected.

18

–––––––––––––––

Figure 2 about here

–––––––––––––––

Table 4 summarizes the results for each of the county categories. The variance/mean ratio,

three parameters for the Polya-Eggenberger distribution, the “independent” probability that a

particular county of that type will be selected, the ratio of the influence parameter to the average

probability of being chosen, and Kolmogorov-Smirnov goodness-of-fit statistics for the two models

are given for each county category. The key information from Table 3 is reproduced in the second

row of this table. The variance/mean ratio is a Poisson statistic measuring agglomeration; it provides

an independent indication of the level and direction of the deviation from the independence model. A

value of one indicates spatial randomness, corresponding roughly to the independent decision model.

The three Polya-Eggenberger parameters are the average probability of a county being chosen divided

by the influence parameter ("), the probability of a county not being chosen divided by the same

parameter (ß) and the influence parameter divided by the total resource pull. (The parameter " is

equal to b/s in equation 1; ß is equal to w/s in the same equation.) As described above, these were

estimated from the sample means and variances for each county category and correspond closely to

the parameters estimated by direct search.

–––––––––––––––

Table 4 about here

–––––––––––––––

For each county type the influence parameter (due to prior lab location decisions) is at least as

strong as the attractive force of the county itself. Both the attractive force of the counties and the

attractive force of the labs varies with urbanization. Each county has a baseline probability of being

chosen as a lab location by a given decision-maker of .000318. For remote rural counties this

probability decreases to .000024 (approximately 1/42,000) while core counties of large metropolitan

areas the probability of being selected is approximately 350 times greater (.0084 or about 1/120). The

19

influence parameters fall into a similar range indicating that the attractive force of a lab is roughly

correlated to the attractive force of the county in which it is located. The ratio of the influence

parameter to the baseline average probability of being chosen ranges from just under one to over

eight, from which a significant role of direct social influence can be inferred. The parameters suggest

that the effects of influence of a single individual lab is greatest in the most rural counties and

diminishes in the more urban counties; that result is consistent with information theory. Although the

influence of individual labs diminishes in the more urban counties, the larger numbers of labs ensures

that the overall level of influence has a greater effect on location decisions there.

The conformity between the predicted probability distribution and the actual distribution as

shown in the goodness-of-fit statistics and the graphic results (the remaining figures are included in an

Appendix) provide support for the influence model. The influence model shows its strongest

deviation from the data among large metropolitan area core counties. These deviations are relatively

minor and may be due to the small number of counties in this category. The independence model did

not fit any of the county categories. Tests for specification error indicate that the fit of the influence

model is not merely due to unobserved heterogeneity.8

Similar models were estimated for the laboratories owned by firms with ten or more labs;

these yielded similar results and fewer deviations. These large firms are more likely to have the

resources to conduct systematic location searches; they are generally less dependent upon the

urbanization economies offered by their immediate surroundings; and they are more likely to be able

to offer complete attractive career paths to their employees. Yet these firms are as open to influence

as the much larger set of labs in all firms. Figure 3 illustrates the results of analyses for the outlying

counties of large metropolitan areas.

–––––––––––––––

Figure 3 about here

–––––––––––––––

These results indicate that social influence is an important determinant of location even

among the laboratories connected to the firms least dependent upon agglomeration economies.

Interaction – a need to share research insights and encouragement – was not a factor in determining

20

the agglomeration of laboratories (although such interaction might eventually result). If interaction

were important, branch laboratories of large firms would be clustered together so that developments

and knowledge could be shared in-house rather than with competitors. The average distance between

branch laboratories and their headquarters is 880 miles. Among these large firms, only 16 percent of

the labs are in the same county as their firm headquarters and approximately 60 percent are in another

of the nine Census divisions. Even laboratories in the firms with the most resources to avoid or ignore

each other, but which are also the most visible to each other, are highly likely to locate near each

other, indicating that labs are taking locational cues from each other.

Universities as signals

Universities are often accorded a special place in the theory of knowledge-based economies.

They are sources of personnel and, hopefully, of knowledge. It is conceivable that universities are a

factor in determining the location of industrial laboratories. To the extent they are, it appears they act

as signals, rather than as operational resources. An analysis of the relationship between prominent

university departments and laboratory location suggests that labs clustered around universities and

that they clustered around some universities more than others. However, the pattern of agglomeration

cannot be related to the specific resources the university has to offer.

If universities served as important intellectual resources, as opposed to social signals, three

outcomes would be expected. First, the mean level of labs should be higher near universities than in

otherwise equivalent counties. Second, the distribution of labs should reflect the relative strengths of

the academic departments and, third, since faculty time and university graduates are limited, labs

should show a “regular” distribution between counties with relevant departments.

Controlling for type of county, the presence of a good university – one with at least one top

department as measured by the presence of PhD granting departments whose rating was at least one

standard deviation above the average for “important” PhD granting departments in that field (Jones,

Lindzey, and Coggeshall, 1982) – results in, on average, 3.74 (t, [mean = 1] = 6.32; p < .001) times as

many labs, confirming the first expectation. Looking at the counties which contain a good university

and controlling for the type of county, the presence of a top department in a field does not

21

significantly increase the number of labs in that field over the number found in counties with good

universities but with no prominent department in the relevant field (t [mean = 1] = 1.25; p > .10).

Further, in almost one third (9/30) of the cases, counties with good universities had fewer total labs

than the average county in the same category. One university, the University of Illinois at

Champaign-Urbana, which excels in several areas and has close ties to industry, appears not to have

had a single private lab located in the surrounding area in 1985. Even when the county category and

academic department are considered, an agglomeration of labs is evident rather than the “regular”

distribution expected. These results are consistent with analyses that suggest the geographic reach of

university knowledge (Feller, 1999) and personnel (Appold, 1998) spillovers can be quite large.

Considering unobserved heterogeneity

The interpretation of the results depends critically upon the adequacy of the baseline model

that controls for the operational advantages afforded by urbanization. In general, it is not possible to

differentiate between social influence among actors and heterogeneity among choices on the basis of

statistical distribution alone (Barron, 1992; Cameron and Trivedi, 1986; Xekalaki, 1983). It is often

possible to reproduce a pattern with a quite different statistical distribution. The risk here is that the

differences among counties are inadequately taken into account. While it is true that other

distributions may exist that can reproduce the distribution of laboratories, the model presented here

includes an intuitive behavioral process (represented by an urn model of sequential learning) that is

closely tied to a statistical distribution with interpretable parameters. That model fails to fit in

specification tests where unobserved heterogeneity is introduced. The interpretation of other

distributions may be problematic. Indeed, they are often picked for expediency in estimation.

(Maximum likelihood estimation encountered convergence problems with the distributions used in

here.)

Historical information discounts the heterogeneity model. The distribution of laboratories is

more skewed than the phenomena studied by other researchers (e.g., visits to physicians) and, in the

absence of social influence, would imply an extreme distribution of heterogeneity among counties.

As noted in the introduction, the aggregate movement of laboratories was away from places rich in the

22

types of resources held to attract labs and towards the places, specifically the suburban counties of

large metropolitan areas that lack such attractions. In that regard it is interesting to note that in 1985,

Santa Clara County (Silicon Valley) contained fewer laboratories than was average for similar

counties. While the increase in the number of laboratories between 1960 and 1985 was somewhat

above the average for similar counties, the increase in the number of laboratories per person was

lower than average and the increase in the number of laboratories over the same time period was

slower than it was in a sample of 64 suburban counties in large metropolitan areas. Much of the

apparent heterogeneity among counties appears to be a product of development, rather than a cause.

Social influence in the context of agglomeration growth

In order to illustrate how social influence might fit into a process of local agglomeration

formation, I describe an “ideal type” successful development process on the basis of data that are

gleaned from case studies. The social mechanisms outlined here are not necessarily cleanly separated

from each other and there is no necessary progression from one to the next. I identify three types of

processes in the development trajectory of contemporary laboratory agglomerations: 1) those that

directly involve the promoters of development, 2) those that are based on the social influence of local

elites, and 3) those based on imitative location decision-making.

The earliest phase of many contemporary agglomerations may begin with the attempt of a

local growth coalition, whether led by a private investor (e.g., Cabot, Cabot and Forbes for Route

128), a non-governmental entity (such as Stanford University for Silicon Valley), or a public-private

partnership (as in North Carolina's Research Triangle), to reap the benefits of employment growth.

These benefits include the revenue generated by land sales and ground rent and the tax flow generated

by added employment. Research-intensive, high technology industries were a major source of high-

wage employment growth in the several decades following the Second World War and, therefore, a

fitting target for recruitment. As large, risky development investments, much of the recruitment effort

needed to be underwritten by public bodies.

Many localities invested heavily in recruiting research-intensive employment in the 1960s,

often by creating research or technology parks. The August 1962 and August 1966 issues of

23

Industrial Development report 171 operating research parks in 116 counties. Many of these had direct

or indirect sponsorship from major research universities; few were still operating in 1985. These

research parks had to contend with the advantages of the traditional centers of innovation which could

already boast agglomeration advantages, a supply of highly-educated labor, and a high quality of life.

The economics of rents (Buchanan, Tollison, and Tullock, 1980) suggests further reasons for the lack

of success. To the extent that local elites can gain by development and the firms that are being

recruited are indifferent to locations, localities may need to bargain away all the potential advantage

by providing incentives. It is only when firms can be attracted without compensation that

development could be beneficial.

Local elites can use social connections to recruit firms. The initial occupants of Stanford's

industrial park had close social (but not necessarily functional) ties to the university. Many firms,

including Shockley Transistor Company, recruited from almost exclusively from outside the Bay

Area. Similarly, North Carolina's Research Triangle Park, first proposed by Howard Odum (a

University of North Carolina administrator) and later promoted by state governors, attracted its first

tenant in 1960. However, successful recruitment was limited to firms subject to influence from the

regional power elite; these were mainly those involved in the research aspects of the well-established

local textile industry.

North Carolina's Research Triangle Park might have languished like many others were it not

for an accident of history that allowed the local elite to extend its power (Moriarity, n.d.). Five years

passed after the initial opening of the Park without any additional important tenants. IBM was

considering locating a new facility in the South but it was unwilling to place a branch in the relatively

forlorn Park. Possibly in return for political favors, a governor and a former governor who had moved

to the Federal executive branch, were able to secure a major National Institutes of Health laboratory

for the Park. That was sufficient to attract IBM.

The many stories surrounding the recruitment of IBM to the Park belong to the lore of

idiosyncratic events that would create a dispersed spatial distribution of laboratories. The next

political exchange might be elsewhere or involve a different type of reward. The location decision of

a large firm that did not have direct ties to the local elite, however, established an important signal

24

that the Park was a fitting site for laboratories. Social influence processes could become salient.

Much later, one executive credited the presence of IBM and Northern Telecom (among other firms)

with his decision to locate the U.S. branch of his optical fiber firm in the Park (personal

communication). A large consulting firm had identified a dozen or so metropolitan areas fulfilling the

functional requirements of his firm. Although his firm had no direct relationship with those in

Research Triangle Park and did not necessarily anticipate any developing, he felt it an “appropriate”

location. The business press sometimes mentions that businesses “simply have to be” in certain

locations – much as a restaurant is trendy because of its customers – a place to “see and be seen” –

despite the absence of interaction among tables. For much of the history of the Park, the image of the

resources afforded by the three major universities and the relative absence of a blue-collar heritage

have been cited as an attraction – even if functional integration between industrial research and the

universities has not always been strong. While it might be tempting to cite the role of localization

economies in the development of agglomeration, the time span between the initial location decisions

and the critical mass necessary for such economies to emerge stretches into the decades.

Conclusions

Examining successful regions and looking backward through time, it is always possible to

construct coherent stories that make the employment growth seem, if not inevitable then at least an

heroic accomplishment. There is a visionary leader, a foresighted coalition, a critical amenity, a local

cultural tradition, and some unique local institution. Local histories construct the past after the fact

(Berger, 1963: Chap. 3; Schein, 1999). A careful prospective examination of recent decades would

show dozens of prominent leaders, countless critical amenities, and so on. They would be spread

across a broader geography than that of the recently emerged research-based new industrial spaces.

By investigating local growth after the fact and by sampling on the dependent variable, conspicuous

success, researchers can mistake correlation for cause. The growth of such industrial spaces requires

an explanation that is credible ex ante.

A model of location decision-making based on the operational advantages of particular

locations holds such a promise. But while the evidence for the co-location of related innovative

25

industrial activities accumulates (Anselin, Varga, and Acs, 1997; Feldman, 1994), there is little

evidence for the local connections that are the logical basis of that co-location. Jaffe, Trajtenberg, and

Henderson (1993: Table 3), for example, find that, even in the first year, 30 percent of the citations to

U.S. patents originate from other countries and that less than 10 percent of such citations originate

from the inventor’s metropolitan area and suggest that the more important the discovery, the more

quickly it travels. The degree of localization they found is lower than would be needed to sustain a

claim about regionally-bounded innovation systems. Indeed, Gertler (1995: 12) reports that small

businesses are prepared to “go anywhere” for what they need even if they would prefer sources closer

to home. Similarly, Almeida and Kogut (1997: 29) suggest that the entrepreneur him/herself is the

primary vehicle in local knowledge spillovers and that further local linkages are minimal with

successful firms connecting to national industrial networks. Because the majority of firm purchases

and sales are outside the surrounding 100-mile radius (Angel, 2002), it is not surprising that there is

little evidence for the positive effects of agglomeration on productivity when industries are examined

carefully (Rigby and Essletzbichler, 2002). Sub-national regions are “leaky” and immediate milieus

offer little special support to firms.

In this research, both functional interdependence in the form of urbanization economies and

social influence in the form of mimetic behavior were found to be important in determining the

location of a key institution of a knowledge-based economy – research laboratories. Social influence

among those making location decisions is one of several plausible social mechanisms responsible for

the emergence of new geographic economic centers. Influence originates in the implications of

location decision-making under uncertainty. That uncertainty is most pronounced in an expanding,

restructuring economy for economic activities with poorly understood production processes. Without

the need for new or additional facilities, location decisions need not be made. Without the loosening

of geographic constraints, location decisions would reinforce pre-existing location patterns. Without

uncertainty, straightforward technical location calculations would suffice and geographic dispersion

would result. The operation of social influence results in a “star” system that makes some places

much more successful than others and leads to the formation of new centers in former peripheries.

The stars in the urban system are the shock cities of the urban frontier that arise with changes in

26

economic regime (Suttles, 1984).

This paper used a technique for measuring social influence that conceptually links social

processes and the resultant spatial statistical distributions. By specifying the entire frequency

distribution of outcomes on the basis of theory and two basic parameters (the mean and variance of

the empirical data), the degree to which the agglomeration of laboratories was due to social influence

could be estimated from cross-sectional data. (The method can document the working of social

influence but it cannot differentiate between mechanical (Tarde, 1969), emotional (Shils, 1975), and

rational (Hedström, 1998; Zablocki, 1993) imitative behavior.) The method relies on a strong

normative null model of decision-making behavior in forming a baseline from which the workings of

social influence could be measured. By having a null model, misspecification can be discovered in

much the same way that it can be detected in OLS regression. Both omitted factors and neglected

social influence can lead to misinterpretation. The consistency of the results with the theoretically

predicted distribution of locations and with conditions under which influence processes should be

operating underpins the interpretation of the findings.

Social influence research is almost always vulnerable to the criticism that the apparent results

are due to some missing, unmeasured, yet crucial factor. Yet the possibility of unobserved factors is

the basis for an actor’s search for signals. Observing and responding to those signals may lend a

performance-based prestige (Friedland, 1964) that can result in a place being “considered

extraordinary and treated as endowed with ... exceptional powers or qualities (Weber, 1978: 241).”

Rituals, such as executive site visits, and narratives (Smith, 1999), such as those created by the

journalists and academics who manufacture an aura of exceptionalism around places of industrial

research (Kaplan, 1999) even suggesting a feverish quality (Rogers and Larson, 1984), imply that

such locations confer a well-being that cannot be described in mundane terms (Weber, 1978: 1112-

1114). The ensuing symbolic geography (Zukin, 1991) benefits elite growth machines and attracts the

real estate developers who are not the originators of geographic charisma (indeed attempts at local

promotion often fall flat) but who are the actors who attempt to institutionalize the charisma. Thus,

the meaning of place helps us understand, not only why particular places persist in importance (Firey,

1947) but why and how they become economically important. Such charisma would, of course, be

27

rare; today it clearly centers on the cultural construction, “Silicon Valley.”

A reliance on a system of visible actions for managerial guidance creates a self-regulating

system of signs determining collective behavior. The act of locating, by involving physical

investments (buildings), makes adjustments difficult while accompanying mobilization of productive

resources and the multiplier effects resulting from investment, may create the economic conditions

necessary for a stable spatial outcome. Because location decisions can create the conditions that

justify the actions, the signaling can determine the pattern of development. Communication and

transportation technologies have allowed contemporary agglomerations to become functionally open

systems. They are not closely bounded geographically. They are not necessarily social communities.

They are rarely economic units. Like many upper middle-class residential neighborhoods, the

common identity in agglomerations of research laboratories may exist “mostly as a creature of

romantic imagination (Dorst, 1989: 44).” If there is a locally-bounded economy, it is an economy of

signs that creates a valuable identity that may be used to help recruit scientists and engineers in a

competitive labor market and assist in securing credit in finance markets. The agglomerations of

laboratories are largely imagined or symbolic communities (Anderson, 1991; Hunter, 1974).

A simple model of influence assuming homogenous firms, a stable decision-making process,

immortal firms, and no generalization has been developed and used here. Although these assumptions

are pragmatic, as with many such assumptions, none are likely to be completely valid. A more

complete theory explaining the emergence of new industrial spaces in the U.S. would need to take

additional factors, conditions, and social processes into account. The attributes and identities of

individual laboratories (IBM v. an unknown) may affect the strength of their influence on others. The

strength of the influence of a location decision may decay over time and in combination with

variations in the rate of facility sitings result in alternating periods of “hot potato” growth and periods

of relatively unchanneled location decisions. Laboratory closure might send a strong negative signal

to decision-makers about the desirability of a locality. Imitation of decision rule, rather than imitation

of destination, is also possible. That is, the degree of influence might be “go South” rather than “go to

the Research Triangle.” In fact, it is impossible to tell if the variation in the average number of

laboratories between the county categories is due to generalization and interpretation resulting in

28

decisions to locate in places like other firms, rather than due to variations in urbanization economies.

The theory of decision-making and the urn models are able to incorporate such realistic

complexities but the data requirements are quite high. While stronger inferences could be made and

more subtle models evaluated with longitudinal data, when an adequate sampling frame does not exist

(as in much research on organizations), accounting for unknown and changing biases in data

collection may outweigh the benefits of the additional data. An apparent change in aggregate

behavior may be real or it may be an artifact of a changing data collection protocols.

The model and findings address a question about the formation of new centers in the

contemporary era posed by urbanists over a half century ago. Decades of improvements in

communication and transportation technologies and the continuing economic restructuring from

goods production to information processing have led to a reduction in the operational constraints on

location. Yet, despite the well-founded, but contradictory, predictions of metropolitanism and

dispersion to the contrary, new spatial agglomerations of employment characterized by co-location

without connection have formed. A common strategy of explanation has been to postulate new forms

of functional interdependence or new resource needs. The simple model of social influence presented

here and supported by the analysis of a large dataset widens the discussion of the mechanisms

responsible for the formation of industrial spaces and thus of renewal in large-scale social

organization.

29

Appendix: Derivation of parameters from moment equations

Panel A:α

( )( )( ) ( )( ) ( )( ) ( )

( ) ( )( )

( )( ) )E(x + E(x) - V(x) n

)E(x - V(x)E(x) - )nE(x=

)E(x + nE(x) - nV(x))E(x - V(x) - nE(x)E(x)=

)E(x - V(x) - nE(x)E(x)=)E(x + nE(x) - nV(x)

)E(x - E(x) + V(x) - E(x) - nE(x)E(x)=)E(x + )nE(x - )nE(x + nE(x) - nV(x)

E(x))E(x + E(x) - V(x) - 1) - (n)E(x=1)-(n)E(x - )nE(x + nE(x) - nV(x)

1) - (n )E(x + 1) - (n )E(x=E(x) )E(x + E(x) - V(x) + )nE(x + nE(x) - nV(x)

1) + 1)(-(n )E(x= E(x) + n)E(x + E(x) - V(x)

E(x)+ n1) +( 1)-(n )E(x=)E(x+E(x)-V(x)

)E(x-E(x)+E(x)+n

1)+1)(-(n)E(x=V(x)

E(x)-1+E(x)+n

1)E(x)+1)(-(n*E(x)=V(x)

E(x)-1+

E(x)E(x)+n

1)+1)(-(n*E(x)=V(x)

E(x)-1+1+

E(x)n

1)+1)(-(n*E(x)=V(x)

E(x)-1+1+-

E(x)n+

1)+1)(-(n*E(x)=V(x)

E(x)-1+1++1)+1)(-(n*E(x)=V(x)

+n-1+

1++1)+1)(-(n*

+n=V(x)

2

22

2

2

22

2222

2222

2222

22

22

22

α

α

α

α

αα

αα

αα

αα

αα

αα

αα

αα

ααα

α

βαα

βαα

βαα

βαα

⎥⎦

⎤⎢⎣

⎡

⎥⎥⎥⎥

⎦

⎤

⎢⎢⎢⎢

⎣

⎡

⎥⎥⎥⎥

⎦

⎤

⎢⎢⎢⎢

⎣

⎡

⎥⎥⎥⎥⎥

⎦

⎤

⎢⎢⎢⎢⎢

⎣

⎡

⎥⎦

⎤⎢⎣

⎡

⎥⎦

⎤⎢⎣

⎡

⎥⎦

⎤⎢⎣

⎡

30

Panel B: ß

ααβ

αβα

αβαβα

α

- E(x)n =

E(x)n = +

n = )+E(x)( +

n = E(x)

31

Bibliography Almeida, Paul and Bruce Kogut. 1997. “The Exploration of Technological Diversity and the

Geographic Localization of Innovation.” Small Business Economics 9: 21-31. Anderson, Benedict R. 1991. Imagined Communities: Reflections on the Origin and Spread of

Nationalism. London: Verso. Angel, David P. 2002. “Inter-firm Collaboration and Technology Development Partnerships Within

U.S. Manufacturing Industries.” Regional Studies 36: 333-344. Anselin, Luc, Attila Varga, and Zoltan Acs. 1997. “Local Geographic Spillovers between University

Research and High Technology Innovations.” Journal of Urban Economics 42: 422-48. Appold, Stephen J. 2004. “Research Parks and the Location of Industrial Research Laboratories: an

Analysis of the Effectiveness of a Policy Intervention.” Research Policy 33: 225-43. Appold, Stephen J. 1998. “Labor Market Imperfections and the Agglomeration of Firms: Evidence

from the U.S. Semiconductor Industry's Emergent Period.” Environment and Planning A 30: 439-462.

Arrow, Kenneth J. 1973. “Higher Education As a Filter.” Journal of Public Economics 2:193-216. Arthur, W. Brian. 1994. Increasing Returns and Path Dependence in the Economy. Ann Arbor:

University of Michigan Press. Barron, David N. 1992. “The Analysis of Count Data: Overdispersion and Autocorrelation.”

Sociological Methodology 22: 179-220. Beale, Calvin L. and Glenn V. Fuguitt 1978. “The New Pattern of Nonmetropolitan Population

Change” Pages 157-177 in Karl Taeber, Larry Bumpass and James Sweet, eds., Social Demography. New York: Academic Press.

Bell, Daniel. 1973. The Coming of Post-Industrial Society: A venture in social forecasting. New

York: Basic Books. Berger, Peter L. 1963. Invitation to Sociology: A Humanistic Perspective. Garden City: Doubleday. Bondonio, Daniele and John Engberg. 2000. “Enterprise zones and local employment: evidence from

the states’ programs.” Regional Science and Urban Economics 30: 519-549. Buchanan, James M., Robert D. Tollison, and Gordon Tullock. 1980. Toward a Theory of the Rent-

Seeking Society. College Station: Texas A&M Press. Bush, R.R. and F. Mosteller 1955. Stochastic Models for Learning. New York: John Wiley and

Sons. Cameron, A. Colin and Pravin K. Trivedi. 1986. “Econometric Models Based on Count Data:

Comparisons and Applications of Some Estimators and Tests.” Journal of Applied Econometrics 1: 29-53

Carlino, Gerald and Edwin Mills. 1987. “The Determinants of County Growth.” Journal of

Regional Science 27:39-54.

32

Castells, Manuel. 1996. The Rise of the Network Society. Cambridge: Blackwell Publishers. Chinitz, Benjamin. 1961. “Contrasts in Agglomeration: New York and Pittsburgh.” American

Economic Review 51:279-289. Chou, Y. 1975. Statistical Analysis. New York: Holt, Rinehart and Winston. Coleman, James S. 1964. Introduction to Mathematical Sociology. Glencoe: Free Press. Cowell, Frank.A. 1995. Measuring Inequality: Techniques for the Social Sciences. London:

Prentice-Hall. Crane, Diana. 1972. Invisible Colleges: Diffusion of Knowledge in Scientific Communities. Chicago:

University of Chicago Press. D'Agostino, Ralph B. and Michael A. Stephens. 1986. Goodness-of-Fit Techniques. New York:

Marcel Dekker. Diggle, Peter .J. 1983. Statistical Analysis of Spatial Point Patterns. New York: Academic Press. DiMaggio, Paul J. and Walter W. Powell. 1983. “The Iron Cage Revisited: Institutional

Isomorphism and Collective Rationality in Organizational Fields.” American Sociological Review 48:147-160.

Dorst, John D. 1989. The Written Suburb: An American Site, An Ethnographic Dilemma.

Philadelphia: University of Pennsylvania Press. Eco, Umberto. 1976. A Theory of Semiotics. Bloomington: Indiana University Press. Eisinger, Peter K. 1988. The Rise of the Entrepreneurial State: State and Local Economic

Development Policy in the United States. Madison: The University of Wisconsin Press. Etzkowitz, Henry and Loet Leydesdorff. 1997. Universities and the Global Knowledge Economy: A

Triple Helix of University-industry-government Relations. London: Pinter. Everitt, Brian S. and D.J. Hand 1981. Finite Mixture Distributions. London: Chapman and Hall. Feldman, Maryann P. 1994. The Geography of Innovation. Dordrecht: Kluwer Academic. Feller, Irwin. “The American University System as a Performer of Basic and Applied Research.”

Pages 65-101 in Lewis M. Branscomb, Fumio Kodama, and Richard Florida, eds., Industrializing Knowledge: University-industry Linkages in Japan and the United States. Cambridge, Ma. : MIT Press, 1999.

Firey, Walter I. 1947. Land Use in Central Boston. New York: Greenwood Press. Freeman, Chris and Luc Soete. 1997. The Economics of Industrial Innovation. London: Pinter. Friedland, William H. 1964. “For a Sociological Concept of Charisma.” Social Forces 43: 18-26. Frobel, Folker, Jurgen Heinrichs, and Otto Kreye. 1980. The New International Division of Labor:

Structural Unemployment in Industrialized Countries and Industrialization in Developing Countries. Cambridge: Cambridge University Press.

Fujita, Masahisa and Jacques-Francois Thisse. 2000. “The Formation of Economic Agglomerations:

33

Old Problems and New Perspectives.” Pages 3-73 in Jean-Marie Huriot and Jacques-Francois Thisse, eds., Economics of Cities: Theoretical perspectives. Cambridge: Cambridge University Press.

Gereffi, Gary and Miguel Korzeniewicz. 1994. Commodity Chains and Global Capitalism.

Westport: Greenwood Press. Gertler, Merc. 1995. “‘Being There’: Proximity, Organization, and Culture in the Development and

Adoption of Advanced Manufacturing Technologies.” Economic Geography 71: 1-26. Getis, Arthur and Brian Boots 1978. Models of Spatial Processes: An Approach to the Study of

Point, Line and Area Patterns. New York: Cambridge University Press. Glasmeier, Amy. 1988. "Factors Governing the Development of High Tech Industry

Agglomerations: A tale of three cities." Regional Studies 22:287-301. Hawley, Amos H. 1963. “Community Power and Urban Renewal Success.” American Journal of

Sociology 68:422-431. Hedström, Peter. 1998. “Rational Imitation.” Pages 306-327 in Peter Hedström and Richard

Swedberg, eds., Social Mechanisms: An Analytical Approach to Social Theory. New York: Cambridge University Press.

Hedström, Peter and Richard Swedberg. 1998. Social Mechanisms: An Analytical Approach to

Social Theory. New York: Cambridge University Press. Herzog jr., Henry W., Alan M. Schlottman and Donald Johnson. 1986. “High Technology Jobs and

Worker Mobility.” Journal of Regional Science 26:445-459. Hoover, Edgar M., and Giarratani, Frank. 1984. An introduction to regional economics, 3rd ed. New

York: Alfred A. Knopf. Howells, Jeremy. 1999. “Regional systems of innovation?” Pages 67-93 in Daniele Archibugi,

Jeremy Howells and Jonathan Michie, eds., Innovation Policy in a Global Economy. Cambridge: Cambridge University Press.

Hunter, Albert. 1974. Symbolic Communities: the Persistence and Change of Chicago's Local

Communities. Chicago: University of Chicago Press. Industrial Research Laboratories of the United States 1985. Jacques Cattell Press, R. R. Bowker

Co,; New York. 19th edition. Jaffe, Adam B., Manuel Trajtenberg, and Rebecca Henderson. 1993. “Geographic Localization of

Knowledge Spillovers as Evidenced by Patent Citations.” Quarterly Journal of Economics 108: 577-98

Johnson, Norman L. and Samuel Kotz 1977. Urn Models and their Application. New York: John Wiley and Sons.

Johnson, Norman L. and Samuel Kotz 1969. Discrete Distributions. Boston: Houghton-Mifflin Co. Jones, L.V., G. Lindzey and P.E. Coggeshall, eds. 1982. An Assessment of Research-Doctorate

Programs in the United States. Washington DC: National Academy Press. Kaplan, David A., 1999. The Silicon Boys and Their Valley of Dreams. New York: William Morrow.

34

Kasarda, John D. and Michael Irwin. 1991. “National Business Cycles and Community Competition for Jobs”. Social Forces 733-761.

Kleinknecht, Alfred and Tom P. Poot. 1992. “Do Regions Matter for R&D?” Regional Studies 26:

221-232. Lichtenberg, Robert M. 1960. One-tenth of a Nation: National Forces in the Economic Growth of

the New York Region. Cambridge: Harvard University Press. Logan, John R., Rachel Bridges Whaley, and Kyle Crowder. 1997. “The Character and

Consequences of Growth Regimes: An Assessment of 20 Years of Research.” Urban Affairs Review 32: 603-630.

Love, James H. and Stephen Roper. 1999. “The Determinants of Innovation: R&D, Technology

Transfer and Networking Effects.” Review of Industrial Organization 15: 43-64. Luce, R.D. 1959. Individual Choice Behavior: A Theoretical Analysis. New York: John Wiley. Malecki, Edward J. 1997. Technology and Economic Development: The dynamics of local, regional,

and national competitiveness. Harlow: Longman. March, James G. and Johan P. Olsen. 1976. Ambiguity and Choice in Organizations. Bergen:

Universitetforlaget. Martin, Philip and Jonas Widgren. 1996. “International Migration: A Global Challenge.” Population

Bulletin 51 (1): 1-40. Mills, Edwin S. and Bruce W. Hamilton. 1994. Urban Economics. Glenview: Scott, Foresman and

Company. 5th ed. Miner, Anne S. and Pamela R. Haunschild. 1995. “Population Level Learning.” Research in

Organizational Behavior 17:115-166 Molotch, Harvey. 1976. “The City as a Growth Machine: Toward a Political Economy of Place.”

American Journal of Sociology 82: 309-332. Moriarity, Barry M. n.d. “Research Triangle Park: 1956-1985.” Mimeo. Department of Geography,

University of North Carolina, Chapel Hill. Mowery, David C., Joanne E. Oxley, and Brian S. Silverman. 1996. “Strategic Alliances and

Interfirm Knowledge Transfer.” Strategic Management Journal 17 (Winter Special Issue: Knowledge and the Firm) 77-91.

National Science Board. 2001. Science and Engineering Indicators, 2000. Washington, DC: U.S.

Government Printing Office. Oakey, Raymond P. 1984. High Technology Small Firms: Regional Development in Britain and the

United States. New York: St. Martins Press. Pascal, Anthony H. and John J. McCall. 1980. “Agglomeration Economies, Search Costs and

Industrial Location”. Journal of Urban Economics 8:383-388. Peterson, Paul. 1980. City Limits. Chicago: University of Chicago Press. Pielou, E.C. 1977. Mathematical Ecology. New York: John Wiley and Sons.

35

Powell, Walter W. 2001. “The Capitalist Firm in the Twenty-First Century: Emerging Patterns in

Western Enterprise.” Pages 33-68 in Paul DiMaggio, ed., The twenty-first-century firm: Changing economic organization in international perspective. Princeton: Princeton University Press.

Pred, Allan R. 1967. The Spatial Dynamics of U.S. Urban-industrial Growth, 1800-1914.

Cambridge: M.I.T. Press. Rigby, David L. and Jurgen Essletzbichler. 2002. “Agglomeration Economies and Productivity

Differences in U.S. Cities.” Journal of Economic Geography 2: 407-32. Rogers, Everett M. and Judith K. Larson. 1984. Silicon Valley Fever. New York: Basic Books. Rumelt, Richard P., Dan E. Schendel, and David J. Teece. 1994. Fundamental Issues in Strategy: a

Research Agenda. Boston: Harvard Business School Press. Saxenian, AnnaLee. 1994. Regional Advantage: Culture and Competition in Silicon Valley and

Route 128. Cambridge: Harvard University Press. Schein, Edgar H. 1999. The Corporate Culture Survival Guide: Sense and Nonsense about Culture

Change. San Francisco: Jossey-Bass. Schelling, Thomas C. 1978. Micromotives and Macrobehavior. New York: Norton. Schmenner, Roger. 1982 Making Business Location Decisions. Englewood Cliffs: Prentice-Hall. Scott, Allen J. 1988. New Industrial Spaces: Flexible Production Organization and Regional

Development in North America and Western Europe. London: Pion. Sebeok, Thomas A. 2001. Signs: An Introduction to Semiotics. Toronto: University of Toronto

Press, 2nd ed. Shils, Edward Albert. 1975. Center and Periphery: Essays in Macrosociology. Chicago: University

of Chicago Press . Smith, Philip. 1999. “The Elementary Forms of Place and Their Transformations: A Durkheimian

Model.” Qualitative Sociology 22: 13-36. Smith, Valdemar, Anni-Lene Broberg, and Jesper Overgaard. 2002. “Does Location Matter for

Firms' R&D Behaviour? Empirical Evidence for Danish Firms.” Regional Studies 36: 825-32.

Spence, A. Michael. 1974. Market Signaling: Information Transfer in Hiring and Related Processes.

Cambridge: Harvard University Press. Suttles, Gerald. 1984. “The Cumulative Texture of Local Urban Culture”. American Journal of

Sociology 90:283-304. Tarde, Gabriel. 1969. On Communication and Social Influence. T.N. Clark, ed., Chicago:

University of Chicago Press. Theil, Henri. 1967. Economics and Information Theory. New York: American Elsevier Co. Tolbert, Pamela S. and Lynne G. Zucker. 1983. “Institutional Sources of Changes in the Formal

Structure of Organizations: The Diffusion of Civil Service Reform, 1880-1935.

36

Administrative Science Quarterly 28:22-39. Vernon, Raymond. 1960. Metropolis 1985. Cambridge: Harvard Univ. Press. Webber, Melvin. 1967. “Order in Diversity: Community without Propinquity.” Pages 29-55 in

Lowdon Wingo, ed., Cities and Space. Baltimore: Johns Hopkins University Press. Weber, Max. 1978. Economy and Society. Berkeley: University of California Press. Xekalaki, Evdokia. 1983. “The Univariate Generalized Waring Distribution in Relation to Accident

Theory: Proneness, Spells, or Contagion?” Biometrics 39: 887-895. Zablocki, Benjamin. 1993. “Rational Models of Charismatic Influence.” Pages 73-91 in Aage B.

Sørensen and Seymour Spilerman, eds., Social Theory and Social Policy: Essays in Honor of James S. Coleman. Westport: Praeger.

Zukin, Sharon. 1991. Landscapes of Power: From Detroit to Disney World. Berkeley: University of

California Press.

37

Notes 1 The industry share of total US research and development reached a temporary plateau in the mid-1980s while the growth in the universities’ share of industrial research and development had not yet begun the acceleration that lasted for another decade. According to some industrial recruiters there was a rash of laboratory sitings prior to 1985 and a slowdown afterward. In addition, the relationship between private research laboratories and universities during the late 1980s and early 1990s may have become more direct. The post-war geographic changes through the mid-1980s arguably set the stage for the changes in the nature of the firm and the reorganization of knowledge-intensive production that have occurred since the mid-1980s (Powell, 2001).

2 Disappointingly, in attempting to model the emergence of agglomerations many accounts treat these newer locations as if they were the birthplaces of particular industries. Up through 1985, at least, they were not. The semiconductor industry, for example, did not arrive in Silicon Valley until almost 10 years after the invention of the transistor, by which time several present industry giants, such as Motorola and Texas Instruments, were already well-established in their own separate new places.

3 I will refer to the two distributions almost interchangeably. Geographers have developed the theory of spatial point processes using the Poisson family of distributions. Coleman (1964) also develops much of his argument about social influence using the Poisson family of distributions. The Polya-Eggenberger family of distributions which includes the binomial distribution as a special case more closely corresponds to the urn models discussed below and the theory of decision-making. Johnson and Kotz (1969) and Chou (1975) discuss the relationship among several discrete distributions and the conditions under which the Poisson and binomial distributions are nearly equivalent. When the probability of any particular option being chosen is small and the number of decisions, or trials, is large (as they are in the case considered here), the binomial distribution closely follows the Poisson distribution (Chou, 1975:186).

4 The large numbers of “trials” implies that Chi-squared based measures will be statistically significant even when the deviations are not substantively important.

5 The categorizations, modifications of those suggested by Calvin Beale, based on the population of the largest city and inclusion in or adjacency to a metropolitan area are as follows: Code County Type Population of largest place N 0 Large metro core counties 1,000,000 plus (49) 1 Outlying counties of large metro areas (132) 2 Medium metro area counties 250,000 - 1,000,000 (258) 3 Small metro area counties less than 250,000 (186) 4 Non-metro counties adjacent to metro areas (967) 6 Few, if any, “lab-rich” counties are not bordered by other counties containing labs. Counties containing labs are clearly agglomerated themselves. For this reason some agglomeration analysis was performed using uniform 50 mile squares as the “neighborhood”. The results did not differ substantively from the county level analysis, however, and are not reported here.

7 The Theil statistic is an “information” or “entropy” statistic. The overall Theil index is given by: Theil index = E(X/XM)*log(X/XM)

The measure can be decomposed into a component representing variation between county types and variation in the number of labs among counties of the same type:

Theil index = E(XK/XM)*log(XK/XM) + (X/XK)*log(X/XK) Where X is the number of labs in a county; XK is the mean number of labs within counties in a given group; and XM is the overall mean number of labs in a county. The first term constitutes the between group inequality; the second the total within group inequality. Cowell (1995, Chapters 2 and 3)

38

discusses these issues.

8 A test of specification error was performed by estimating the influence model on data known to contain heterogeneity. The cumulative probability distribution for a model, estimated on the total combined set of counties, deviates substantially from the data. This result indicates that the model is not merely capturing unmeasured environmental heterogeneity. An Appendix figure illustrates the results. As another test of mis-specification, the influence model was estimated for simulated data containing no social influence (shown in Table 4). The results are consistent with the data. It should be noted that statistical distributions with characteristics different from those used here have been most successfully used in modeling unobserved heterogeneity.

Number of research laboratories by county population(log-log scale)

1

10

100

1000

100 1,000 10,000 100,000 1,000,000 10,000,000 100,000,000

County population

Num

ber o

f lab

orat

orie

s

Figure 1

Laboratories in fringe counties of large metropolitan areas

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19

Number of laboratories

Cum

ulat

ive

prop

ortio

n of

cou

ntie

s

Polya-Eggenberger Actual Binomial

Figure 2

Laboratories in large firms that are located inoutlying counties of large metropolitan areas

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19


Cum

ulat

ive

prop

ortio

n of

cou

ntie

s

Actual Polya-Eggenberger Binomial

Figure 3

Laboratories in all types of counties

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19

Nunmber of laboratories

Cum

ulat

ive

prop

ortio

n of

cou

ntie

s

Polya-Eggenberger Actual Binomial

Appendix Figure 1

Table 1 Distribution of laboratories by type of county and research field

Number Percent

Core counties of large metropolitan areas 4295 41.3%Fringe counties of large metropolitan areas 2145 20.6%Counties in medium-sized metropolitan areas 2338 22.5%Counties in small metropolitan areas 582 5.6%Counties that are adjacent to metropolitan areas 677 6.5%Counties that are not adjacent to metropolitan areas 380 3.7%

Counties in large metropolitan areas 62.0%

Counties in metropolitan areas 90.1%Counties in non-metropolitan areas 10.2%

Total 10393 100.2%

Table 2 Spatial inequality in laboratory location

Theil decompositions of spatial inequality Total

Within county types

Between different county types

Percent of total spatial inequality within county types

Percent of total spatial inequality between different county types

Laboratories in all counties 2.6101 0.8539 1.7562 0.3272 0.6728

Laboratories in metropolitan counties 1.1753 0.9045 0.2708 0.7696 0.2304

Table 3 Actual and Predicted Marginal and Cumulative Probability Distributions for Laboratories in Fringe Counties of Large Metropolitan Areas

Partial calculations (observed - predicted) forKolmogorov-Smirnov goodness-of-fit test

Number of Actual Polya-Eggenberger Binomial Polya-Eggenberger Binomial laboratories Marginal Cumulative Marginal Cumulative Marginal Cumulative distribution distrbution

0 0.24428 0.24428 0.30966 0.30966 0.00000 0.00000 -0.06538 0.244271 0.13741 0.38168 0.08810 0.39776 0.00000 0.00000 -0.01608 0.381682 0.06107 0.44275 0.05582 0.45358 0.00001 0.00001 -0.01083 0.442743 0.05344 0.49618 0.04186 0.49544 0.00006 0.00008 0.00074 0.496114 0.07634 0.57252 0.03383 0.52927 0.00025 0.00033 0.04325 0.572195 0.03817 0.61069 0.02852 0.55778 0.00082 0.00115 0.05290 0.609546 0.03053 0.64122 0.02470 0.58248 0.00223 0.00338 0.05874 0.637847 0.03053 0.67176 0.02181 0.60429 0.00518 0.00856 0.06746 0.663198 0.01527 0.68702 0.01953 0.62382 0.01054 0.01910 0.06320 0.667929 0.01527 0.70229 0.01767 0.64149 0.01904 0.03815 0.06080 0.66414

10 0.00763 0.70992 0.01613 0.65762 0.03097 0.06911 0.05230 0.6408111 0.00763 0.71756 0.01483 0.67245 0.04577 0.11488 0.04511 0.6026712 0.00000 0.71756 0.01371 0.68615 0.06201 0.17690 0.03140 0.5406613 0.00763 0.72519 0.01273 0.69888 0.07755 0.25445 0.02631 0.4707414 0.00763 0.73283 0.01187 0.71076 0.09004 0.34449 0.02207 0.3883415 0.01527 0.74809 0.01112 0.72187 0.09757 0.44205 0.02622 0.3060416 0.01527 0.76336 0.01044 0.73231 0.09910 0.54115 0.03105 0.2222117 0.02290 0.78626 0.00983 0.74214 0.09473 0.63589 0.04412 0.1503718 0.00000 0.78626 0.00927 0.75141 0.08552 0.72140 0.03485 0.0648619 0.00000 0.78626 0.00877 0.76018 0.07313 0.79453 0.02608 -0.0082720 0.00000 0.78626 0.00831 0.76850 0.05940 0.85393 0.01776 -0.0676721 0.01527 0.80153 0.00789 0.77639 0.04595 0.89987 0.02514 -0.0983522 0.00000 0.80153 0.00750 0.78389 0.03392 0.93380 0.01764 -0.1322723 0.00000 0.80153 0.00715 0.79104 0.02395 0.95775 0.01049 -0.1562224 0.00763 0.80916 0.00681 0.79785 0.01621 0.97396 0.01131 -0.1648025 0.00000 0.80916 0.00650 0.80436 0.01053 0.98449 0.00481 -0.17533

[data and calculations omitted for clarity]

Kolmogorov-Smirnov goodness-of-fit statistic: 0.06746 0.66792(maximum absolute difference)

Table 4 Summary of estimated parameters

Type of countyNumber of counties


Mean number of labs per county

Variance in number of labs per county

Variance / mean ratio Alpha Beta

Imitation parameter

Baseline probability of a particular county being chosen

Imitation effect / baseline effect

Kolmogorov-Smirnov test for goodness-of-fit:Polya-Eggenberger distribution #

Kolmogorov-Smirnov test for goodness-of-fit: Binomial distribution #

Core counties of large 49 4295 87.653 10185.106 116.198 0.746 88.183 0.011245 0.008389 1.340 0.1388 0.5952 metropolitan areasFringe counties of large 132 2145 16.250 921.807 56.727 0.290 185.900 0.005371 0.001555 3.454 0.0675 0.6679 metropolitan areasCounties in medium-sized 258 2322 9.000 352.412 39.157 0.235 272.340 0.003669 0.000861 4.259 0.0823 0.5474 metropolitan areasCounties in small 186 576 3.097 27.666 8.934 0.390 1315.030 0.000760 0.000296 2.565 0.0277 0.3687 metropolitan areasCounties that are adjacent 967 677 0.700 3.515 5.020 0.174 2597.609 0.000385 0.000067 5.745 0.0146 0.2439 to metropolitan areasCounties that are not 1473 378 0.257 0.804 3.132 0.120 4898.850 0.000204 0.000025 8.312 0.0138 0.0842 adjacent to metropolitan areas

Total 3065 10393 3.328 353.674 106.272 0.031 98.184 0.010182 0.000318 31.968 0.1537 0.6764

Simulated location pattern (no imitation) 3.409 3.408 1.000 -3.417E+19 -1.047E+23 -- * -- *

* Cannot be calculated# Although the critical points of the distribution are not exactly defined, the Kolmogorov-Smirnov statistics for the Polya-Eggenberger distribution are within the five-percent significance level (and in most cases 20-percent level) for each of the types of counties. The Polya-eggenberger distribution does not fit for the data aggrgated cross county types. The binomial distribution does not fit the data. Cramer-von Mises statistics have been calculated for each of the types of counties. The critical values are not defined for the Polya-Eggenberger distribution but the statistics are at least an order of magnitude smaller for that distribution than for the binomial distribution.

august 10, 2004 - university of north carolina at chapel...

Documents