agglomeration and productivity in china firm level evidence

Upload: muslich-abduh

Post on 06-Jul-2018

218 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/17/2019 Agglomeration and Productivity in China Firm Level Evidence

    1/17

    Agglomeration and productivity in China: Firm level evidence☆

    Cui HU a,⁎, Zhaoyuan XU b, Naomitsu YASHIRO c

    a School of International Trade and Economics, Central University of Finance and Economics (CUFE), 39 South College Road, Haidian District, Beijing 100081, PR Chinab Industrial Economy Research Department, Development Research Center of the State Council, R.216, No. 225, Chaoyangmen Nei Dajie, Beijing 100010, PR Chinac Organization for Economic Cooperation and Development (OECD), Economics Department, 2 Rue André Pascal, Paris 75016, France

    a r t i c l e i n f o a b s t r a c t

     Article history:Received 25 August 2013

    Received in revised form 2 January 2015

    Accepted 3 January 2015

    Available online 9 January 2015

    This paper conducts an in-depth evaluation on the role of industrial agglomeration in produc-

    tivity growth of China's industrial sector by exploiting large dataset of manufacturing  rms

    active in 176 three-digit industries and in 2860 counties. We also complement our analysis

    with the 2004 Census data to capture the agglomeration of small  rms. Unlike previous studies

    that often focused on specic industries, we assess the impact of agglomeration in a compre-

    hensive range of industries and extend the scope of analysis to upstream industries as well.

    Moreover, we explore how the ownership of Chinese   rms shapes their ability to benet

    from agglomeration effect as well as to act as the source of externality. We  nd that congestion

    and ercer competition offset the benets of agglomeration for  rms operating within agglom-

    erated regions. On the other hand, a co-location of large  rms contributes signicantly to pro-

    ductivity. We also  nd a more important contribution from the agglomeration of upstream

    industries than from that of the same industry. Private enterprises are the primary source of 

    agglomeration effects especially in upstream industries, whereas their productivity is boosted

    most by the agglomeration of other private enterprises. We reckon that industrial agglomera-tion contributed up to 14% of the productivity growth in China's industrial sector between

    2000 and 2007.

    © 2015 Elsevier Inc. All rights reserved.

     JEL classi cation:D24

    R12

    Keywords:Agglomeration

    Productivity

    China

    1. Introduction

    Sustainability of China's competitiveness as manufacturing base is recently questioned as enterprises in China are facing rise in

    labor and resource costs, currency appreciation and tighter constraints associated with environmental protection. However, if the

    competitiveness of Chinese industry is mainly founded on the productivity improvement as opposed to cheaper labor cost, China is

    likely to retain its position as the world's major exporter of manufacturing goods. A likely source of such productivity growth is the

    industrial agglomeration which increased consistently since the mid-1990s, driven by the globalization of the Chinese economy

    (Lu & Tao, 2006). The geographic concentration of industrial activities was associated with a dramatic increase in number of  rms,namely of the clustering of interconnected small  rms (Long & Zhang, 2012). Case studies reported that such clustering of small

    and medium enterprises (SMEs) reduced the technological barrier to entry and promoted quality upgrading of Chinese  rms

    (Fleisher, Hu, McGuire, & Zhang, 2010; Huang, Zhang, & Zhu, 2008). However, it is less clear whether the contribution of agglomera-

    tion can be generalized beyond the ndings on specic industries or regions.

    China Economic Review 33 (2015) 50–66

    ☆   We thank China Humanities and Social Sciences Youth Foundation of Education Ministry (Project code: 13YJC79004) and the China National Natural Science

    Foundation (Project code: 71403302/G0304 & 71173058/G0301) for funding this paper.

    ⁎   Corresponding author. Tel.: +86 13501193242 (mobile).

    E-mail address: [email protected] (C. Hu).

    http://dx.doi.org/10.1016/j.chieco.2015.01.0011043-951X/© 2015 Elsevier Inc. All rights reserved.

    Contents lists available at ScienceDirect

    China Economic Review

    http://dx.doi.org/10.1016/j.chieco.2015.01.001http://dx.doi.org/10.1016/j.chieco.2015.01.001http://dx.doi.org/10.1016/j.chieco.2015.01.001mailto:[email protected]://dx.doi.org/10.1016/j.chieco.2015.01.001http://www.sciencedirect.com/science/journal/1043951Xhttp://www.sciencedirect.com/science/journal/1043951Xhttp://dx.doi.org/10.1016/j.chieco.2015.01.001mailto:[email protected]://dx.doi.org/10.1016/j.chieco.2015.01.001http://crossmark.crossref.org/dialog/?doi=10.1016/j.chieco.2015.01.001&domain=pdf

  • 8/17/2019 Agglomeration and Productivity in China Firm Level Evidence

    2/17

    Industrial agglomeration is often considered to generate positive externalities such as knowledge spillover, more ef cient

    input sharing and richer labor pooling (Marshall, 1890). On the other hand, it can be associated with negative effects on pro-

    ductivity due to congestion and  ercer competition. Comprehensive assessments of the net effects of agglomeration in China

    are still limited and their  ndings remain inconclusive. For instance, Batisse (2002) reported a negative relationship between

    the industrial specialization of a province and its growth in value-added, whereas  Fan and Scott (2003)  reported a positive

    relationship between an industrial concentration and the province-level productivity. Empirical evidences at  rm-level are

    even more limited and are often conned to specic industries. An example is  Lin, Li, and Yang (2011)  which studied the

    impact of agglomeration on productivity of textile industry   rms and reported that initial positive externalities are

    overwhelmed by diseconomies as the agglomeration intensies.

    This paper provides a comprehensive assessment of agglomeration effects on productivity of Chinese  rms by exploiting a

    large survey data of manufacturing  rms across 176 three-digit industries spanning between 2000 and 2007. Because the annual

    survey data covers only the  rms with sales of 5 million RMB or above, observing agglomeration effects solely based on those

    data ignores the clustering of smaller   rms which are found to play an important role in upgrading of Chinese industries.

    Therefore, we complement our base analysis by incorporating the agglomeration of smaller  rms using the 2004 Census data

    that covers all the manufacturing  rms.

    Our analysis also comprises three notable features. First, we use the number and the average output size of   rms in a spatial

    unit to capture the size and the   quality   of an agglomeration. The number of   rms is highly relevant to many aspects of Marshallian externality. For instance, knowledge spillover within an agglomeration is proportional to the number of  rms,

    when each  rm engages in some types of knowledge creation and the nearby  rms all benet from its outcome (Henderson,

    2003). Also, larger number of   rms in a region increases the scale and depth of inputs demand, allowing a more ef cient

    input sharing. Finally, it enhances the ef ciency of matching between   rms and workers via deeper labor pooling. On the

    other hand, the average size of  rms in an agglomeration not only captures the size of input demand and labor pooling for a

    given number of  rms, but also the co-location of productive enterprises. Firm size has often been seen as a proxy of productivity

    and several studies have indeed reported signicant externalities arising from the co-location of large   rms (Greenstone,

    Hornbeck & Moretti,2010; Li, Lu, & Wu, 2012).

    Second, we incorporate explicitly the agglomeration of upstream industries. The presence of rich supporting industries that pro-

    vide high quality intermediate goods is essential for a country's industrial competitiveness. The agglomeration of upstream industries

    increases the varieties of inputs supplied, and by allowing  rms to specialize in their core activities while outsourcing some of in-

    house production (Broda & Weinstein, 2006; Holmes, 1999). Given that sharing of specialized inputs has always been considered

    as an essential element of Marshallian externality, it is somewhat surprising that previous studies did not take into account the

    agglomeration of upstream industries.

    Finally, we explore how a rm's ownership structure shapes its ability to benet from agglomeration effects as well as its ability to

    act as the source of agglomeration effects. The difference in corporate behavior and culture can dene the extent to which a rm ben-

    ets from an agglomeration (Saxenian, 1994). Chineserms differ substantially in their performances and corporate behaviors across

    their ownership ( Jefferson, Rawski, Wang, & Zheng, 2000). State-owned enterprises (henceforth, SOEs) enjoy privileges in adminis-

    trative treatment, good access to  nance and sometimes monopolistic power, but can be driven by policy rather than prot. Private

    enterprises (henceforth, PEs) are managed by vibrant entrepreneurs and are highly prot-oriented, but often face barriers to market

    entry and dif culty in accessing credit. Foreign-invested enterprises (henceforth, FIEs) possess advanced technologies but often en-

    gage in low value-added activities such as the assembly and re-export of imported intermediate inputs, referred as the processing

    trade. Such stark differences across ownerships may not only shape the ability of Chinese rms to absorb knowledge spillovers but

    also shape their ability to act as the source of spillovers.

    We  nd positive and sizable contributions by the agglomeration of upstream industries to the total factor productivity

    (TFP) of Chinese   rms. Our estimates imply that doubling the number of   rms in upstream industries raises the TFP by

    3.2%. On the other hand, agglomeration effects within the same industry are more complex: increase in number of  rms

    within the same county is seen to suppress TFP. This somewhat surprising  nding suggests that severe congestion and in-

    tense competition associated with agglomeration are offsetting the Marshallian externality. The increase in number of 

    rms in a moderate distance is associated with smaller negative impact or even positive impact to productivity. We also

    nd that increase in average  rm size contributes importantly to higher TFP. This suggests that the qualitative aspect of 

    an agglomeration, namely the co-location of large  rms possessing rich knowledge stock denes the benets of agglomer-

    ation. When looking across different ownerships, PEs seem to be the main recipient of both positive and negative agglom-

    eration effects but this becomes less clear when the agglomeration of smaller  rms is incorporated. However, we  nd that

    the agglomeration of PEs impacts the productivity of Chinese   rms more than that of SOEs or FIEs. Especially, PEs are the

    main sources of the sizable agglomeration effects from upstream industries. The agglomeration effects from PEs are self-

    reinforcing in a sense that the productivity of a PE is most enhanced by the agglomeration of PEs. Based on our estimated

    results, we reckon that industrial agglomeration contributed up to 14% of TFP growth in China's industrial sector during

    2000–2007.

    The next section provides a non-exhaustive review of works on industrial agglomeration, especially in context of China. Section 3

    describes the dataset used in our analysis, the method to construct indicators of agglomeration and to estimate TFP and our empirical

    model. Section 4 lays out our estimation results and discusses the prominent feature of agglomeration effects in China. Based on the

    results from Section 4, Section 5 computes the role of agglomeration in the productivity growth of China's industrial sector during thesample period. Section 6 concludes with policy implications.

    51C. Hu et al. / China Economic Review 33 (2015) 50–66

  • 8/17/2019 Agglomeration and Productivity in China Firm Level Evidence

    3/17

    2. Agglomeration and productivity —a brief review 

     2.1. An overview of studies on agglomeration

    Economic activity is geographically concentrated. Previous studies on why an agglomeration occurs and how it benets economic

    agents have been mainly based on the two perspectives (Rosenthal & Strange, 2004): the localization economies, which are the

    Marshallian externalities arising from a concentration of  rms in a same industry; and the urbanization economies, which arise

    from an increase in city size that enables cross-fertilization of idea among diverse economic activities ( Jacobs, 1969). While both in-

    dustry concentration and city size were found to be associated with higher productivity, when assessed together, empirical evidences

    tended to support the localization economies more than the urbanization economies (for example, Henderson, 2003; Li et al., 2012).

    While this paper is primarily oriented toward the localization perspective, it goes beyond the concept by incorporating explicitly the

    agglomeration of upstream industries.

    There are several channels through which localization economies manifest: rst, an agglomeration enables the sharing of knowl-

    edge and skills through formal and informal interactions among  rms or individuals (Marshall, 1890; Saxenian, 1994). Knowledge

    spillovers occur mostly within proximity. For instance, Jaffe, Trajtenberg, and Henderson (1993) found that patent citations are

    more likely to come from the same state or metropolitan area as the original patent. The shared knowledge may not be conned to

    advanced technologies but can also be management skills and business knowledge. For example,  Greenaway and Kneller (2008)

    reported that a larger number of exporting  rms in proximity promotes other  rms' export entry, suggesting that agglomeration of 

    experienced exporters can reduce the entry costs of exports through spillovers of knowledge on foreign markets. Second, a concen-

    tration of an industry enables the production of specialized intermediate inputs to attain the level that is suf cient to exploit scale

    economies (Marshall, 1890). This, in turn, allows  rms to outsource higher share of their intermediate inputs and specialize on the

    most protable activities (Holmes, 1999). Third, geographical concentration of  rms and workers reduces the search frictions

    between jobs and workers, allowing more ef cient matching. A deeper labor pooling reduces the risks that a worker remains

    unemployed and a rm cannotll its vacancies (Krugman & Venables, 1995). Finally, there is a self-reinforcing “home-market effect”

    that induces rms to locate near a large market, especially when an increasing return to scale and transportation costs is important

    (Davis & Weinstein, 1999).

    On the other hand, agglomeration can be associated with several diseconomies. A dense  rm location may generate congestion

    which increases business costs. Congestion may be especially severe if infrastructure is a bottleneck to economic activities. For in-

    stance, Lall, Shalizi, and Deichmann (2004) found that geographic concentration of the same industry lowers the productivity of 

    Indian rms, whereas access to markets through improvement in inter-regional infrastructure has favorable effect. Furthermore,

    the ercer competition in product and factor markets can suppress a  rm's mark-up and productivity through lower product prices

    and higher inputs costs. Especially, severe competition for workers with specialized skills can result in a disproportional increase in

    their wage compared to their productivity, at least in the short run. Therefore, whether the benet of agglomeration to productivity

    exceeds the diseconomies is an empirical issue.

    Various approaches to capture agglomerations exist, ranging from the Gini coef cients of geographical distribution of industrial

    output or employment to the EG index proposed by  Ellison and Glaeser (1997) which simultaneously controls for the share of an

    industry's employment within a region, the share of aggregate manufacturing employment within a region, and the market concen-

    tration of an industry. While as not as popular as the EG index, the number of  rms within a region has been employed by previous

    studies for its tractability and straight-forward implication. For example, Henderson (2003) assessed the localization economies in

    high-tech industries in the United States by observing the number of plants in the same industry. He claimedthat knowledge spillover

    within an agglomeration is proportional to the number of rms, when each rm engages in some types of knowledge creationand the

    nearby rms all benet from itsoutcome. The tractability of thenumber of rmsas a measure of agglomeration is especially benecial

    when assessing agglomeration effects across numerous industries or when incorporating some aspects of distance. For instance, the

    EG index does not consider the geographical proximity and distance of industrial agglomeration.

    Although the sharing of specialized input has been long considered as one important aspect of localization economies, previous

    studies seldom explored how an agglomeration of upstream industries affects the productivity of downstream rm. The agglomera-

    tionof upstream industries increases the varieties of inputs supplied, thereby contributing to the productivity of sourcingrms(Broda

    & Weinstein, 2006). It also allows rms to specialize in their core activities while outsourcing some of in-house production (Cainelli &

    Iacobucci, 2012; Holmes, 1999). Furthermore, stronger competition in upstream industries due to agglomeration can be associated

    with the supply of cheaper and better quality intermediate inputs. Consistent to such view, the FDI penetration in upstream industries

    is often found to improve the productivity of domestic rms (for example, Ito, Yashiro, Xu, Chen, & Wakasugi, 2012; Javorcik, 2004).

    We therefore extend our analytical framework to incorporate explicitly the agglomeration of upstream industries.

    A strand of research explored the effect of a co-location by largerms on productivity growth. The size of a rm, measured in out-

    put or employment, is often closely related to its productivity as well as to its intangible competitiveness such as good management

    practices (Bloom, Lemos, Sadun, Scur, & Van Reenen, 2014). Locating near largerms may thus provide opportunities to absorb spill-

    overs of advanced knowledge. For instance, Greenstone, Hornbeck, and Moretti (2010) found that the United Statesrms in counties

    which saw new openings of  “Million Dollar plants” realized 12% higher productivity growth after 5 years vis-à-vis the rms in other

    counties. More recently, Li et al. (2012) observed that Chinese  rms are more likely to become larger by locating with a number of 

    large  rms than with larger number of  rms. Therefore, agglomerations that include many large  rms may reward larger favorable

    effects on a 

    rm's productivity. On the other hand, considering that co-location with productive 

    rms is likely to be associated withstronger competition, the benets of knowledge spilloverscould be at least partially offset, resultingin a net impact that is not a priori

    52   C. Hu et al. / China Economic Review 33 (2015) 50–66

  • 8/17/2019 Agglomeration and Productivity in China Firm Level Evidence

    4/17

    clear-cut. For instance, Henderson (2003) reported that the average size of plants in the same industry and the same county does not

    have any signicant impact on theproductivityof US plants in high-techindustries. This paper exploreswhether the presence of large

    rms in an agglomeration has an impact of its own besides the size of agglomeration captured by the number of  rms.

     2.2. Studies on agglomerations in China

    The industrial agglomeration in China has increased consistently since the mid-1990s, driven by the globalization of the Chinese

    economy (Lu & Tao, 2006). Major exporting industries (such as textiles and electronics) as well as industries depending heavily onimported intermediate goods (such as machinery) concentrated in coastal areas that have better access to trade routes and advanced

    technology (Fujita & Hu, 2001). Long and Zhang (2012) observed that industrial concentration has been driven by the dramatic

    increase in number of rms in the regions rather than the increase in size of output perrm, namely the clustering of highly intercon-

    nected small  rms. Case studies reported the role of such clustering in reducing technological barriers and promoting quality

    upgrading. For example, Huang et al. (2008) studied the footwear industry in Wenzhou and claimed that clustering lowered the

    technical and capital barriers to entry by deeper division of complex production process, and brought a wide range of entrepreneurs

    in rural areas into the industry. Sonobe, Hu, and Otsuka (2004) observed that clustering of  rms producing light-volt electronic

    appliances in Wenzhou promoted the fast quality upgrading and introduction of new marking strategy. Fleisher et al. (2010) reported

    that rms in the Zhili Township children's garment cluster in Zhejian province have responded to the rigorous competition pressure

    by investing in design, branding and quality upgrading.

    More general empirical evidences on agglomeration effects in China are still limited and somewhat mixed. Early studies relied on

    aggregated data. For instance, Batisse (2002) found negative relationship between industrial specialization and province-level value-

    added growth, whereas Fan and Scott (2003) found strong positive relation between industrial concentration and province-levelproductivity. Ke (2010) reported a positive and causal relationship from an industrial agglomeration to city-level labor productivity.

    Evidences based on rm-level data are more recent and are even more limited. For example, Lin et al. (2011) explored the agglomer-

    ation effects on productivity of rms in textile industry and found an inverted-U type effect, which implies that net positive impact is

    more than offset by diseconomies after certain point. Yang, Lin, and Li (2013) reported that the agglomeration of production contrib-

    utes to productivity of electronicsrms while the agglomeration of R&D activitieshas negative impact. However, not only thescope of 

    those studies is conned to specic industries, they are based on the above-scale surveydata which does not capture the clustering of 

    smaller rms. Furthermore, to our knowledge, there are not yet studies that captured how the rapidly growing non-state sector is

    shaping the agglomeration effects in China. Overall, there is considerable room to be lled in order to understand the role of agglom-

    eration in China's industrial competitiveness.

    3. Empirical strategy 

     3.1. Data

     3.1.1. The Survey on above-scale manufacturing enterprisesThe dataset we use for computing the productivity of Chinese enterprises is the annual survey collected by China's National

    Bureau of Statistics, which covers all the state-owned enterprises (SOEs) and the non-SOEs with annual sales exceeding RMB

    5 million (the Survey of above-scale industrial enterprises, henceforth the Survey). We use this above-size  rm data for its

    availability at annual basis and its large coverage of China's industrial output. The data have been employed by numerous

    works to study the productivity growth of Chinese industries (such as  Brandt, Biesebroeck, & Zhang, 2012) or factors behind

    the productivity growth (such as  Yu, 2014). We use the sample spanning from 2000 to 2007 to compute the productivity of 

    Chinese  rms and the indicators of agglomeration. The number of observations for each year ranges from 162,885 (in 2000) to

    336,768 (in 2007).

    Quite a few observations in the Survey sample are plagued by missing information and misreporting. We therefore conduct a

    data-cleaning procedure following the recent studies such as Cai and Liu (2009) and Feenstra et al. (2014). This involves deleting

    the observations with the following characteristics: (1) key variables such as total industrial output, industrial added value, xed as-

    sets and employees have negative values; (2) the value of liquid assets or total xed assets are larger than that of total assets; (3) the

    number of employees are less than 10; and (4) rm age is negative. This process reduces our observations to 160,868 in 2000 and to

    335,616 in 2007, which stilladd up to more than 98% of original sample. It is shown in Table 1 that the share of retained observations is

    maintained throughout our sample period.

     Table 1

    Description of the Survey data set.

    Source: Authors' calculation.

    Year 2000 2001 2002 2003 2004 2005 2006 2007

    Total observations 162,885 171,240 181,557 196,222 276,474 271,835 301,961 336,768

    Observation retained 160,868 169,279 179,706 195,659 275,289 270,945 300,908 335,616

    Share of retained observations 98.8% 98.9% 99.0% 99.7% 99.6% 99.7% 99.7% 99.7%

    53C. Hu et al. / China Economic Review 33 (2015) 50–66

  • 8/17/2019 Agglomeration and Productivity in China Firm Level Evidence

    5/17

    Furthermore, the original identication codes in the Survey are sometime duplicated amongseveral rms or are missing for some

    rms, preventing us to uniquely identify each observation. We therefore assign each enterprise a new identication code based on its

    name in Chinese, its original identication number and the name of its legal representative. Also, China has modied its industry

    coding system in 2002, switching from GB1994 to GB2002. Because our exercise requires consistent information on industry

    classication, we employ the method by Brandt et al. (2012) to prevent our observations from switching their industries due to

    this administrative issue.

     3.1.2. The 2004 CensusOne issue about using the Survey data to assess industrial agglomeration is that it fails to capture the small non-SOE  rms with

    sales under the threshold. In order to take into account the clustering of SMEs highlighted by many case studies, we exploit the

    2004 Census (henceforth, the Census) to calculate alternative set of indicators of agglomerations. To demonstrate the difference in

    coverage between the Census and Survey, Table 2 compares the industrial output andrm number in 2004 by province. The number

    of rms covered by the Census is about 5 times larger than that by the Survey, while the total industrial output covered by the Census

    is only 10% larger than that by the Survey. These differences indicate the existence of a large mass of small rms. Indeed, the average

    rm size in the Census is overall about one-fth of thatin the Survey. The number of small rms covered by the Census but not by the

    Survey is especially large in Zhejiang, Jiangsu and Guangdong where case studies and media have documented vibrant industrial

    clusters. In fact, 40% of such  rms are concentrated in the three provinces. Relying solely on the Survey ignores such mass of small

    rms, and can result in inaccurate measurement of agglomerations in China. We therefore construct an alternative set of indicators

    of agglomeration using the Census data.

    The Census and the Survey share some common variables such as a rm's name, identication number, address code and industry

    code. Since we construct agglomeration indicators from the Census data by a spatial unit and industry (as described later in this

    section), we rely on address code and industry code to match the two datasets. However, the largest drawback of the Census data

    is its limited availability: within in our sample period, it is only available in the year 2004. This constrains our empirical analysis

    using the Census data to a cross-sectional exercise in one specic point of time. Therefore, we complement the base analysis using

    the panel of the Survey data with an additional analysis based on the cross-section of the Census data.

     Table 2

    Comparison between the Census and the Survey data (year 2004).

    Source: Authors' computation based on 2004 Census and 2004 Survey of Above-scale Industrial Enterprises.

    Number of  rms Industrial output (1000 RMB) Average rm size (output,

    1000 RMB)

    Census Survey Census Survey Census Survey

    Beijing 31,669 6871 597,470,208 573,325,440 18,866 83,441

    Tianjin 25,654 6462 611,914,432 585,449,536 23,853 90,599

    Hebei 64,629 9284 1,018,096,896 868,167,360 15,753 93,512

    Shanxi 28,884 5013 417,396,224 377,103,936 14,451 75,225

    Neimeng 11,849 2275 232,745,824 209,593,584 19,643 92,129

    Liaoning 54,610 10,635 914,065,792 860,390,080 16,738 80,901

     Jilin 16,363 3280 355,172,064 334,378,048 21,706 101,945

    Heilongjiang 20,303 3297 395,569,632 371,877,344 19,483 112,793

    Shanghai 55,806 15,766 1,459,413,760 1,396,810,752 26,152 88,596

     Jiangsu 188,844 40,848 2,947,678,976 2,668,238,848 15,609 65,321

    Zhejiang 188,917 41,358 2,122,122,624 1,872,797,952 11,233 45,283

    Anhui 39,263 4778 423,602,208 366,015,040 10,789 76,604

    Fujian 49,824 11,918 751,248,256 678,341,504 15,078 56,917

     Jiangxi 29,468 4019 273,658,912 221,197,920 9287 55,038

    Shandong 120,673 23,915 2,467,854,336 2,252,100,864 20,451 94,171

    Henan 76,896 11,474 923,664,320 757,558,784 12,012 419,733Hubei 29,262 6232 532,919,104 496,024,640 18,212 79,593

    Hunan 43,925 7523 434,186,112 365,406,624 9885 48,572

    Guangdong 137,652 34,584 3,151,649,536 2,955,180,800 22,896 85,449

    Guangxi 19,080 3749 224,198,624 202,548,480 11,750 54,027

    Hainan 2066 588 42,941,776 40,723,348 20,785 69,257

    Chongqing 20,509 2634 259,884,384 214,272,608 12,672 81,349

    Sichuan 43,759 7413 530,365,728 471,684,928 12,120 63,629

    Guizhou 11,121 2545 154,630,880 139,491,408 13,904 54,810

    Yunnan 14,402 2332 234,406,848 209,204,800 16,276 89,710

    Tibet 356 187 2,484,754 2,269,713 6980 12,138

    Shanxi 25,785 3012 315,087,840 273,521,600 12,220 90,810

    Gansu 11,664 1927 169,580,784 158,260,272 14,539 82,128

    Qinghai 2198 463 38,809,336 37,421,872 17,657 80,825

    Ningxia 4019 662 60,518,648 55,365,568 15,058 83,634

    Xinjiang 5807 1430 165,602,080 157,184,448 28,518 109,919

    Total 1,375,257 276,474 22,228,940,898 20,171,908,101 16,163 72,961

    54   C. Hu et al. / China Economic Review 33 (2015) 50–66

  • 8/17/2019 Agglomeration and Productivity in China Firm Level Evidence

    6/17

     3.2. The indicators of agglomeration

    Mainland China has six layers of administrations, starting from a province  ① and descending to a prefecture, a county, a

    township and to a village. The spatial unit we use for our base analysis is a county. As of 2013, mainland China comprises

    2856 counties. We observe the agglomerations of 131 3-digit China Industry Code (CIC) industries within a county.② Later in

    the paper, we conduct a robustness tests to show that our main  nding is not sensitive to our choice of spatial unit and industry

    classication.

    The benets of agglomeration are likely decrease with the distance. However, we cannot capture the relation between agglomer-

    ation effects and the distance precisely because neither the Census nor the Survey provides exact geographical location of rms (such

    as their longitude and latitude). As a second best approach, we compare the impact of an agglomeration within the same county the

    rm is located and that of an agglomeration in other countieswithin the same province. We do not consider the agglomeration effects

    across provinces given that the protectionism by provincial government in product and factor market is likely to limit them(Bai, Duan,

    Tao, & Tong, 2004).

     3.2.1. Agglomeration of the same industryWe construct four measures of agglomeration that correspond to a conventional concept of localization economies that is the

    agglomerations of  rms within the same industry. For a  rm i operating in the industry j in the county k, such indicators are:

    1. The number of  rms in the same industry and the same county: agg _NC  jkThis corresponds to the number of  rms in the industry j and in the county k. We include the rm itself when calculating thisindicator so that it is feasible to take the logarithm of index even when there is one  rm operating in a given industry and a

    given county.2. The average size of  rms in the same industry and the same county: agg _YC  jk

    This measure captures the co-location of large rms belonging to the same industry within proximity of a given rm. It is obtained

    by the following formula:

    agg YC  jk  ¼X

    m∈ j;k

    Y m

    0@

    1A−Y i

    0@

    1A.   agg NC  jk−1

      ð1Þ

    where Y i is the output of therm i. When there is only onerm in the industry j in the county k, we replace the index by 0. Althoughthis measure is specic to each rm, we drop the subscript i for the sake of brevity.We construct two more indicators of agglomeration that concern the number or the average size of the rms belonging to the same

    industry but that are located outside the county the

    rm i operates, while still within the same province.3. The number of  rms in the same industry but outside the county: agg _NP  jkThis corresponds to the number of  rms in the industry j but outside the county k.

    4. The average size of  rms in the same industry but outside the county: agg _YP  jkUnlike the second indicator, this indicator is a simple ratio between the aggregate output and number of therms in the industry jbut located outside the county k.

     3.2.2. Agglomeration of the upstream industryIn order to avoid the multicollinearity from having too many variables in our estimation, we focus on agglomerations of upstream

    industries located within the same county. A rm's input coef cients from upstream industries should differ for each rm depending

    on its sourcing strategy. However, because the information on inputs at  rm-level is not available, we follow Javorcik (2004) and

    many other studies to use the industry-level input coef cients as the proxy of  rm-level input coef cients. The coef cients

    are taken from input–output (IO) tables of China.③ For a given  rm in industry j and in country k, the indicator of agglomeration of 

    upstream industries is constructed as the following:1. The number of  rms in the same county but upstream industries: Uagg _NC  jk

    The indicator aggregates the number of  rms in other industries as the industry  j  located in the country  k  using the inputcoef cients of the industry j as weights:

    Uagg NC  jk  ¼X

    m;m≠ j

    σ  jmN mk   ð2Þ

    ① Provinces also include municipality directly administered by central government such as Beijing, Tianjin, Shanghai and Chongqing, as well as autonomous regions

    like Guangxi, Ningxia, Neimeng, Xingjiang and Xizang.② The CIC comprises 470 four-digit, 181 three-digit, and 38 two-digit industries, but we covered only 131 three-digit industries due to there are not suf cient obser-

    vations in some industries.③ Because IO table withinour sample periodis only availableon 2002 and2007, weobtainthe input coef cientsin theotheryearsfrom a linear interpolation between

    the 2002 and the 2007 input coef cients. We also assume that the input coef cients in 2001 are identical to those in 2002. Furthermore, input coef cients in the IOtables are at two-digit level, implying that three-digit industries within the same two-digit industry are all assigned the same coef cients.

    55C. Hu et al. / China Economic Review 33 (2015) 50–66

  • 8/17/2019 Agglomeration and Productivity in China Firm Level Evidence

    7/17

    the subscript m represent industry other than j. σ  jm is theinputcoef cient of industry j from industry m and N mk is the total number of rms in industry m in the county k.2. The average size of  rms in the same county but upstream industries: Uagg  _ YC  jk

    The indicator is calculated as the following:

    Uagg YC  jk  ¼X

    m;m≠ j

    σ  jmY mk

    =Uagg NC  jk   ð3Þ

    where Y mk is the sum of output of the rms belonging to industry m in the same county k as rm i.

    All the indicators are transformed into natural logarithm. Table 3 provides summary statistics of these agglomeration indicators

    both when using the Survey data and the Census data.

     3.3. Estimation of TFP 

    We observe the total factor productivity (TFP) as the principle measure of productivity. TFP is obtained as the   “residual” after

    deducting the contributions by labor and capital inputs to its output. It is thus obtained from an estimation of production function.

    We apply the method proposed by Levinsohn and Petrin (2003) which controls for the potential endogeneity in estimate of inputs

    coef cients. Consider a  rm's production function such as below:

    Y t  ¼ f I t ;ς t ;β ð Þ ð4Þ

    where, I t  are inputs used in production; β  are parameters; and  ς t  is error term vector, which can be interpreted as Hicks-neutralproductivity.

    Becauserms with higher productivity are expected to usemore inputs, the input vector I t may be correlated with the unobservableproductivityς t . In such case, the estimates obtained from OLS are biased. Levinsohn and Petrin (2003) consider the following log-linearproduction function:

     yt  ¼  β 0 þ β llt  þ β kkt  þ β mmt  þ υt  þ ζ t    ð5Þ

    where, yt , lt , kt and mt are log of output, labor, capital and intermediate inputs, respectively; lt is deemed as free variable, kt and mt arestate variables. The error term ς t  in the Eq. (4) is divided into two parts, namely υt  and ζ t . υt  is the unobservable state variable which

    impacts the rm's decision on inputs, whereas ζ t  is an independently and identically distributed disturbances that has no impact onthe rm's decision. Assume intermediate input's demand function is given as,

    mt  ¼  mt   υt ; kt ð Þ:   ð6Þ

     Table 3

    Summary statistics of the indicators of agglomeration.

    Source: Authors' calculation.

    Obs Mean Std. Dev. Min Max

    Survey data sampleagg_NC 609,331 0.56 0.81 0.00 6.29

    agg_YC 609,331 4.40 5.11 0.00 17.69

    agg_NP 609,331 4.12 1.40 0.00 8.04

    agg_YP 609,331 10.69 1.29 0.00 18.24

    Uagg_NC 609,217 0.60 1.59 0.00 6.01

    Uagg_YC 609,058 10.69 0.98 0.08 18.86

    Census data sampleagg_NC 67,013 1.70 1.28 0.00 7.65

    agg_YC 67,013 7.26 3.74 0.00 17.44

    agg_NP 67,013 5.86 1.44 0.00 9.20

    agg_YP 67,013 9.43 1.13 0.00 16.79

    Uagg_NC 67,013 2.42 1.53 0.00 7.04

    Uagg_YC 67,013 9.25 0.90 3.27 14.01

    Note:

    1. The agglomeration indicators are all in natural logarithm.

    2. agg_NC is thenumber of rmsin thesame industry andsame county;agg_YC is theaverage size of rmsin thesame industry andsame county; agg_NPis thenumber

    of  rms in the same industry same province but outside the county; agg_YP is the average size of  rms in the same industry same province but outside the county;Uagg_NC is the number of  rms in the same county but upstream industries; and Uagg_YC is the average size of  rms in the same county but upstream industries.

    56   C. Hu et al. / China Economic Review 33 (2015) 50–66

  • 8/17/2019 Agglomeration and Productivity in China Firm Level Evidence

    8/17

    If such function is monotonic inυt forall kt , the unobservable υt can be captured by theinverse function of intermediate input to getυt  = υt (mt , kt ). This enables the Eq. (5) to be rewritten as,

     yt  ¼  β llt  þ ϕt   mt ; kt ð Þ þ ζ t    ð7Þ

    where, ϕt (mt , kt ) = β 0 + β kkt  + β mmt  + υt (mt , kt ).Following idea of  Robinson (1988), subtracting expectation of  yt  conditional on mt  and  kt , the Eq. (7) can be transformed as

    below:④

     yt −E yt jmt ; kt ½ ¼  β l   lt −E lt jmt ; kt ½ ð Þ þ ζ t :   ð8Þ

    Since ζ t  is independently and identically distributed errors, OLS regression of Eq. (8) yields the consistent estimate of  β l.Coef cients on capital input β k and on intermediate inputs β m are estimated as the following:  rst, it is assumed that υt  follows

    a  rst order Markov process and that capital does not immediately respond to  ξt ,the innovations in productivity over last period'sexpectation, then,

    ξt  ¼ υt −E  υt jυt −1½ :   ð9Þ

    According to the relationship of Eq. (5), we can get:

    ξt  þ ζ t  ¼ yt −β̂ llt −β kkt  þ β 

    mmt  þ  Ê  υt jυt −1½ ð10Þ

    where  β̂ l  is the estimated coef cient of labor input obtained previously.β k⁎ andβ m⁎ are thecandidate values forβ k andβ m, respectively. Ê 

    [υt |υt  −   1] is the residual obtained from the regression of    yt −β̂ llt −β kkt −β 

    mmt 

      on   ϕ̂t −1−β 

    kkt −1−β 

    mmt −1

     using weighted

    quadratic least squares. The latter is therefore a function of  β k⁎ and β m⁎.The following two moment conditions are required to obtain the consistent estimates:

    E   ξt  þ ζ t ð Þkt ½ ¼  E  ξt kt ½ ¼  0   ð11Þ

    E   ξt  þ ζ t ð Þmt −1½ ¼  E  ξt mt −1½ ¼  0   ð12Þ

    The rst moment condition requires that capital input and productivity are not correlated, whereas the second requires that the

    past intermediate inputs are not correlated with productivity. The estimates  β̂ k  and  β̂ m are obtained by minimizing the followingGMM standard equation:

    Q   β ð Þ ¼   minβ X

    4

    h¼1

    Xi

    XT i1t ¼T i0

    ξit  þ̂ζ it 

     Z i;ht Þ

    2ð13Þ

    where Z t  = {lt , kt , mt − 1, mt − 2}. In the equation above, i represent enterprises, h the number of  Z ; and T i0 and T i1 are the second yearand the last year when enterprise i is observed in the sample. Since panel data are required for the estimation of production function,we rely on theSurvey data to estimate TFP. This implies that the recipients of agglomeration effects in this paper are SOEs and relative

    large non-SOEs.

    Given that  rm-specic prices are not available, we follow Brandt et al. (2012) to use the 2-digit industry-level price index to

    deate a  rm's output and intermediate input, and the  xed asset investment price to deate its capital. Furthermore, we estimate

    product function separately for each 2-digit industry considering the fact that input coef cients differ substantially across industries.

    The average TFP (in logarithm) for each year is shown in  Table 4, indicating a clear uptrend that is in line with previous studies. It

    can also be seen that the average TFP of private enterprises is generally higher than those of SOEs and FIEs although there has

    been an apparent surge in the TFP of SOEs at the latter years. This is in line with the well-documented growth in protability of 

    SOEs as they retreat from the sectors open to competition and FDI and concentrate on highly regulated upstream industries (Du &

    Wang, 2013).

     3.4. Empirical model of agglomeration effects

    We estimate the following model that relates the TFP of a rm i in industry j located in county k to the agglomeration of the sameindustry and the upstream industries:

    lnT FP ijkt  ¼  α  þ β 1Horizontal jkt −1 þ β 2Horizontal jmt −1 þ β 3Upstream jkt −1 þ γ  X ijkt −1 þ η i þ η t  þ η  j þ η k þ ε ijkt :   ð14Þ

    ④ Take expectation for Eq. (4) conditional on mi and ki, we can get E [ yt |mt , kt ] = β lE [lt |mt , kt ] + ϕt (mt , kt ), so Eq. (5) is gotten.

    57C. Hu et al. / China Economic Review 33 (2015) 50–66

  • 8/17/2019 Agglomeration and Productivity in China Firm Level Evidence

    9/17

    The rst term on the left hand side is a constant, whereas the second to the fourth terms are the indicators of industrial agglom-

    eration. The term Horizontal jkt is a vector of theindicators of agglomerationin thesame industry and in the same county, agg _ NC , andagg _ YC . The term Horizontal jmt is a vector of theindicators of agglomerationin the same industry but outside thecounty, agg _ NP andagg _ YP . The term Upstream jkt  is a vector of the two indicators of agglomeration of the upstream industries, Uagg  _ NC and Uagg _ YC .Since all agglomeration indicators are in logarithm, the estimated coef cients can be interpreted as elasticity of productivity to

    number of  rms or average  rm size in the same industry or in upstream industries.

    The signs of estimated coef cients on the indicator of agglomeration Horizontal jkt are not clear a priori given that positive impactsof Marshallian externalities are at least partly offset by congestion effects and intensied competition in product and factor markets

    that suppresses product prices while driving up inputscosts. However, such negative effects to a rm's productivity can be less severe

    when the agglomeration is located farther from the  rm (i.e., in other counties) or occurring in upstream industries. On the other

    hand, knowledge spillovers are expected to be smaller when an agglomeration is not located within the proximity or it concerns

    industries using different technologies. Therefore, the signs and relative sizes of coef cients on Horizontal jmt  and Upstream jkt  arealso not clear.

    Thefth term X ijkt is thevector of control variables. It includes the followingrm characteristics that can inuence a rm's produc-tivity: (1) the logarithm of R&D Investment; (2) the market share, which is the rm's output share in the same industry. Firms with

    larger market share often enjoy stronger monopolistic power and higher mark-up; (3) debt ratio, measured as the ratio of the rm's

    total liability to total assets, which we interpret as the rm's borrowing capability. Enterprises with strong borrowing capability may

    realize higher productivity because their investments are less subjected to credit constraints; (4) the export intensity, measuredas the

    ratio of the rm's export to its total value-added. Firms that export enjoy scale economy by supplying the world's market and can also

    tap into advanced technology through transaction with foreign partners (De Loecker, 2007); and (5) rm age, measured as the differ-

    ence between the year in sample and the year of establishment. Longer survival reects the rm's intangible competitiveness such as

    superior managerial capability. It also accounts for productivity advantage over competitors that is founded on  “learning by doing”

    effect. Furthermore, a county-level average wage is included in order to control for thequalityof labor in the county therm is located.

    Firms often agglomerate in areas with abundant supply of skilled labor, the use of which is an important determinant of a rm's pro-

    ductivity. Controlling for the local labor quality allows us to capture the pure effects of agglomeration. Table 5 provides summary sta-

    tistics of those control variables.

    Because of more intense competitions in product and factor markets within agglomerated regions,  rms with originally high

    productivity are more likely to locate in such area ( Baldwin & Okubo, 2006). We address the possibility of such reverse causality

    from productivity to agglomeration in two ways: rst, we lag all indicators of agglomeration one period along with the control

    variables. Thus, when using the indicators constructed from the Census data, Eq. (14) is estimated with a cross-sectional OLS which

    regresses  rm's TFP of 2005 on the indicators that capture agglomeration in the year 2004. Second, when using the indicators com-

    puted from the Survey data, we include a  rm-level xed effect η i

     that captures time-invariant unobservable  rm characteristics

    (such as superior managerial quality). Because some rms switch industries and areas over the sample period, three digits industry

    xed effects and province  xed effects are included in estimation. Furthermore, we include time  xed effect to control for trend

    growth in productivity when using panel data. Lastly, ε ijkt  is an i.i.d error which follows N (0, σ 2).

     Table 4

    Summary of the estimated TFP by year.

    Source: Authors' calculation based on survey data of above-scale industrial enterprises.

    Sample Statistics 2001 2002 2003 2004 2005 2006 2007

    All rms Mean 2.95 3.08 3.23 3.31 3.47 3.60 3.75

    Std. Dev 1.56 1.54 1.49 1.42 1.42 1.41 1.40

    SOEs Mean 2.46 2.60 2.85 3.02 3.26 3.51 3.89

    Std. Dev 1.82 1.83 1.80 1.79 1.81 1.77 1.75

    PEs Mean 3.12 3.21 3.32 3.37 3.51 3.63 3.78Std. Dev 1.37 1.36 1.35 1.29 1.32 1.33 1.35

    FIEs Mean 3.10 3.18 3.25 3.23 3.38 3.51 3.60

    Std. Dev 1.55 1.55 1.53 1.51 1.48 1.47 1.47

    Notes:

    1. TFP are in natural logarithm.

    2. SOEs refer to state-owned enterprises; PEs refer to private enterprises; FIE S refer to foreign invested enterprises.

     Table 5

    The denition and summary statistics of control variables.

    Source: Authors' calculation based on survey data of above-scale industrial enterprises.

    Variables Denition Obs Mean S.D. Min Max

    R&D investment Natural log of R&D investment 1,303,196 0.61 1.84 0.00 16.14

    Market share The proportion of  rm' output to total output of industry 1,830,664 0.00 0.00 0.00 0.80

    Debt ratio The ratio of total assets to total liability 1,830,664 0.59 0.35 0.00 48.50

    Export ratio The ratio of export to the added value 1,830,664 0.16 0.33 0.00 2.00

    Age (year) Firm age 1,830,664 10.00 11.40 1.00 150.00

    Average wage Natural log of average wage of the county 1,830,664 2.70 0.49   −2.36 5.74

    58   C. Hu et al. / China Economic Review 33 (2015) 50–66

  • 8/17/2019 Agglomeration and Productivity in China Firm Level Evidence

    10/17

    The model is estimated for the whole sample as well as for the sub-samples of rms with different ownerships, namely the SOEs,

    private enterprises (PEs) and FIEs. The latter specication reveals how the difference in business conditions, corporate culture and

    technological endowment among the three types of  rms shapes their ability to benet from agglomeration. Furthermore, we also

    assess how such differences shape a  rm's ability to act as the source of spillovers by estimating the Eq. (14) with the indicators

    that capture an agglomeration of the rms of specic ownership.

    4. Estimation results

    4.1. Agglomeration effects based on the Survey data

    Table 6 presents the results of base estimation and those over sub-samples of rms with different ownerships. As the most basic

    setting, column 1 shows the results where only the indicator of agglomeration of the same industry in the same county is included.

    The average rm size (agg_YC) has statistically signicant positive contribution to a  rm's productivity, while the impact of number

    of rms (agg_NC) is not signicant and negative. The insignicant coef cient on number of rms suggests that severe congestion ef-

    fect associated with agglomeration is offsetting the benets from Marshallian externality. The positive coef cient on average rm size

    however suggests that the co-location of large, productive  rms within proximity does contribute to productivity.

    As we add the agglomeration of the same industry outside the same county and that of upstream industries within the same

    county in column 2, the coef cient on agg_NC now indicates a statistically signicant negative effect on productivity. It implies that

    doubling the number of  rms in the same industries within the same county can reduce a  rm's productivity up to 1.2%. On the

    other hand, the coef cient on the number of  rms in the same industry outside the county (agg_NP) is signicant and positive,

    implying that doubling such number raises a  rm's productivity by 1.3%. This somewhat surprising  nding that agglomeration ismore benecial when it is in a moderate distance indicates a specic situation in China where an agglomeration of the same industry

     Table 6

    Estimates of agglomeration effects based on the Survey data.

    Sample All (1) All (2) SOEs (3) PEs (4) FIEs (5)

    Indicators of agglomeration(1) agg_YC 0.004*** 0.004*** 0.001 0.004*** 0.003**

    (8.97) (7.99) (0.52) (8.09) (2.92)

    (2) agg_NC   −0.002   −0.012*** 0.001   −0.016*** 0.001

    (−0.73) (−4.54) (0.06) (−5.37) (0.21)

    (3) agg_YP 0.010*** 0.003 0.014***   −0.001

    (5.30) (0.59) (5.79) (−0.13)

    (4) agg_NP 0.013*** 0.032*** 0.008** 0.009(5.12) (3.61) (2.72) (1.61)

    (5) Uagg_YC 0.052*** 0.028*** 0.060*** 0.035***

    (18.31) (3.11) (17.63) (5.16)

    (6) Uagg_NC 0.032*** 0.024** 0.037*** 0.027***

    (13.10) (2.93) (12.52) (4.60)

    RD investment 0.003*** 0.003*** 0.009*** 0.003*** 0.004**

    (5.11) (5.05) (4.20) (3.13) (2.57)

    Market share 7.065*** 7.275*** 3.378*** 9.452*** 6.881***

    (19.79) (20.13) (4.14) (15.82) (10.26)

    Debt ratio   −0.077***   −0.075***   −0.046***   −0.083***   −0.051***

    (−17.83) (−17.56) (−3.92) (−13.93) (−6.38)

    Export ratio   −0.084***   −0.084***   −0.052   −0.032***   −0.144***

    (−13.76) (−13.72) (−1.19) (−3.97) (−15.17)

    Age   −0.010***   −0.011***   −0.039*** 0.003 0.030***

    (−4.25) (−4.57) (−4.15) (1.22) (3.37)

    Average wage 0.156*** 0.149*** 0.087*** 0.142*** 0.212***(28.27) (26.96) (4.28) (21.96) (16.02)

    Constant 2.036*** 1.298*** 0.353 2.543***   −1.270

    (8.13) (5.13) (0.49) (3.01) (−1.24)

    Prob F 0.000 0.000 0.000 0.000 0.000

    R-square 0.484 0.480 0.137 0.371 0.289

    Observations 849,775 849,775 77,849 582,251 183,863

    Notes:

    1. All estimations include  rm, time, industry and province  xed effects.

    2. t-statistics are in the parentheses.

    3. ***, **each corresponds to signicance at 1% level and 5% level, respectively.

    4. The data used for estimation is from Annual Survey of Above-scale Industrial Enterprises.

    5. SOEs refer to state-owned enterprises; PEs refer to private enterprises; FIE S refer to foreign invested enterprises.

    6. agg_NC is thenumber of rmsin thesame industry andsame county;agg_YCis theaverage size of rmsin thesame industry andsame county; agg_NPis thenumber

    of  rms in the same industry same province but outside the county; agg_YP is the average size of  rms in the same industry same province but outside the county;

    Uagg_NC is the number of  rms in the same county but upstream industries; and Uagg_YC is the average size of  rms in the same county but upstream industries.All these variables are in natural log form. Denitions of other control variables please refer to  Table 5.

    59C. Hu et al. / China Economic Review 33 (2015) 50–66

  • 8/17/2019 Agglomeration and Productivity in China Firm Level Evidence

    11/17

    penalize rms in proximity with severe congestion and competition that overwhelm the localization economy. The negative effect of 

    stronger competition is also hinted by the positive coef cient on the average size of rms in the same industry but outside the same

    county (agg_YP), which is larger than that on the average rm size within the same county (agg_YC). This suggests that the benet

    from the colocation of large, competitiverms is partly offset when they are within proximity, because they can be formidable rivals

    in the local product and factor markets.

    Furthermore, the coef cients on the number of rms and the average rm size in upstream industries within the same county are

    both positive and signicant. Moreover, those coef cients are considerably larger than those corresponding to the agglomeration of 

    the same industry. For instance, they imply that that doubling the number of rms in upstream industries or their average size raises

    a rm's productivity by 3.2% and 5.2%, respectively. These results provide strong support to our view that agglomeration of upstream

    industries is the important source of competitiveness of Chineseindustry. Furthermore, the benet of agglomeration to productivity is

    less weakened by the negative impact via  ercer competition when agglomeration concerns non-competing industries. Finally, it is

    noteworthy that the F-statistics indicates the joint-signicance of all those additional indicators of agglomeration. This indicates

    that studies that failed to incorporate agglomeration of upstream industries were indeed prone to misspecication and may have

    not captured the true contribution of agglomeration to productivity.

    Columns 3 to 5 report the results for the sub-samples of rms with different ownerships, namely state-owned enterprises (SOEs),

    private enterprises (PEs) and foreign invested enterprises (FIEs). Interestingly, many of the observations above are driven by the

    sub-sample of PEs. For instance, the signicant and negative impact on productivity from agg_NC is only observed among the PEs.

    The signicant positive coef cients on both agg_YC and agg_YP are also observed only for PEs. Such results may be reecting the

    fact that PEs are most exposed to market competition and are therefore most subject to negative impacts of agglomeration. They

    are also most eager to absorb knowledge spillovers given their strong prot-orientation and a large room for catch-up in production

    and managerial technology (Zhang, Zeng, Mako, & Seward, 2009). Interestingly, the number of  rms outside the county (agg_NP)

    seems to benet SOEs more than PEs.

    As for the agglomeration of upstream industries, the number of rms (Uagg_NC) and average rm size (Uagg_YC) are both asso-

    ciated with positive and signicant coef cients for rms with all types of ownership. This is not surprising since the benet from the

    agglomeration of supporting industries such as supply of higher quality and cheaper intermediate goods should rewardrms regard-

    less of their corporate behavior. However, PEs are again enjoying the largest impact, possibly because they outsource more of their

    inputs. Overall, our observation reveals that PEs are the primary recipient of both positiveand negative agglomeration effects in China.

    Each column also displays the contribution of control variables that is more or less consistent across ownership. Most importantly,

    R&D contributes signicantly to productivity as reported by previous studies such as Wu (2006). The estimated coef cients suggest

    that doubling R&D expenditure increases the rm's productivity from 0.3% to 0.9%, depending on the rm's ownership. The elasticity

    is highest among the SOEs, which are observed to occupy a large share of R&D investment in China (Pilat, Yamano, & Yashiro, 2012).

    This is roughly in line with the view that the return to R&D investment is shaped by a  rm's capability to absorb new technology,

    which is formed by previous R&D activities (Cohen & Levinthal, 1989). Contrary to our conjecture, the debt ratio and export ratio

    are negatively associatedwith therm's productivity. The negative relation between export ratio and productivity is especially appar-

    ent for FIEs, which engage intensively on processing trade. Firm age is also negatively associated with productivity in case of SOEs.

    Lastly, county-level average wage contributes signicantly to a rm'sproductivity, suggesting that skilledlabor is indeed an important

    element of competitiveness by Chinese rms.

    4.2. Agglomeration effects based on the Census data

    The estimation of agglomeration effects using the indicators of agglomeration constructedfrom the Census datais a cross-sectional

    OLS that does not control for an unobserved  rm heterogeneity. This may result in biased coef cients if such heterogeneity affects

    both the productivity and likelihood of locating in agglomerated region. Also, it only observes the relationship between productivity

    and agglomeration in a specic point of time, the year 2005. The estimated coef cients are therefore not comparable from those ob-

    tained from the estimation using indicators based on the Survey data. Keeping those caveats in mind, we investigate whether taking

    into account the agglomeration of smaller rms will alter or provide new nding on our previous results based on the Survey data.

    The results fromthe estimation displayed in Table 7 are qualitatively similarto the results in Table 6. For instance, thecoef cient onthe number of rms in the same industry within the same county (agg_NC) is negative and signicant, conrming the severe conges-

    tion and competition effect associated with agglomeration. The negative impact of agg_NC becomes more pronounced when agglom-

    eration outside the county and that of upstream industries are incorporated (column 2). On the other hand, contrary to the results in

    Table 6, the coef cient on agg_NP is also negative, albeit the limited signicance. This suggests that the agglomeration in a moderate

    distance is not as benecial as previously found, possibly because the positive externality wane with distance or congestion and com-

    petition effects spreads across counties. The smaller coef cient on agg_NP than that on agg_NC is still in line with theprevious nding.

    As forthe averagerm size,the coef cienton agg_YC and that on agg_YP are both positivewiththe former being larger than the latter,

    in line with the results in Table 6. The contribution by the agglomeration of upstream industries is conrmed by the signicant and

    positive coef cient on Uagg_NC. These results support our  ndings based on the Survey data. It is also noteworthy that the signs of 

    coef cients on control variables are also in line with the previous results except the one on export ratio.⑤

    ⑤ There is abundant theoretical and empirical evidence on the self-selection of productive rms into exports (for instance, Melitz, 2003, Tybout et al., 1998). The

    strong positive correlation between productivity and export intensity may be due to the inability to control for the unobserved  rm heterogeneity that raisesproductivity.

    60   C. Hu et al. / China Economic Review 33 (2015) 50–66

  • 8/17/2019 Agglomeration and Productivity in China Firm Level Evidence

    12/17

    However, when looking across sub-samples of rms with different ownerships in columns 3 to 5, PEs are not necessarily the main

    recipient of agglomeration effects. For instance, it is the SOEs that suffer the signicant negative impact from agg_NC while the coef-

    cient is insignicant for PEs and FIEs. This is likely to reect an erosion of market shares and prots of SOEs due to competitions

    against PEs and FIEs. While the coef cient on Uagg_NC is positive and signicant for both SOEsand PEs,the size of coef cient suggests

    that SOEs benet more than PEs. Furthermore, the coef cients on the average  rm size suggest that SOEs and FIEs enjoy larger

    spillovers from a co-location with large  rms than PEs.

    4.3. Agglomeration effects from different ownerships

    The difference in ownership may dene not only a  rm's ability to absorb favorable agglomeration effects but also the extent to

    which it acts as a source of agglomeration effects. For instance, FIEs has long been considered as the source of technology spillover.

    Also, PEs, which are signicantly smaller than SOEs or FIEs, may be an active source of agglomeration effects to the extent that smaller

    rms are more likely to interact with and outsource to other  rms due to its limited resource (Rosenthal & Strange, 2001). On the

    other hand, SOEs that are oriented toward government policies and enjoy policy-backed monopolistic power may generate little spill-

    overs to other rms. In order to investigate which type of rms are the most important source of agglomeration effects, we construct

    theindicators of agglomeration separatelyfor SOEs, PEs and FIEs and re-estimate themodel corresponding to the column 2 of Tables 6

    and 7. We estimate the agglomeration effects to the TFP of all  rms as well as to that of PEs, the latter being the main recipients of 

    agglomeration effects.

    Table 8 summarizes the results of this exercise. For the sake of brevity, the estimated coef cients on control variables are not

    displayed. Columns 1 to 3 observe how the TFP of Chinese rms is affected by the agglomeration of SOEs, PEs and FIEs. The coef cient

    on agg_NC, the number of rms in the same industry within the same county, is insignicant regardless of the type of rms. This sug-

    gests that the negative coef 

    cient observed in Tables 6 and 7 is a composite effect of agglomeration of all three types of 

    rms. On theother hand, the coef cient on agg_YC is positive only for the agglomeration of PEs and for that of FIEs but with limited signicance.

     Table 7

    Estimates of agglomeration effect based on the Census data.

    Sample All (1) All (2) SOEs (3) PEs (4) FIEs (5)

    Indicators of agglomeration(1) agg_YC 0.058*** 0.052*** 0.054*** 0.038*** 0.059***

    (55.90) (49.89) (17.11) (31.49) (21.90)

    (2) agg_NC   −0.006***   −0.023***   −0.072***   −0.004   −0.004

    (−3.50) (−9.98) (−6.39) (−1.45) (−0.72)

    (3) agg_YP 0.066*** 0.065*** 0.062*** 0.023**(15.30) (5.74) (11.96) (2.04)

    (4) agg_NP   −0.006*   −0.011   −0.001 0.015*

    (−1.65) (−0.82) (−0.22) (1.67)

    (5) Uagg_YC 0.124*** 0.178*** 0.084*** 0.128***

    (34.74) (14.91) (20.61) (14.76)

    (6) Uagg_NC 0.046*** 0.155*** 0.023***   −0.020***

    (16.61) (15.49) (7.24) (−2.94)

    R&D investment 0.129*** 0.129*** 0.187*** 0.115** 0.107**

    (99.01) (99.02) (48.07) (71.83) (38.61)

    Market share 68.861*** 67.599*** 46.829*** 74.325*** 70.663***

    (103.65) (102.07) (28.19) (72.67) (59.67)

    Debt ratio   −0.172***   −0.176***   −0.272***   −0.134*** 0.006

    (−25.20) (−25.89) (−12.47) (−15.82) (0.41)

    Export ratio 0.167*** 0.151*** 0.511*** 0.125***   −0.073***

    (22.90) (20.82) (9.02) (12.98) (−5.87)

    Age   −0.011***   −0.011***   −0.127*** 0.044*** 0.033***(−4.30) (−4.07) (−13.17) (15.21) (4.75)

    Average wage 0.037*** 0.135*** 0.045 0.197*** 0.071***

    (4.38) (14.28) (1.39) (18.21) (3.13)

    Constant 5.753*** 4.594*** 3.379*** 5.124*** 4.927

    (162.27) (76.72) (17.25) (74.67) (25.45)

    Prob F 0.000 0.000 0.000 0.000 0.000

    R-square 0.243 0.249 0.443 0.216 0.226

    Observations 215,076 215,076 21,178 147,185 46,713

    Notes:

    1. All estimations include control variables as well as industry and province xed effects.

    2. t-statistics are in the parentheses.

    3. ***, **,* each corresponds to signicance at 1% level, 5% level and 10% level, respectively.

    4. The data used for estimation is from 2004 Census.

    5. SOEs refer to state-owned enterprises; PEs refer to private enterprises; FIEs refer to foreign invested enterprises.

    6. agg_NC is thenumber of rmsin thesame industry andsame county;agg_YCis theaverage size of rmsin thesame industry andsame county; agg_NPis thenumberof  rms in the same industry same province but outside the county; agg_YP is the average size of  rms in the same industry same province but outside the county;

    Uagg_NC is the number of  rms in the same county but upstream industries; and Uagg_YC is the average size of  rms in the same county but upstream industries.

    All these variables are in natural log form. Denitions of other control variables please refer to  Table 5.

    61C. Hu et al. / China Economic Review 33 (2015) 50–66

  • 8/17/2019 Agglomeration and Productivity in China Firm Level Evidence

    13/17

    Furthermore, the positive and signicant coef cient on agg_NP noted in column 2 of  Table 6 is only observed for PEs. While the

    coef cient on agg_YC is positive when agglomeration concerns all types of  rms, it is largest in case of PEs. Therefore, Chinese

    rms seem to benet most from the co-location of large, competitive PEs.

    It is noteworthy that the important contribution from the agglomeration of upstream industries is generated primarily from PEs.

    The coef cient on Uagg_NC is only positive for the agglomeration of PEs whereas it is insignicant or negative for those of SOEs and

    FIEs. The coef cient implies that doubling the number of PEs in upstream industries raises the productivity of Chineserms by 4.8%.

    While it is somewhat surprising that an agglomeration of FIEs in upstream sector has a negative impact on productivity, it is possible

     Table 8

    Agglomeration effects from different ownerships (the Survey data).

    Samples All rms PEs

    The source of agglomeration SOEs (1) PEs (2) FIEs (3) SOEs (4) PEs (5) FIEs (6)

    (1) agg_YC   −0.001 0.002*** 0.001** 0.000 0.027*** 0.006**

    (−1.53) (4.16) (2.43) (0.14) (16.27) (2.48)

    (2) agg_NC 0.012   −0.002   −0.004   −0.017 0.001 0.016**

    (1.52) (−

    0.74) (−

    1.46) (−

    1.60) (0.25) (2.44)(3) agg_YP 0.004*** 0.008*** 0.003*** 0.004*** 0.024*** 0.007*

    (7.26) (4.94) (5.20) (6.56) (7.56) (1.92)

    (4) agg_NP   −0.017*** 0.016*** 0.002   −0.021*** 0.012***   −0.013*

    (−8.61) (6.24) (0.69) (−8.66) (3.36) (−1.97)

    (5) Uagg_YC   −0.002 0.048*** 0.019***   −0.004*** 0.064*** 0.021***

    (−1.50) (17.32) (10.24) (−3.04) (16.55) (2.95)

    (6) Uagg_NC   −0.002 0.020***   −0.052*** 0.008 0.034*** 0.015*

    (−0.23) (6.51) (−13.88) (0.87) (8.28) (1.93)

    Prob F 0.000 0.000 0.000 0.000 0.000 0.000

    R-square 0.482 0.449 0.484 0.322 0.350 0.268

    Observations 839,791 727,896 742,700 573,801 464,062 140,042

    Notes:

    1. All estimations include control variables as well as rm, time, industry and province xed effects.

    2. t-statistics are in the parentheses.

    3. ***, **,* each corresponds to signicance at 1% level, 5% level and 10% level, respectively.

    4. The data used for estimation is from Annual Survey of Above-scale Industrial Enterprises.5. SOEs refer to state-owned enterprises; PEs refer to private enterprises; FIEs refer to foreign invested enterprises.

    6. agg_NCis thenumber of rmsin thesame industry andsame county;agg_YC is theaverage size of rmsin thesame industry andsame county; agg_NPis thenumber

    of  rms in the same industry same province but outside the county; agg_YP is the average size of  rms in the same industry same province but outside the county;

    Uagg_NC is the number of  rms in the same county but upstream industries; and Uagg_YC is the average size of  rms in the same county but upstream industries.

    All these variables are in natural log form.

     Table 9

    Agglomeration effects from different ownerships (the 2004 Census data).

    Samples All rms PEs

    The source of agglomeration SOEs (1) PEs (2) FIEs (3) SOEs (4) PEs (5) FIEs (6)

    (1) agg_YC 0.013*** 0.059*** 0.018***   −0.002 0.196*** 0.007***

    (10.90) (45.75) (27.75) (−1.10) (81.02) (9.31)

    (2) agg_NC   −0.087***   −0.012*** 0.011*** 0.038*** 0.022*** 0.012***

    (−8.96) (−5.51) (3.30) (3.13) (9.04) (3.39)

    (3) agg_YP 0.008*** 0.060*** 0.008*** 0.007*** 0.025*** 0.009***

    (5.54) (13.19) (5.23) (4.34) (4.31) (5.54)

    (4) agg_NP 0.008*   −0.010*** 0.004 0.016*** 0.008* 0.018***

    (1.84) (−2.58) (0.99) (3.32) (1.82) (4.44)

    (5) Uagg_YC 0.030*** 0.139*** 0.042*** 0.012*** 0.095*** 0.019***(18.64) (34.75) (20.39) (7.22) (21.28) (8.50)

    (6) Uagg_NC   −0.022*** 0.039***   −0.015***   −0.038*** 0.029***   −0.012***

    (−3.22) (13.17) (−4.68) (−4.67) (8.84) (−3.32)

    Prob F 0.000 0.000 0.000 0.000 0.000 0.000

    R-square 0.233 0.249 0.228 0.205 0.255 0.210

    Observations 212,324 205,464 202,179 144,875 139,607 139,617

    Notes:

    1. All estimations include industry and province xed effects.

    2. t-statistics are in the parentheses.

    3. ***,* each corresponds to signicance at 1% level and 10% level, respectively.

    4. The data used for estimation is from 2004 Census.

    5. SOEs refer to state-owned enterprises; PEs refer to private enterprises; FIEs refer to foreign invested enterprises.

    6. agg_NC isthe numberof rmsin thesame industry andsame county;agg_YC is theaverage size of rmsin thesame industry andsame county; agg_NPis thenumber

    of  rms in the same industry same province but outside the county; agg_YP is the average size of  rms in the same industry same province but outside the county;

    Uagg_NC is the number of  rms in the same county but upstream industries; and Uagg_YC is the average size of  rms in the same county but upstream industries.All these variables are in natural log form.

    62   C. Hu et al. / China Economic Review 33 (2015) 50–66

  • 8/17/2019 Agglomeration and Productivity in China Firm Level Evidence

    14/17

    that it intensies competition in factor market, namely of skilled labor. On the other hand, the average size of FIEs in upstream indus-

    tries has positive and signicant impact, implying that co-location of competitive FIEs in upstream industries is benecial. However,

    the coef cienton Uagg_YCis even larger when it concerns PEs. Overall, wend that PEs are themain sources of agglomeration effects

    in China's industry. In contrast, neither the number nor average size of SOEs in upstream industries seems to benet the productivity

    of Chinese rms.

    Columns 4 to 6 observe how the TFP of PEs is affected by the agglomeration of different types of   rms. The overall picture looks

    fairly similar to the case of allrms. It can be noted that the positive coef cients on agg_YC and agg_YP are larger when they concern

    PEs than SOEs or FIEs. This indicates that PEs learn signicantly more from the competitive PEs in proximity than from other types of 

    rms. Furthermore, PEs enjoy positive and signicant impact from the number of PEs in the same industry outside the county

    (agg_NP) but not from thatof SOEs or FIEs. Finally,like the case of all rms, only the number of PEs in upstreamindustries contributes

    positively to the productivity of PEs, but with larger magnitude (6.4% productivity increase when doubling the number of PEs).

    Overall, it can be said that PEs benet most from the agglomeration of PEs. The agglomeration effects from PEs are self-reinforcing

    in that sense.

    We also observe whether the indicators based on the Census data provide alternative picture. The results, summarized in Table 9,

    reveal some interesting new ndings. For instance, columns 1 to 3 indicate that the negative impact from agg_NC is driven mostly by

    the agglomeration of SOEs and PEs, whereas the number of FIEs in the same industry has favorable impact. This suggests that the

    clustering of small domestic rms is generating substantial congestion and competition, whereas FIEs do not add strong competition

    pressure in domestic product markets possibly because they are more export-oriented. PEs are also the primary source of congestion

    effects across counties, for a negative and signicant sign on agg_NP is only associated with its agglomeration. On the other hand, the

    coef cients on average rm size are largest in case of an agglomeration of PEs, in line with Table 8. Similarly, the positive coef cient on

    Uagg_NC is still observed only for PEs.

    Turning to columns 4 to 6, wend that the coef cients on agg_NC are positive and signicant for the agglomeration of all types of 

    rms. This implies that thenegative net impact from the number of rmsin thesame industry within thesame county observedin the

    column 2 of Tables 6 and 7 is sufferedby other types of rmsthan PEs. However, such results should be interpreted with care, for they

    may be driven partly by theunobservedrm characteristics that increase both the productivity of PEs and their propensity to locate in

    agglomerated areas. Other notable  ndings in Table 8 are also preserved: PEs benet most from the average size of PEs in the same

    industry and in upstream industries and only the number of PEs in upstream sector contribute to their productivity.

    4.4. Robustness check

    We assess the robustness of our ndings based on Survey data corresponding to the column 2 of Table 6. We do so by reproducing

    the estimation using alternative estimates of rm-level productivity as well as with different industrial classication and regional unit.

    The results of this exercise are summarized in Table 10.

     Table 10

    Base estimates with alternative specications.

    TFP_OLS (1) TFP_OP (2) Labor productivity (3) 4 Digit industries (4) 2 Digit industries (5) City level (6)

    agg_YC 0.003*** 0.004*** 0.004*** 0.002*** 0.004*** 0.005***

    (7.89) (7.97) (9.50) (5.93) (6.45) (7.26)

    agg_NC   −0.008***   −0.011***   −0.013***   −0.011***   −0.004   −0.003

    (−3.45) (−4.41) (−5.47) (−4.39) (−1.56) (−1.10)

    agg_YP 0.011*** 0.010*** 0.014*** 0.005*** 0.031*** 0.005***

    (6.45) (5.36) (7.80) (4.83) (7.93) (6.33)

    agg_NP 0.015*** 0.012*** 0.012*** 0.009*** 0.024*** 0.011***

    (6.42) (4.88) (4.96) (4.51) (6.17) (4.42)

    Uagg_YC 0.058*** 0.052*** 0.054*** 0.049*** 0.047*** 0.038***

    (21.03) (18.43) (19.57) (17.18) (16.32) (7.47)Uagg_NC 0.036*** 0.032*** 0.029*** 0.033*** 0.031*** 0.072***

    (15.23) (13.33) (12.21) (13.75) (11.86) (16.44)

    Constant   −1.292*** 1.852*** 2.453*** 1.580*** 0.939*** 0.999***

    (−5.27) (7.36) (10.03) (6.26) (3.39) (3.88)

    Prob.   N F 0.000 0.000 0.000 0.000 0.000 0.000

    R-square 0.062 0.440 0.043 0.471 0.332 0.461

    Observations 849,771 849,285 847,308 849,771 849,771 849,776

    Notes:

    1. All estimations include control variables as well as rm, time, industry and province xed effects.

    2. t-statistics are in the parentheses.

    3. ***corresponds to signicance at 1% level.

    4. The data used for estimation is from Annual Survey of Above-scale Industrial Enterprises.

    5. agg_NC is thenumber of rmsin thesame industry andsame county;agg_YCis theaverage size of rmsin thesame industry andsame county; agg_NPis thenumber

    of  rms in the same industry same province but outside the county; agg_YP is the average size of  rms in the same industry same province but outside the county;

    Uagg_NC is the number of  rms in the same county but upstream industries; and Uagg_YC is the average size of  rms in the same county but upstream industries.All these variables are in natural log form.

    63C. Hu et al. / China Economic Review 33 (2015) 50–66

  • 8/17/2019 Agglomeration and Productivity in China Firm Level Evidence

    15/17

    First, we usethe total factor productivity estimated by OLS (column 1). The novelnding that agglomeration captured as the num-

    ber of rms in thesame industry impacts theproductivitynegatively when it is within the same county remains valid. Also the sizable

    positive impact from the agglomeration in upstream industries, as well as the larger coef cients on agg_YP compared to agg_YC are all

    preserved. We then use the TFP estimated by the method of Olley and Pakes (1996), obtaining similar results (column 2). Because the

    estimated TFP may pick up the difference in prices and mark-up rates amongrms, we also check if our ndings are robust to another

    measure of productivity, namely a  rm's labor productivity measured as value-added per employee. Again, we obtain very similar

    results (column 3).

    We next assess the sensitivity of our results to industrial classication. The alternative indicators of agglomeration are constructed

    under  ner industrial classication (four-digit) as well as rougher classication (two-digit). Column 4 displays the results of 

    estimation based on four-digit industrial classication, which is very similar to the base results. In the estimation based on

    two-digit industrial classication, however, the coef cient on agg_NC becomes insignicant while remaining negative (column 5).

    One interpretation to this change is that under a rougher classication, agg_NC may count  rms that are not actually competing in

    the same industry, thereby diluting the negative impact from intensied competition. All other observations are preserved.

    Finally, we construct the indicators of agglomerationat alternative regional unit, namely at city level. A city is a largerregional unit

    than a county, for it usually comprises several counties. The results presented in column 6 are somewhat similar to the case of 

    indicators constructed on basis of two-digit industrial classication. The negative coef cient on agg_NC becomes insignicant while

    all other observations are preserved. One explanation to this change is that congestion is less severe under a larger geographical

    scope of industrial concentration. Put differently, agg_NC in this case comprises some of  rms previously counted as agg_NP, which

    has signicantly positive coef cient.

    5. Contribution to the industry-level productivity growth

    How much did the industrial agglomeration contributed to the productivity growth of China's industrial sector? We exploit the

    production function used to compute rm-level TFP we described in Section 3.3 and the coef cients on indicators of agglomeration

    in column 2 of  Table 6 to compute the share of TFP growth during 2000–2007 that is explained by agglomeration. Note that the

    data used for this exercise are based on the Survey data, given the long time-span needed. First, we obtain the industry-level TFP

    by plugging the employment and capital stock of each industry to the estimated production function which has different input

    coef cients for each industry (see Section 3.3). We then aggregate the industry-level TFP using output shares as weights to obtain

    the aggregated TFP of the whole industrial sector. The average annual growth rate of such aggregated TFP, displayed in  rst column

    of  Table 11, is 8.23%. The same column displays the average annual TFP growth of the private sector obtained from the same exercise

    but using only the sample of PEs, which is 7.71%.

    The aggregate TFP growth due to agglomeration effects is computed by plugging the industry-level mean of each indicator of 

    agglomeration during 2000–2007 into the Eq. (15), holding other variables constant. The estimated coef cients are from column 2

    of  Table 6. This yields the predicted TFP growth for each industrial sector j:

    ΔT FP  j;t  ¼  β̂ 11Δt agg YC  j;t  þ  β̂ 12Δt agg NC  j;t  þ  β̂ 13Δt agg YP  j;t  þ  β̂ 14Δt agg NP  j;t 

    β̂ 21Δt Uagg YC  j;t  þ  β̂ 22Δt Uagg NC  j;t  þ  β̂ 23Δt Uagg YP  j;t  þ  β̂ 24Δt Uagg NP  j;t ð15Þ

    where the upper bar on each indicator of agglomeration indicates industry-level mean during 2000–2007 and the hat on each of co-

    ef cients indicates that they are estimates from the model. We once again aggregate those industry-level TFP growth using output

    share as weights to obtain the aggregate TFP growth of industrial sector predicted by agglomeration effects. Such predicted growth,

    shown in the second column of  Table 9, is 1.16%, which is equivalent to 14.09% of the average annual TFP growth of total industrial

    sector. The contribution share of agglomeration is larger in private sector, amounting to 15.43%. It is also seen that the contribution

    share of agglomeration is smaller but still comparable to that of R&D investment computed following the same steps. Although not

    captured by the Survey data, the private sector of China's industry consists of numerous small rms thatdo not engagein R&D invest-

    ment due to resource constraint. For those rms, agglomeration effects can be the primary driver of productivity growth.

     Table 11

    Contribution of agglomeration to industry-level TFP growth during 2000–2007.

    Average annual TFP growth rate

    of industrial sector (percentage)

    Agglomeration effect R&D

    Contribution to average

    growth rate (percentage)

    Contribution share

    (percentage)

    Contribution to average

    growth rate (percentage)

    Contribution share

    (percentage)

    All rms 8.23 1.16***

    (331.81)

    14.09 1.88***

    (25.00)

    22.84

    Private Enterprises 7.71 1.19***

    (230.73)

    15.43 1.53***

    (10.53)

    19.84

    Notes:

    1. t-statistics are in parentheses.

    2. *** indicates signicant at 1% level.3. The data used for computation is from the coef cients on indicators of agglomeration in column 2 of  Table 6 and Annual Survey of Above-scale Industrial Enterprises.

    64   C. Hu et al. / China Economic Review 33 (2015) 50–66

  • 8/17/2019 Agglomeration and Productivity in China Firm Level Evidence

    16/17

    6. Conclusions

    This paper cond