Journal of Geographical Sciences >
Influencing factors of manufacturing agglomeration in the Beijing-Tianjin-Hebei region based on enterprise big data
Huang Yujin, PhD Candidate, specialized in economic geography. E-mail: huangyj.19s@igsnrr.ac.cn |
Received date: 2022-05-08
Accepted date: 2022-07-12
Online published: 2022-12-25
Supported by
Strategic Priority Research Program of the Chinese Academy of Sciences(XDA19040401)
National Natural Science Foundation of China(41871117)
National Natural Science Foundation of China(41771173)
Industrial agglomeration is a highly prominent geographical feature of economic activities, and it is an important research topic in economic geography. However, mechanism-based explanations of industrial agglomeration often differ due to a failure to distinguish properly between the spatial distribution of industries and the stages of industrial agglomeration. Based on micro data from three national economic censuses, this study uses the Duranton-Overman (DO) index method to calculate the spatial distribution of manufacturing industries (three-digit classifications) in the Beijing-Tianjin-Hebei region (BTH region hereafter) from 2004 to 2013 as well as the hurdle model to explain quantitatively the influencing factors and differences in the two stages of agglomeration formation and agglomeration development. The research results show the following: (1) In 2004, 2008, and 2013, there were 124, 127, and 129 agglomerations of three-digit industry types in the BTH region, respectively. Technology-intensive and labor-intensive manufacturing industries had high agglomeration intensity, but overall agglomeration intensity declined during the study period, from 0.332 to 0.261. (2) There are two stages of manufacturing agglomeration, with different dominant factors. During the agglomeration formation stage, the main locational considerations of enterprises are basic conditions. Agricultural resources and transportation have negative effects on agglomeration formation, while labor pool and foreign investment have positive effects. In the agglomeration development stage, enterprises focus more on factors such as agglomeration economies and policies. Internal and external industry linkages both have a positive effect, with the former having a stronger effect, while development zone policies and electricity, gas, and water resources have a negative effect. (3) Influencing factors on industrial agglomeration have a scale effect, and they all show a weakening trend as distance increases, but different factors respond differently to distance.
HUANG Yujin , SHENG Kerong , SUN Wei . Influencing factors of manufacturing agglomeration in the Beijing-Tianjin-Hebei region based on enterprise big data[J]. Journal of Geographical Sciences, 2022 , 32(10) : 2105 -2128 . DOI: 10.1007/s11442-022-2039-9
Figure 1 Spatial distribution curves of four typical industriesNote: The solid line represents the actual spatial distribution curve of the industry, the gray strip represents the 95% global confidence interval under random conditions, and the dotted line represents the average value of the confidence interval. |
Figure 2 Kernel density distribution of manufacturing enterprises in the Beijing-Tianjin-Hebei region in 2004, 2008, and 2013 |
Table 1 The number and agglomeration intensity of agglomerated, dispersed, and randomly distributed industries in the Beijing-Tianjin-Hebei region in 2004, 2008, and 2013 |
Year | Agglomerated industries | Dispersed industries | Randomly distributed industries | Total number of industries | Average agglomeration intensity | |||
---|---|---|---|---|---|---|---|---|
Number | Proportion | Number | Proportion | Number | Proportion | |||
2004 | 124 | 76.5% | 22 | 13.6% | 16 | 9.9% | 162 | 0.332 |
2008 | 127 | 78.4% | 27 | 16.7% | 8 | 4.9% | 162 | 0.307 |
2013 | 129 | 76.8% | 29 | 17.3% | 10 | 6.0% | 168 | 0.261 |
Table 2 Top 10 manufacturing industries in the Beijing-Tianjin-Hebei region in terms of agglomeration intensity |
Industry classification | 2004 | Industry classification | 2008 | Industry classification | 2013 |
---|---|---|---|---|---|
Aerospace vehicle manufacturing | 1.22 | Leather tanning | 1.09 | Manufacturing of wire rope and its products | 1.09 |
Electronic computer manufacturing | 1.12 | Bicycle manufacturing | 1.07 | Bicycle manufacturing | 1.06 |
Bicycle manufacturing | 1.10 | Other unspecified manufacturing | 0.94 | Motorcycle manufacturing | 0.90 |
Bookbinding and other printing service activities | 1.07 | Aerospace vehicle manufacturing | 0.92 | Cultural and office machinery manufacturing | 0.78 |
Other electronic equipment manufacturing | 1.01 | Ship and floating device manufacturing | 0.90 | Metal furniture manufacturing | 0.74 |
Ship and floating device manufacturing | 1.01 | Bookbinding and other printing service activities | 0.84 | Electronic component manufacturing | 0.71 |
General equipment manufacturing | 0.92 | Electronic computer manufacturing | 0.80 | Leather goods manufacturing | 0.69 |
Manufacturing of special instruments | 0.91 | Electronic component manufacturing | 0.75 | Aerospace vehicle and equipment manufacturing | 0.66 |
Leather tanning | 0.90 | Medical instrument and equipment manufacturing | 0.75 | Fur tanning and product processing | 0.66 |
Biological and biochemical products manufacturing | 0.83 | Metal furniture manufacturing | 0.65 | Leather tanning | 0.65 |
Note: The theoretical value range of overall agglomeration intensity in the 0-194 km range is 0-1, but to simplify the calculation in this study, the results of 512 dispersed distances were added, resulting in the expansion of the results to 2.63 times the original, which does not affect the intensity comparison. |
Figure 3 Number of agglomerated industries (a) and agglomeration index of industries (b) in the Beijing-Tianjin-Hebei region at different distances |
Table 3 Descriptions of core explanatory variables |
Influencing factor | Variable | Quantitative indicator | Name | Data source |
---|---|---|---|---|
Resource endowment (RES) | Agriculture | Intermediate inputs in agriculture, forestry, animal husbandry and fishery as a proportion of total industry inputs | RES_AGR | Regional input- output tables |
Mining | Intermediate inputs in coal, petroleum, metal, and non-metal as a proportion of total industry inputs | RES_MIN | ||
Electricity, gas, water | Intermediate inputs in electricity, gas, and water supply as a proportion of total industry inputs | RES_ENE | ||
Agglomeration economies (AGG) | Labor pool | Number of employees in the industry | AGG_EMP | Economic census data |
Internal links of industries | Intermediate inputs in industries as a proportion of total inputs | AGG_INI | Regional input-output tables | |
External links of industries | Intermediate inputs of other manufacturing products as a proportion of total inputs | AGG_INT | ||
Knowledge spillover | Number of industry patents | AGG_TEC | PatSnap patent platform | |
Government behavior (GOV) | Local protectionism | State-owned enterprises in the industry as a proportion of all enterprises | GOV_NAT | Economic census data |
Development zone policies | Number of times the industry has become the target of a development zone | GOV_LEV | Catalogue of China Development Zones | |
Globalization (GLO) | Foreign trade | Industry exports value as a proportion of total sales value | GLO_EXP | Industrial enterprise database |
Foreign investment | Foreign-invested enterprises in the industry as a proportion of all enterprises | GLO_FOR | Economic census data |
Note: To reduce the two-way causal relationship between independent variables and dependent variables, the variables derived from the micro-data of industrial enterprises are all lagged by one period. |
Table 4 Descriptive statistics of variables |
Variable type | Variable name | Observations | Average | Standard deviation | Minimum | Maximum |
---|---|---|---|---|---|---|
Dependent variable | DO (50) | 492 | 0.170 | 0.222 | 0 | 1.201 |
DO (100) | 492 | 0.204 | 0.248 | 0 | 1.203 | |
DO (150) | 492 | 0.224 | 0.262 | 0 | 1.224 | |
DO (194) | 492 | 0.231 | 0.264 | 0 | 1.224 | |
Independent variable | RES_AGR | 492 | 0.053 | 0.094 | 0 | 0.311 |
RES_MIN | 492 | 0.039 | 0.078 | 0 | 0.601 | |
RES_ENE | 492 | 0.030 | 0.0178 | 0 | 0.077 | |
AGG_EMP | 492 | 36953 | 56086 | 167 | 520480 | |
AGG_INI | 492 | 0.249 | 0.114 | 0 | 0.528 | |
AGG_INT | 492 | 0.254 | 0.147 | 0 | 0.543 | |
AGG_TEC | 492 | 167.4 | 537.8 | 0 | 5,600 | |
GOV_NAT | 492 | 0.015 | 0.026 | 0 | 0.333 | |
GOV_LEV | 492 | 28.68 | 40.29 | 0 | 138 | |
GLO_EXP | 492 | 0.165 | 0.180 | 0 | 0.898 | |
GLO_FOR | 492 | 0.062 | 0.052 | 0 | 0.339 | |
SPA_BJ | 492 | 0.231 | 0.165 | 0 | 0.870 | |
SPA_TJ | 492 | 0.256 | 0.137 | 0.022 | 0.775 | |
RES_TRA | 492 | 0.036 | 0.014 | 0 | 0.115 |
Note: The dependent variable DO(S) represent the industrial agglomeration intensity within the range of S calculated according to Formula 4. |
Table 5 Probit regression results of the first stage in the hurdle model |
Model | 2004‒2008 | 2004‒2013 | ||||||
---|---|---|---|---|---|---|---|---|
(1) | (2) | (3) | (4) | (5) | (6) | (7) | (8) | |
S | 50 km | 100 km | 150 km | 194 km | 50 km | 100 km | 150 km | 194 km |
RES_AGR | -3.813** | -4.153** | -4.106** | -3.703** | -3.042** | -3.158** | -3.119** | -2.837* |
(1.695) | (1.734) | (1.709) | (1.694) | (1.467) | (1.488) | (1.489) | (1.509) | |
RES_MIN | 1.325 | 1.262 | 1.201 | 0.915 | 0.249 | 0.162 | 0.153 | 0.202 |
(2.039) | (2.099) | (2.067) | (1.995) | (1.412) | (1.432) | (1.424) | (1.413) | |
RES_ENE | -13.264 | -14.357* | -13.884 | -10.649 | 6.624 | 7.071 | 7.579 | 7.808 |
(8.470) | (8.597) | (8.643) | (8.524) | (5.391) | (5.520) | (5.511) | (5.437) | |
AGG_EMP | 0.287*** | 0.321*** | 0.328*** | 0.373*** | 0.240*** | 0.252*** | 0.258*** | 0.311*** |
(0.086) | (0.083) | (0.084) | (0.085) | (0.058) | (0.057) | (0.057) | (0.058) | |
AGG_INI | 1.086 | 1.716 | 1.720 | 1.459 | -0.637 | -0.345 | -0.428 | -0.570 |
(1.706) | (1.701) | (1.690) | (1.698) | (1.278) | (1.293) | (1.298) | (1.318) | |
AGG_INT | 0.206 | 0.516 | 0.395 | 0.570 | 0.929 | 1.133 | 1.119 | 1.268 |
(1.609) | (1.632) | (1.608) | (1.611) | (1.377) | (1.396) | (1.399) | (1.433) | |
AGG_TEC | -0.014 | -0.006 | -0.028 | -0.044 | -0.061 | -0.060 | -0.069* | -0.067 |
(0.077) | (0.084) | (0.084) | (0.094) | (0.041) | (0.042) | (0.042) | (0.043) | |
GOV_NAT | 0.651 | 0.756 | 0.224 | 0.883 | -1.233 | -1.382 | -1.634 | -0.937 |
(3.807) | (4.088) | (4.056) | (3.903) | (2.271) | (2.284) | (2.299) | (2.266) | |
GOV_LEV | -0.007 | -0.010 | -0.006 | -0.002 | -0.001 | -0.002 | -0.001 | -0.001 |
Model | 2004‒2008 | 2004‒2013 | ||||||
(1) | (2) | (3) | (4) | (5) | (6) | (7) | (8) | |
(0.006) | (0.006) | (0.006) | (0.007) | (0.003) | (0.003) | (0.003) | (0.003) | |
GLO_EXP | 0.611 | 0.551 | 0.608 | 0.456 | ||||
(0.585) | (0.627) | (0.621) | (0.594) | |||||
GLO_ FOR | 5.552** | 6.333** | 6.367** | 4.989* | 4.690** | 4.830** | 4.891** | 3.434* |
(2.780) | (3.052) | (3.089) | (2.979) | (2.003) | (2.132) | (2.150) | (2.034) | |
SPA_BJ | 2.568*** | 2.583*** | 2.592*** | 2.204*** | 2.501*** | 2.491*** | 2.515*** | 2.357*** |
(0.672) | (0.712) | (0.713) | (0.706) | (0.517) | (0.533) | (0.534) | (0.539) | |
SPA_TJ | 1.751** | 1.945** | 1.658** | 1.118 | 2.580*** | 2.769*** | 2.623*** | 2.316*** |
(0.694) | (0.756) | (0.735) | (0.718) | (0.562) | (0.583) | (0.579) | (0.583) | |
RES_TRA | -13.759 | -15.439* | -15.359* | -13.606 | -12.833** | -13.414** | -13.757** | -12.867** |
(8.473) | (8.735) | (8.670) | (8.390) | (6.412) | (6.547) | (6.547) | (6.558) | |
Constant | -2.321** | -2.652** | -2.656** | -2.771** | -2.218** | -2.353*** | -2.342*** | -2.525*** |
(1.118) | (1.100) | (1.092) | (1.092) | (0.894) | (0.896) | (0.897) | (0.910) | |
Time fixed effect | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
Observations | 324 | 324 | 324 | 324 | 492 | 492 | 492 | 492 |
Pseudo R2 | 0.224 | 0.247 | 0.241 | 0.219 | 0.223 | 0.237 | 0.236 | 0.229 |
Note: The numbers in brackets are robust standard error, *, **, and *** mean significant at the level of 10%, 5%, and 1% respectively, the same below. |
Table 6 Agglomeration intensity of the top 5 two-digit industries in the agricultural resources input in 2013 |
Industry | Proportion of agricultural resources input | Agglomeration intensity within each distance | |||
---|---|---|---|---|---|
50 km | 100 km | 150 km | 194 km | ||
Agricultural and sideline food processing industry | 0.301 | 0.018 | 0.021 | 0.021 | 0.022 |
Food manufacturing | 0.301 | 0.022 | 0.023 | 0.023 | 0.024 |
Beverage manufacturing | 0.301 | 0.000 | 0.000 | 0.000 | 0.000 |
Textile industry | 0.245 | 0.144 | 0.156 | 0.169 | 0.193 |
Textile clothing, shoes, hats manufacturing | 0.126 | 0.161 | 0.250 | 0.258 | 0.258 |
Average of all industries | 0.057 | 0.152 | 0.180 | 0.195 | 0.200 |
Note: Only two-digit industries are involved in Input-Output Table. In this paper, the proportion of agricultural resource input is calculated by approximate matching method, and the agglomeration intensity is the average value of the three-digit industry. |
Table 7 OLS regression results of the second stage in the hurdle model |
Model | 2004-2008 | 2004-2013 | ||||||
---|---|---|---|---|---|---|---|---|
(1) | (2) | (3) | (4) | (5) | (6) | (7) | (8) | |
S | 50 km | 100 km | 150 km | 194 km | 50 km | 100 km | 150 km | 194 km |
RES_AGR | -0.001 | -0.009 | -0.025 | -0.027 | -0.086 | -0.041 | -0.055 | -0.058 |
(0.196) | (0.210) | (0.223) | (0.225) | (0.217) | (0.220) | (0.228) | (0.227) | |
RES_MIN | 0.413 | 0.439 | 0.389 | 0.402 | -0.120 | -0.089 | -0.133 | -0.126 |
(0.263) | (0.285) | (0.299) | (0.290) | (0.184) | (0.191) | (0.197) | (0.198) | |
RES_ENE | -4.905*** | -4.938*** | -5.619*** | -5.443*** | -3.143*** | -3.165*** | -3.724*** | -3.403*** |
(1.191) | (1.290) | (1.289) | (1.307) | (0.925) | (0.962) | (0.972) | (0.998) | |
AGG_EMP | -0.011 | -0.001 | 0.002 | 0.005 | -0.015 | -0.005 | -0.001 | -0.001 |
(0.013) | (0.014) | (0.014) | (0.013) | (0.009) | (0.010) | (0.010) | (0.010) | |
AGG_INI | 0.746*** | 0.800*** | 0.829*** | 0.857*** | 0.365** | 0.470*** | 0.498*** | 0.524*** |
(0.199) | (0.209) | (0.211) | (0.207) | (0.182) | (0.180) | (0.181) | (0.179) | |
AGG_INT | 0.350** | 0.414** | 0.481*** | 0.527*** | 0.155 | 0.288 | 0.338* | 0.385** |
(0.163) | (0.177) | (0.183) | (0.184) | (0.189) | (0.187) | (0.189) | (0.188) | |
AGG_TEC | 0.008 | 0.004 | 0.013 | 0.014 | -0.002 | -0.005 | -0.000 | -0.001 |
(0.009) | (0.010) | (0.010) | (0.010) | (0.007) | (0.007) | (0.007) | (0.007) | |
GOV_NAT | 0.248 | 0.070 | 0.074 | 0.090 | 0.140 | -0.127 | -0.102 | -0.121 |
(0.504) | (0.533) | (0.453) | (0.443) | (0.477) | (0.532) | (0.496) | (0.497) | |
GOV_LEV | -0.001 | -0.002* | -0.002** | -0.002** | -0.001 | -0.001** | -0.002*** | -0.002*** |
(0.001) | (0.001) | (0.001) | (0.001) | (0.001) | (0.001) | (0.001) | (0.001) | |
GLO_EXP | -0.043 | -0.028 | -0.027 | -0.018 | ||||
(0.086) | (0.096) | (0.096) | (0.094) | |||||
GLO_ FOR | 0.581 | 0.421 | 0.411 | 0.445 | 0.611** | 0.443 | 0.443 | 0.499 |
(0.379) | (0.416) | (0.412) | (0.410) | (0.290) | (0.309) | (0.312) | (0.312) | |
SPA_BJ | 0.224* | 0.311** | 0.342** | 0.325** | 0.198* | 0.274** | 0.315*** | 0.305*** |
(0.123) | (0.136) | (0.136) | (0.135) | (0.103) | (0.109) | (0.110) | (0.110) | |
SPA_TJ | 0.259 | 0.413** | 0.462*** | 0.441*** | 0.223 | 0.349** | 0.395*** | 0.384*** |
(0.171) | (0.178) | (0.165) | (0.162) | (0.139) | (0.146) | (0.139) | (0.137) | |
RES_TRA | -1.484 | -2.003 | -1.807 | -1.899 | -0.196 | -0.846 | -0.554 | -0.580 |
(1.608) | (1.688) | (1.682) | (1.663) | (1.204) | (1.268) | (1.284) | (1.311) | |
Constant | 0.061 | -0.039 | -0.080 | -0.115 | 0.190 | 0.074 | 0.027 | -0.001 |
(0.134) | (0.142) | (0.146) | (0.143) | (0.122) | (0.127) | (0.128) | (0.125) | |
Time fixed effect | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
Observations | 248 | 253 | 255 | 267 | 377 | 382 | 384 | 398 |
R2 | 0.345 | 0.307 | 0.346 | 0.348 | 0.240 | 0.231 | 0.264 | 0.262 |
Figure 4 Schematic diagram of two stages of manufacturing agglomeration |
Figure 5 The relationship between the regression coefficient of independent variables and the distance in the formation stage of agglomerationsNote: Figure a show the variables that can have a positive effect on the dependent variable, and Figure b shows the variables that have a negative effect. The dotted line in the figure indicates that the regression coefficient is not significant, and the solid line indicates that the regression coefficient is significant at 10% or 5%, the same as below. |
Figure 6 The relationship between the regression coefficient of independent variables and the distance in the agglomeration development stage |
[1] |
|
[2] |
|
[3] |
|
[4] |
|
[5] |
|
[6] |
|
[7] |
|
[8] |
|
[9] |
|
[10] |
|
[11] |
|
[12] |
|
[13] |
|
[14] |
|
[15] |
|
[16] |
|
[17] |
|
[18] |
|
[19] |
|
[20] |
|
[21] |
|
[22] |
|
[23] |
|
[24] |
|
[25] |
|
[26] |
|
[27] |
|
[28] |
|
[29] |
|
[30] |
|
[31] |
|
[32] |
|
[33] |
|
[34] |
|
[35] |
|
[36] |
|
[37] |
|
[38] |
|
[39] |
|
[40] |
|
[41] |
|
[42] |
|
[43] |
|
[44] |
|
[45] |
|
[46] |
|
[47] |
|
[48] |
|
[49] |
|
[50] |
|
/
〈 | 〉 |