2018
DOI: 10.1016/j.compenvurbsys.2018.01.010
|View full text |Cite
|
Sign up to set email alerts
|

Big enterprise registration data imputation: Supporting spatiotemporal analysis of industries in China

Abstract: A B S T R A C TBig, fine-grained enterprise registration data that includes time and location information enables us to quantitatively analyze, visualize, and understand the patterns of industries at multiple scales across time and space. However, data quality issues like incompleteness and ambiguity, hinder such analysis and application. These issues become more challenging when the volume of data is immense and constantly growing. High Performance Computing (HPC) frameworks can tackle big data computational … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
17
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
3

Relationship

2
7

Authors

Journals

citations
Cited by 26 publications
(17 citation statements)
references
References 47 publications
0
17
0
Order By: Relevance
“…The point dataset used in experiments was enterprises registration data in Hubei Province, China (369,826 points in total) from 1949 to 2015 recorded by the bureaus of Administration for Industry and Commerce (AIC) of China. The dataset was cleaned with imputation methods [56], and the study area was administrative boundary of Hubei Province. Spatiotemporal point analysis on these enterprise entities could support evaluation and assessment of industry concentration.…”
Section: Experiments Designmentioning
confidence: 99%
“…The point dataset used in experiments was enterprises registration data in Hubei Province, China (369,826 points in total) from 1949 to 2015 recorded by the bureaus of Administration for Industry and Commerce (AIC) of China. The dataset was cleaned with imputation methods [56], and the study area was administrative boundary of Hubei Province. Spatiotemporal point analysis on these enterprise entities could support evaluation and assessment of industry concentration.…”
Section: Experiments Designmentioning
confidence: 99%
“…The versions of Spark and Hadoop are v2.3 and v2.7 respectively. The experiment datasets were resampled from the enterprises registration data in Chongqing, China (2,119,419 points in total) from year 1949 to year 2018 recorded by the bureaus of Administration for Industry and Commerce (AIC) of China after imputation (Li et at., 2018). For more performance analysis on space-time K function, please refer to our previous research (Wang et al, 2020).…”
Section: Methodsmentioning
confidence: 99%
“…For example, a huge number of Landsat images are utilized in mapping high-resolution global forest cover and the global forest changes in the twentyfirst century are explored (Hansen et al 2013), which is impossible without the support of geospatial big data and the related automatic processing techniques. Based on the huge enterprise registration data in China, the economic and social development situations and trends are revealed by the non-statistic data and novel approaches (Li et al 2018). City-wide fine-grained urban population distribution at building level is achieved by integrating and fusing multisource geospatial big data (Yao et al 2017), which is usually not desired in traditional research.…”
Section: Geospatial Big Data For Urban Planning and Urban Managementmentioning
confidence: 99%