2022
DOI: 10.1177/23998083221108185
|View full text |Cite
|
Sign up to set email alerts
|

Modeling clusters from the ground up: A web data approach

Abstract: This paper proposes a new methodological framework to identify economic clusters over space and time. We employ a unique open source dataset of geolocated and archived business webpages and interrogate them using Natural Language Processing to build bottom-up classifications of economic activities. We validate our method on an iconic UK tech cluster – Shoreditch, East London. We benchmark our results against existing case studies and administrative data, replicating the main features of the cluster and providi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 90 publications
0
2
0
Order By: Relevance
“…Our approach to identifying these microclusters draws upon recent studies that have explored clustering at a fine geographical level using data scraped from company websites (e.g., Papagiannidis et al, 2018; Rammer et al, 2020; Siepel et al, 2020; Stich et al, 2023). These studies generally use the addresses provided on company websites to identify and inductively map clusters of activity, often in a way that differs or is more insightful than standard SIC codes (as in Papagiannidis et al, 2018).…”
Section: Methodology and Datamentioning
confidence: 99%
See 1 more Smart Citation
“…Our approach to identifying these microclusters draws upon recent studies that have explored clustering at a fine geographical level using data scraped from company websites (e.g., Papagiannidis et al, 2018; Rammer et al, 2020; Siepel et al, 2020; Stich et al, 2023). These studies generally use the addresses provided on company websites to identify and inductively map clusters of activity, often in a way that differs or is more insightful than standard SIC codes (as in Papagiannidis et al, 2018).…”
Section: Methodology and Datamentioning
confidence: 99%
“…These studies generally use the addresses provided on company websites to identify and inductively map clusters of activity, often in a way that differs or is more insightful than standard SIC codes (as in Papagiannidis et al, 2018). Our approach differs from previous studies in that, whereas other studies looking at clustering using web data have looked at clustering in an individual neighbourhood or city (Rammer et al, 2020; Stich et al, 2023) or one region (Papagiannidis et al, 2018), we attempt to map across the whole of England, and we try to use this data to identify clusters in rural areas. To our knowledge, neither of these has been done previously.…”
Section: Methodology and Datamentioning
confidence: 99%