2020) The contiguous United States in eleven zip codes: identifying and mapping socio-economic census data clusters and exemplars using affinity propagation, Journal of Maps, 16:1, 57-67,
ABSTRACT
The United States is a diverse and heterogeneous place. Accurately organizing and mapping the U.S. into different regions based on characteristics such as wealth, race, education, language, and occupation is a complicated and arduous task. This paper demonstrates the application of affinity propagation to map socio-economic patterns and identify representative exemplars. Affinity propagation clusters data around representative exemplars and treats every data point as a potential cluster exemplar. We use socio-economic data from the United States census to cluster ZIP Code Tabulation Areas and identify locations that represent the socio-economic diversity of the United States. The 11 socio-economic clusters were mapped individually and together using area-based generalization. Mapping the results illustrated distinct regionalization and historical migration trends within the United States, as well as national urban/suburban/rural patterns. Future applications of this technique may be useful for data-driven socio-economic analysis and purposive sampling.
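To make the exemplar idea concrete, the following is a minimal sketch of the standard affinity propagation message-passing updates (responsibilities and availabilities), written in plain Python. It is not the authors' implementation. The function name `affinity_propagation`, the damping value, the iteration count, the median-similarity preference, and the toy 2-D points (standing in for standardized census variables per ZIP Code Tabulation Area) are all assumptions made for illustration.

```python
from statistics import median


def affinity_propagation(points, damping=0.7, iterations=200):
    """Toy affinity propagation; returns (exemplar indices, label per point)."""
    n = len(points)
    # Similarity = negative squared Euclidean distance between points.
    S = [[-sum((a - b) ** 2 for a, b in zip(p, q)) for q in points]
         for p in points]
    # Assumption: the shared "preference" (self-similarity) is the median
    # off-diagonal similarity; lower values tend to give fewer clusters.
    pref = median(S[i][k] for i in range(n) for k in range(n) if i != k)
    for i in range(n):
        S[i][i] = pref

    R = [[0.0] * n for _ in range(n)]  # responsibilities r(i, k)
    A = [[0.0] * n for _ in range(n)]  # availabilities a(i, k)
    for _ in range(iterations):
        # r(i,k) <- s(i,k) - max over k' != k of [a(i,k') + s(i,k')]
        for i in range(n):
            for k in range(n):
                best_other = max(A[i][j] + S[i][j] for j in range(n) if j != k)
                R[i][k] = damping * R[i][k] + (1 - damping) * (S[i][k] - best_other)
        # a(i,k) <- min(0, r(k,k) + sum of positive r(i',k), i' not in {i,k})
        # a(k,k) <- sum of positive r(i',k), i' != k
        for k in range(n):
            pos = [max(0.0, R[j][k]) for j in range(n)]
            total = sum(pos) - pos[k]
            for i in range(n):
                if i == k:
                    new = total
                else:
                    new = min(0.0, R[k][k] + total - pos[i])
                A[i][k] = damping * A[i][k] + (1 - damping) * new

    # A point is an exemplar when its self-responsibility plus
    # self-availability is positive.
    exemplars = [k for k in range(n) if R[k][k] + A[k][k] > 0]
    # Each point takes the most similar exemplar as its label.
    # (This sketch does not handle the case where no exemplar emerges.)
    labels = [i if i in exemplars else max(exemplars, key=lambda k: S[i][k])
              for i in range(n)]
    return exemplars, labels


# Hypothetical data: two well-separated groups, each with a central point.
points = [(0, 0), (1, 0), (-1, 0), (0, 1), (0, -1),
          (10, 10), (11, 10), (9, 10), (10, 11), (10, 9)]
exemplars, labels = affinity_propagation(points)
print("exemplars:", exemplars)
print("labels:", labels)
```

Unlike k-means, the number of clusters is not fixed in advance here. It emerges from the preference value, and each cluster is summarized by an actual observation rather than a computed centroid, which is what allows a real location to stand in for a cluster.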
This study demonstrates the application of affinity propagation as a data-driven approach to identifying and mapping typologies of place along the urban-rural continuum. The authors characterize ZIP Code Tabulation Areas using demographic, economic, land cover, and transportation-infrastructure accessibility data, which results in 22 clusters, 15 of which have a major rural component. The spatial pattern of these clusters varies, reflecting the heterogeneity in U.S. rurality. Rural is not a single concept that can be simply defined by population density. By comparing three economic indicators before and after the global financial crisis of 2007 to 2012, the authors find that the degree of economic recovery is captured by rural typologies. They compare both the methodological results and the analysis of socioeconomic resilience to two of the most widely used threshold-based regional typologies, one developed by the U.S. Department of Agriculture Economic Research Service and one used by the American Communities Project.