Background: The built environment is a structural determinant of health and has been shown to influence health expenditures, behaviors, and outcomes. Traditional methods of assessing built environment characteristics are timeconsuming and difficult to combine or compare. Google Street View (GSV) images represent a large, publicly available data source that can be used to create indicators of characteristics of the physical environment with machine learning techniques. The aim of this study is to use GSV images to measure the association of built environment features with health-related behaviors and outcomes at the census tract level. Methods: We used computer vision techniques to derive built environment indicators from approximately 31 million GSV images at 7.8 million intersections. Associations between derived indicators and health-related behaviors and outcomes on the census-tract level were assessed using multivariate regression models, controlling for demographic factors and socioeconomic position. Statistical significance was assessed at the α = 0.05 level. Results: Single lane roads were associated with increased diabetes and obesity, while non-single-family home buildings were associated with decreased obesity, diabetes and inactivity. Street greenness was associated with decreased prevalence of physical and mental distress, as well as decreased binge drinking, but with increased obesity. Socioeconomic disadvantage was negatively associated with binge drinking prevalence and positively associated with all other health-related behaviors and outcomes. Conclusions: Structural determinants of health such as the built environment can influence population health. Our study suggests that higher levels of urban development have mixed effects on health and adds further evidence that socioeconomic distress has adverse impacts on multiple physical and mental health outcomes.
Neighborhood attributes have been shown to influence health, but advances in neighborhood research has been constrained by the lack of neighborhood data for many geographical areas and few neighborhood studies examine features of nonmetropolitan locations. We leveraged a massive source of Google Street View (GSV) images and computer vision to automatically characterize national neighborhood built environments. Using road network data and Google Street View API, from December 15, 2017-May 14, 2018 we retrieved over 16 million GSV images of street intersections across the United States. Computer vision was applied to label each image. We implemented regression models to estimate associations between built environments and county health outcomes, controlling for county-level demographics, economics, and population density. At the county level, greater presence of highways was related to lower chronic diseases and premature mortality. Areas characterized by street view images as ‘rural’ (having limited infrastructure) had higher obesity, diabetes, fair/poor self-rated health, premature mortality, physical distress, physical inactivity and teen birth rates but lower rates of excessive drinking. Analyses at the census tract level for 500 cities revealed similar adverse associations as was seen at the county level for neighborhood indicators of less urban development. Possible mechanisms include the greater abundance of services and facilities found in more developed areas with roads, enabling access to places and resources for promoting health. GSV images represents an underutilized resource for building national data on neighborhoods and examining the influence of built environments on community health outcomes across the United States.
Objectives Built environments can affect health, but data in many geographic areas are limited. We used a big data source to create national indicators of neighborhood quality and assess their associations with health. Methods We leveraged computer vision and Google Street View images accessed from December 15, 2017, through July 17, 2018, to detect features of the built environment (presence of a crosswalk, non–single-family home, single-lane roads, and visible utility wires) for 2916 US counties. We used multivariate linear regression models to determine associations between features of the built environment and county-level health outcomes (prevalence of adult obesity, prevalence of diabetes, physical inactivity, frequent physical and mental distress, poor or fair self-rated health, and premature death [in years of potential life lost]). Results Compared with counties with the least number of crosswalks, counties with the most crosswalks were associated with decreases of 1.3%, 2.7%, and 1.3% of adult obesity, physical inactivity, and fair or poor self-rated health, respectively, and 477 fewer years of potential life lost before age 75 (per 100 000 population). The presence of non–single-family homes was associated with lower levels of all health outcomes except for premature death. The presence of single-lane roads was associated with an increase in physical inactivity, frequent physical distress, and fair or poor self-rated health. Visible utility wires were associated with increases in adult obesity, diabetes, physical and mental distress, and fair or poor self-rated health. Conclusions The use of computer vision and big data image sources makes possible national studies of the built environment’s effects on health, producing data and results that may inform national and local decision-making.
Background Social media platforms such as Twitter can serve as a potential data source for public health research to characterize the social neighborhood environment. Few studies have linked Twitter-derived characteristics to individual-level health outcomes. Objective This study aims to assess the association between Twitter-derived social neighborhood characteristics, including happiness, food, and physical activity mentions, with individual cardiometabolic outcomes using a nationally representative sample. Methods We collected a random 1% of the geotagged tweets from April 2015 to March 2016 using Twitter’s Streaming Application Interface (API). Twitter-derived zip code characteristics on happiness, food, and physical activity were merged to individual outcomes from restricted-use National Health and Nutrition Examination Survey (NHANES) with residential zip codes. Separate regression analyses were performed for each of the neighborhood characteristics using NHANES 2011-2016 and 2007-2016. Results Individuals living in the zip codes with the two highest tertiles of happy tweets reported BMI of 0.65 (95% CI –1.10 to –0.20) and 0.85 kg/m2 (95% CI –1.48 to –0.21) lower than those living in zip codes with the lowest frequency of happy tweets. Happy tweets were also associated with a 6%-8% lower prevalence of hypertension. A higher prevalence of healthy food tweets was linked with an 11% (95% CI 2% to 21%) lower prevalence of obesity. Those living in areas with the highest and medium tertiles of physical activity tweets were associated with a lower prevalence of hypertension by 10% (95% CI 4% to 15%) and 8% (95% CI 2% to 14%), respectively. Conclusions Twitter-derived social neighborhood characteristics were associated with individual-level obesity and hypertension in a nationally representative sample of US adults. Twitter data could be used for capturing neighborhood sociocultural influences on chronic conditions and may be used as a platform for chronic outcomes prevention.
Deep learning and, specifically, convoltional neural networks (CNN) represent a class of powerful models that facilitate the understanding of many problems in computer vision. When combined with a reasonable amount of data, CNNs can outperform traditional models for many tasks, including image classification. In this work, we utilize these powerful tools with imagery data collected through Google Street View images to perform virtual audits of neighborhood characteristics. We further investigate different architectures for chronic disease prevalence regression through networks that are applied to sets of images rather than single images. We show quantitative results and demonstrate that our proposed architectures outperform the traditional regression approaches. INDEX TERMS Set regression, multi-task learning, permutation invariant network, chronic disease prevalence, Google street view images.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.