2017
DOI: 10.1186/s12889-017-4914-3
|View full text |Cite
|
Sign up to set email alerts
|

A systematic review of data mining and machine learning for air pollution epidemiology

Abstract: BackgroundData measuring airborne pollutants, public health and environmental factors are increasingly being stored and merged. These big datasets offer great potential, but also challenge traditional epidemiological methods. This has motivated the exploration of alternative methods to make predictions, find patterns and extract information. To this end, data mining and machine learning algorithms are increasingly being applied to air pollution epidemiology.MethodsWe conducted a systematic literature review on… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
104
1
2

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
3
1

Relationship

0
10

Authors

Journals

citations
Cited by 192 publications
(107 citation statements)
references
References 61 publications
0
104
1
2
Order By: Relevance
“…The new possibilities of finding new causal patterns using bigger sets of data is surely the best advantages of using DL for epidemiological purposes [65]. Besides, such data are the result of integrating multimodal sources, like visual sources combined with classic informational sources [66], but the future, with different modes and more data capture devices could integrate smell, taste, movements of agents, etc. Deep convolutional neural networks can help us, for example, to estimate environmental exposures using images and other complementary data sources such as cell phone mobility and social media information.…”
Section: Can DL Be Of Some Utility For the Epidemiological Debates Onmentioning
confidence: 99%
“…The new possibilities of finding new causal patterns using bigger sets of data is surely the best advantages of using DL for epidemiological purposes [65]. Besides, such data are the result of integrating multimodal sources, like visual sources combined with classic informational sources [66], but the future, with different modes and more data capture devices could integrate smell, taste, movements of agents, etc. Deep convolutional neural networks can help us, for example, to estimate environmental exposures using images and other complementary data sources such as cell phone mobility and social media information.…”
Section: Can DL Be Of Some Utility For the Epidemiological Debates Onmentioning
confidence: 99%
“…We note that tree-based and lasso approaches have been demonstrated to have appealing features that allow handling the analytical challenges of the chemical mixture data as presented in the workshop, and these methods are also increasingly employed in the field of high dimensional data analysis [20][21][22][23][24][25]. Motivated by a recent report which used combined CART and variable selection methods to analyze multiple pollutants and their interactions [3], we propose to use an improved two-step procedure of combining the random forest (RF) [25,26] with adaptive lasso [27] approaches.…”
Section: Review Of Existing Statistical Approaches and Their Limitationsmentioning
confidence: 99%
“…[2] conducted a systemic review of applying data mining and machine learning techniques in an air pollution epidemiology. According to [2], all 400 reviewed articles are separated into 3 main research areas which include a source apportionment, a prediction of air pollution or quality or exposure, and a hypothesis generation. Our work closely aligns with the second category.…”
Section: Related Workmentioning
confidence: 99%