2019
DOI: 10.1093/aje/kwz189
|View full text |Cite
|
Sign up to set email alerts
|

What is Machine Learning? A Primer for the Epidemiologist

Abstract: Machine learning is a branch of computer science that has the potential to transform epidemiologic sciences. Amid a growing focus on “Big Data,” it offers epidemiologists new tools to tackle problems for which classical methods are not well-suited. In order to critically evaluate the value of integrating machine learning algorithms and existing methods, however, it is essential to address language and technical barriers between the two fields that can make it difficult for epidemiologists to read and assess ma… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
270
0
6

Year Published

2020
2020
2024
2024

Publication Types

Select...
10

Relationship

0
10

Authors

Journals

citations
Cited by 303 publications
(276 citation statements)
references
References 130 publications
0
270
0
6
Order By: Relevance
“…Either deep learning algorithms were not used [ 19 , 21 , 29 , 30 , 33 , 36 ], the datasets were small [ 12 , 20 , 22 , 23 , 31 ], or the used computational software could not simultaneously employ a higher number of models [ 12 , 37 ]. Moreover, the global demand for machine learning solutions often exceeds the expertise of healthcare providers to effectively utilize ML [ 38 ].…”
Section: Introductionmentioning
confidence: 99%
“…Either deep learning algorithms were not used [ 19 , 21 , 29 , 30 , 33 , 36 ], the datasets were small [ 12 , 20 , 22 , 23 , 31 ], or the used computational software could not simultaneously employ a higher number of models [ 12 , 37 ]. Moreover, the global demand for machine learning solutions often exceeds the expertise of healthcare providers to effectively utilize ML [ 38 ].…”
Section: Introductionmentioning
confidence: 99%
“…In addition to quantifying the contribution of each characteristic, this approach differentiates between mediation effects due to disparate levels versus disparate effects of a mediator, providing a model that is applicable to a wide range of health-related inquiries. Machine learning or decision tree approaches constitute another proposed analytic framework ( Bi et al, 2019 ; Cairney et al, 2014 ). This one involves successive subgroupings of a sample based on ‘within group’ similarities in terms of the health outcome variable.…”
Section: Introductionmentioning
confidence: 99%
“…In supervised techniques, the outcome or dependent variable is available for a given set of observations. Supervised techniques are further divided into regression or classification techniques depending upon the data type of the outcome variable: continuous or categorical [ 40 ]. In the literature, artificial neural network–based deep learning and tree-based gradient tree–boosting techniques have demonstrated better prediction capabilities in exploring nonlinear relationships among correlated predictors [ 41 - 49 ].…”
Section: Methodsmentioning
confidence: 99%