2019
DOI: 10.1111/1365-2656.13076
|View full text |Cite
|
Sign up to set email alerts
|

How to make more from exposure data? An integrated machine learning pipeline to predict pathogen exposure

Abstract: Predicting infectious disease dynamics is a central challenge in disease ecology.Models that can assess which individuals are most at risk of being exposed to a pathogen not only provide valuable insights into disease transmission and dynamics but can also guide management interventions. Constructing such models for wild animal populations, however, is particularly challenging; often only serological data are available on a subset of individuals and nonlinear relationships between variables are common.2. Here … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
52
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
2
2

Relationship

2
6

Authors

Journals

citations
Cited by 36 publications
(52 citation statements)
references
References 53 publications
0
52
0
Order By: Relevance
“…We used an extensive ML ensemble pipeline [18], which is comprised of ve supervised algorithms, including logistic regression (LR), support vector machine (SVM), random forest (RF), gradient boosting (GBM), and extreme gradient boosting (XGB) to build a predictive model for the SARS-CoV-2 symptom status. We used multiple ML algorithms and compared the differences in their predictive performances.…”
Section: Data Pre-processingmentioning
confidence: 99%
See 2 more Smart Citations
“…We used an extensive ML ensemble pipeline [18], which is comprised of ve supervised algorithms, including logistic regression (LR), support vector machine (SVM), random forest (RF), gradient boosting (GBM), and extreme gradient boosting (XGB) to build a predictive model for the SARS-CoV-2 symptom status. We used multiple ML algorithms and compared the differences in their predictive performances.…”
Section: Data Pre-processingmentioning
confidence: 99%
“…We used multiple ML algorithms and compared the differences in their predictive performances. Some ML algorithms are less exible to missing data (i.e., SVM) [18], and therefore we imputed missing data using the RF-based nonparametric routine implemented in 'missForest' R package [19]. Also, because some of our selected algorithms (i.e., LR) cannot accommodate strongly collinear features, we removed features with the largest mean absolute correlation ( > 0.7) [20].…”
Section: Data Pre-processingmentioning
confidence: 99%
See 1 more Smart Citation
“…In the context of veterinary disease control, relatively little research has applied game-theoretic techniques, mostly using the standard static strategic-form game approach (21,22). Here, we combine epidemiological and economic parameters in type of game called a stochastic game that aims to analyse the adoption of the new diagnostic ELISA test for sheep scab in Scotland, where sheep scab is a notifiable disease requiring treatment upon confirmatory diagnosis.…”
Section: Introductionmentioning
confidence: 99%
“…To facilitate the design of such kind of experiments (which, in theory, could involve hundreds or thousands of combinations) we propose a methodology based upon a machine learning pipeline approach [25] in order to predict the survival of triatomines by different combinations of micro-climatic temperature values and exposure times. This approach leverages recent advance in machine learning to construct powerful but interpretable predictive models that can guide what combinations of variables could be important in shaping a species thermal limit and thus can configure feasible experimental designs.…”
Section: Introductionmentioning
confidence: 99%