2023
DOI: 10.3389/fmicb.2023.1250909
|View full text |Cite
|
Sign up to set email alerts
|

Overview of data preprocessing for machine learning applications in human microbiome research

Eliana Ibrahimi,
Marta B. Lopes,
Xhilda Dhamo
et al.

Abstract: Although metagenomic sequencing is now the preferred technique to study microbiome-host interactions, analyzing and interpreting microbiome sequencing data presents challenges primarily attributed to the statistical specificities of the data (e.g., sparse, over-dispersed, compositional, inter-variable dependency). This mini review explores preprocessing and transformation methods applied in recent human microbiome studies to address microbiome data analysis challenges. Our results indicate a limited adoption o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 11 publications
(4 citation statements)
references
References 99 publications
(107 reference statements)
0
4
0
Order By: Relevance
“…Preprocessing of microbiome data is a crucial step in the analysis pipeline (Ibrahimi et al, 2023 ; Papoutsoglou et al, 2023 ). The microbiome data undergo several preprocessing steps.…”
Section: Methodsmentioning
confidence: 99%
“…Preprocessing of microbiome data is a crucial step in the analysis pipeline (Ibrahimi et al, 2023 ; Papoutsoglou et al, 2023 ). The microbiome data undergo several preprocessing steps.…”
Section: Methodsmentioning
confidence: 99%
“…In order to conduct a robust analysis, the initial dataset has been strategically partitioned into a validation dataset and a test dataset to. This partitioning is designed to ensure a representative and unbiased evaluation of the models developed during the study (Ibrahimi et al, 2023 ). The validation dataset consists of 22 samples from the province of Salerno and 33 samples from the province of Caserta.…”
Section: Methodsmentioning
confidence: 99%
“…This is even more important when considering metagenomic data. An overview of preprocessing steps for preparing microbiome sequencing data for machine learning is given by ( Ibrahimi et al, 2023 ). Normalization of sequencing results, for example, in transcriptomics, is essential, and a recent evaluation can be found here ( Ni and Qin, 2021 ).…”
Section: Challenges Of Deep Learning In Bioinformaticsmentioning
confidence: 99%