2023
DOI: 10.3390/genes14020248
|View full text |Cite
|
Sign up to set email alerts
|

Omics Data Preprocessing for Machine Learning: A Case Study in Childhood Obesity

Abstract: The use of machine learning techniques for the construction of predictive models of disease outcomes (based on omics and other types of molecular data) has gained enormous relevance in the last few years in the biomedical field. Nonetheless, the virtuosity of omics studies and machine learning tools are subject to the proper application of algorithms as well as the appropriate pre-processing and management of input omics and molecular data. Currently, many of the available approaches that use machine learning … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(4 citation statements)
references
References 40 publications
0
4
0
Order By: Relevance
“…As with much of the current research into the microbiome, the study focused on developing methods to understand complex interactions between different bacteria. Similarly, a Spanish collaboration used genomic and biochemical predictors of insulin resistance in adolescents to act as a case study [7 ▪ ]. Their study highlighted the importance of careful model selection and data preparation to ensure that reliable and clinically important results were obtained.…”
Section: The Genome Metabolome and Microbiomementioning
confidence: 99%
“…As with much of the current research into the microbiome, the study focused on developing methods to understand complex interactions between different bacteria. Similarly, a Spanish collaboration used genomic and biochemical predictors of insulin resistance in adolescents to act as a case study [7 ▪ ]. Their study highlighted the importance of careful model selection and data preparation to ensure that reliable and clinically important results were obtained.…”
Section: The Genome Metabolome and Microbiomementioning
confidence: 99%
“…Quality control should be applied to all omics layers such as through quantifying GC content in RNA-seq and removing duplicated and fragmented reads for sequence alignment to be performed [ 186 ]. Feature selection is commonly conducted to reduce the dimensionality and redundancy of the high-throughput data, and discriminate desired features contained within the data [ 187 , 188 ]. The goal of the data processing is to reduce dimensionality, bias, and variation of the mined data in order to ensure robustness and efficiency of analysis, especially prior to multi-omics integration.…”
Section: Towards Integrative Multi-omics and Systems Bioinformatics T...mentioning
confidence: 99%
“…In the contribution “Omics Data Preprocessing for Machine Learning: A Case Study in Childhood Obesity”, by Álvaro Torres-Martos et al [ 5 ], the main goal is the analysis of the childhood obesity, which is a multifactorial disease influencing the development of a range of metabolic disorders, where adipose tissue has been proved to be fundamental. The authors present machine learning predictive models with multi-omics human data, and provides a collection of best practices and guidelines that could be applied to other human diseases with complex fundamentals (e.g., obesity).…”
Section: Contributionsmentioning
confidence: 99%