2021
DOI: 10.1038/s41598-021-83660-9
|View full text |Cite
|
Sign up to set email alerts
|

Recursive ensemble feature selection provides a robust mRNA expression signature for myalgic encephalomyelitis/chronic fatigue syndrome

Abstract: Myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) is a chronic disorder characterized by disabling fatigue. Several studies have sought to identify diagnostic biomarkers, with varying results. Here, we innovate this process by combining both mRNA expression and DNA methylation data. We performed recursive ensemble feature selection (REFS) on publicly available mRNA expression data in peripheral blood mononuclear cells (PBMCs) of 93 ME/CFS patients and 25 healthy controls, and found a signature of 23 … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

2
27
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8

Relationship

1
7

Authors

Journals

citations
Cited by 21 publications
(29 citation statements)
references
References 55 publications
2
27
0
Order By: Relevance
“…To date, Moustafa et al [65] is the only team to have attempted to build a blood-based miRNA diagnostic signature for endometriosis composed of six miRNAs based on Random Forest analysis. In agreement with previous studies [40][41][42], it would appear illusory that so few miRNAs could reflect the diversity of a multifactorial disorder such as endometriosis, which involves multiple and poorly known signaling pathways. Therefore, we hypothesized the value of (i) analyzing a specific selection of miRNAs, which resulted in a selection of 109; (ii) reducing the number of features to improve the final accuracy; and finally, (iii) using Random Forest model with high accuracy, which supports the value of AI technology.…”
Section: Discussionsupporting
confidence: 89%
See 1 more Smart Citation
“…To date, Moustafa et al [65] is the only team to have attempted to build a blood-based miRNA diagnostic signature for endometriosis composed of six miRNAs based on Random Forest analysis. In agreement with previous studies [40][41][42], it would appear illusory that so few miRNAs could reflect the diversity of a multifactorial disorder such as endometriosis, which involves multiple and poorly known signaling pathways. Therefore, we hypothesized the value of (i) analyzing a specific selection of miRNAs, which resulted in a selection of 109; (ii) reducing the number of features to improve the final accuracy; and finally, (iii) using Random Forest model with high accuracy, which supports the value of AI technology.…”
Section: Discussionsupporting
confidence: 89%
“…The study and data analysis followed the Standards for Reporting of Diagnostic Accuracy (STARD) reporting guidelines [33] (Annex S1). The study consisted of two parts: (i) identification of a biomarker based on genome-wide miRNA expression profiling by small RNA sequencing using next-generation sequencing (NGS) and (ii) development of a saliva-based miRNA diagnostic signature according to expression and accuracy profiling using an ML algorithm [17,21,[34][35][36][37][38][39][40][41][42].…”
Section: Ethics Statementmentioning
confidence: 99%
“…The study and data analysis followed the STAndards for the Reporting of Diagnostic accuracy studies (STARD) guidelines 27 (Annex 1 ). The study consisted of two parts: (i) biomarker discovery based on genome-wide miRNA expression profiling by small RNA sequencing using next generation sequencing (NGS), and (ii) development of a miRNA diagnostic signature according to expression and accuracy profiling using an ML algorithm 28 38 .…”
Section: Methodsmentioning
confidence: 99%
“…For the open field data, we assigned the label ‘0’ for ≥10 and ‘1’ for <10 number of entries into the zone. A feature selection algorithm for recursive ensemble feature selection (REFS) algorithm [ 71 ] was run based on the Borda Method [ 72 ]. In this algorithm, 8 different classifiers were used from the sci-kit learning toolbox [ 73 ]: Bagging, Random Forest, Logistic Regression, Gradient Boosting, Support Vector, Stochastic Gradient Descent, Passive Aggressive and Ridge.…”
Section: Methodsmentioning
confidence: 99%