2022
DOI: 10.7717/peerj.13200
|View full text |Cite
|
Sign up to set email alerts
|

ExhauFS: exhaustive search-based feature selection for classification and survival regression

Abstract: Feature selection is one of the main techniques used to prevent overfitting in machine learning applications. The most straightforward approach for feature selection is an exhaustive search: one can go over all possible feature combinations and pick up the model with the highest accuracy. This method together with its optimizations were actively used in biomedical research, however, publicly available implementation is missing. We present ExhauFS—the user-friendly command-line implementation of the exhaustive … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 13 publications
(6 citation statements)
references
References 59 publications
0
6
0
Order By: Relevance
“…Since 3′-end isomiR variations do not alter the isomiR targetome much (seed region not affected), we summed up the expression of isomiRs originating from the same miRNA, which have identical 5′-end sequences (5′-isomiRs). We used our previous notation for 5′-isomiRs ( Nersisyan et al, 2022b ): a number after the “|” symbol stands for a shift from the canonical 5′-end in the 5′–3′ direction. For example, hsa-miR-192-5p|+1 differs from the canonical hsa-miR-192-5p miRNA by the absence of the first nucleotide on its 5′-end.…”
Section: Methodsmentioning
confidence: 99%
“…Since 3′-end isomiR variations do not alter the isomiR targetome much (seed region not affected), we summed up the expression of isomiRs originating from the same miRNA, which have identical 5′-end sequences (5′-isomiRs). We used our previous notation for 5′-isomiRs ( Nersisyan et al, 2022b ): a number after the “|” symbol stands for a shift from the canonical 5′-end in the 5′–3′ direction. For example, hsa-miR-192-5p|+1 differs from the canonical hsa-miR-192-5p miRNA by the absence of the first nucleotide on its 5′-end.…”
Section: Methodsmentioning
confidence: 99%
“…Therefore, we propose that building machine learning models based on meta-analysis of multiple studies would enhance the robustness and reliability of these models, especially given the increasing availability of data and recent publications in this field. In addition, computational tools such as ExhauFS were employed to identify top ranked features for machine learning predictions 16 . For model development and validation, our study (n = 18, 27 for disease, and n = 77, 27 for severity) served as the training set, while the study by Zeng et al (n = 61, 48 for disease and n = 52, 48 for severity) was utilized as the filtration set, with the Guttman Study (n = 12, 18 for disease and n = 18, 18 for severity) and Garcia Study served as validation sets (n = 13, 15 for severity, with no healthy controls).…”
Section: Resultsmentioning
confidence: 99%
“…Notably, we observed a significant down-regulation of a large cluster of miRNAs located on chromosome 14 (14q32), comprising over 90 members. We built machine learning models based on meta-analysis results combined with an exhaustive feature selection tool 16 that was able to predict COVID-19 disease and severity across multiple independently published studies. Our results suggest a robust method for building miRNA-based models for disease diagnosis and prognosis and highlight overlapping roles of different miRNA biomarkers.…”
Section: Introductionmentioning
confidence: 99%
“…Exhaustive feature searching is a brute-force technique that evaluates all possible subsets of features and selects the one that maximizes a given criterion function [56], [57]. Let's assume that a set of…”
Section: Exhaustive Feature Searchingmentioning
confidence: 99%