2020
DOI: 10.3390/ijms21030713
|View full text |Cite
|
Sign up to set email alerts
|

Flexible Data Trimming Improves Performance of Global Machine Learning Methods in Omics-Based Personalized Oncology

Abstract: (1) Background: Machine learning (ML) methods are rarely used for an omics-based prescription of cancer drugs, due to shortage of case histories with clinical outcome supplemented by high-throughput molecular data. This causes overtraining and high vulnerability of most ML methods. Recently, we proposed a hybrid global-local approach to ML termed floating window projective separator (FloWPS) that avoids extrapolation in the feature space. Its core property is data trimming, i.e., sample-specific removal of irr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
19
0

Year Published

2020
2020
2025
2025

Publication Types

Select...
5
4

Relationship

3
6

Authors

Journals

citations
Cited by 24 publications
(19 citation statements)
references
References 69 publications
(84 reference statements)
0
19
0
Order By: Relevance
“…Several methods were proposed for the assessment of drug efficiency based on gene/protein expression [ 16 , 17 , 18 , 19 ] or mutation patterns [ 20 , 21 , 22 ]. Unfortunately, most such methods are either proprietary or employ machine learning on preceding cases [ 23 , 24 , 25 , 26 ]. So, for evaluating a cannabis drug’s individual action, we have suggested a novel approach, the cannabis drug efficiency index (CDEI).…”
Section: Methodsmentioning
confidence: 99%
“…Several methods were proposed for the assessment of drug efficiency based on gene/protein expression [ 16 , 17 , 18 , 19 ] or mutation patterns [ 20 , 21 , 22 ]. Unfortunately, most such methods are either proprietary or employ machine learning on preceding cases [ 23 , 24 , 25 , 26 ]. So, for evaluating a cannabis drug’s individual action, we have suggested a novel approach, the cannabis drug efficiency index (CDEI).…”
Section: Methodsmentioning
confidence: 99%
“…Among these three categories, reinforcement learning is relatively less used for multi-omics data analysis. Developing the methodologies is an active area of research ( 21 25 ). Pan-cancer analysis is also being done.…”
Section: Introductionmentioning
confidence: 99%
“…Many ML methods may be used for such applications, e.g. decision trees [12,13], random forests, RF [14,15], linear [16], logistic [17], lasso [18,19], ridge [15,20] regressions, multi-layer perceptron, MLP [12,15,21,22], support vectors machines [12,13,15,[23][24][25], adaptive boosting [26][27][28], as well as binomial naïve Bayesian [15] method.…”
Section: Introductionmentioning
confidence: 99%
“…Intelligent data filtering is, therefore, needed to reduce dimensionality of data [8]. However, a recent approach using dynamic feature extraction, or flexible data trimming, can significantly improve performances of ML-based methods for the real-world datasets [15,25].…”
Section: Introductionmentioning
confidence: 99%