2019
DOI: 10.1007/978-3-030-21642-9_7
|View full text |Cite
|
Sign up to set email alerts
|

A Semi-supervised Learning Approach for Pan-Cancer Somatic Genomic Variant Classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(7 citation statements)
references
References 9 publications
0
7
0
Order By: Relevance
“…One public effort to create such a shared knowledgebase is the AACR Project GENIE, which is a multidisciplinary effort that integrates de-identified clinical-grade cancer genomic data with clinical outcome data for tens of thousands of cancer patients treated at multiple institutions worldwide (AACR Project Genie Consortium, 2017). The GENIE database (JW has grant support from GENIE) has been used to train machine-learning classifiers and is in the process of adding more in-depth phenotype information (Nicora et al, 2019). In the future, the international oncology community should continue to converge on common standards for knowledge representation, so that a comprehensive and unified knowledgebase that links tumor molecular profile data with approved therapies and available clinical trials, which would lead to the achievement of the goal of precision oncology, can be achieved.…”
Section: Discussionmentioning
confidence: 99%
“…One public effort to create such a shared knowledgebase is the AACR Project GENIE, which is a multidisciplinary effort that integrates de-identified clinical-grade cancer genomic data with clinical outcome data for tens of thousands of cancer patients treated at multiple institutions worldwide (AACR Project Genie Consortium, 2017). The GENIE database (JW has grant support from GENIE) has been used to train machine-learning classifiers and is in the process of adding more in-depth phenotype information (Nicora et al, 2019). In the future, the international oncology community should continue to converge on common standards for knowledge representation, so that a comprehensive and unified knowledgebase that links tumor molecular profile data with approved therapies and available clinical trials, which would lead to the achievement of the goal of precision oncology, can be achieved.…”
Section: Discussionmentioning
confidence: 99%
“…The first proposals by Carter et al [19] and Capriotti et al [20] were based on these algorithms. Among the SVM-based approaches, whereas most papers adopted the traditional SVM algorithm [20,22,24,27,31,32,39,55,56,57,58], we observed three papers using OneClass SVM [45,49,59] and one paper using Sequential Minimal Optimization (SMO) [28]. SVM is a popular and consolidated technique in the field, as it continues to be largely applied throughout the years since 2011.…”
Section: Methods Based On Supervised Learningmentioning
confidence: 99%
“…Ensemble methods train multiple weak classifiers, such as decision trees, and combine their output (e.g., with majority voting) to achieve a better predictive performance. We observed a frequent use of Random Forests (RF) [19,26,27,34,37,38,43,49,53,54,55,56,57,58], as well as Gradient Boosting Trees (GBT) [37,42], and eXtreme Gradient Boosting (XGBoost) [57,58]. While RF [114] uses bagging (i.e., bootstrap aggregating) and random features subsets to train multiple and diverse trees independently, Gradient Boosting [115] builds one tree at a time, introducing a weak learner to improve shortcoming of existing trees by assigning more weight on instances with wrong predictions and high errors.…”
Section: Methods Based On Supervised Learningmentioning
confidence: 99%
See 2 more Smart Citations