2021
DOI: 10.3390/cancers13102366
|View full text |Cite
|
Sign up to set email alerts
|

Sequence Neighborhoods Enable Reliable Prediction of Pathogenic Mutations in Cancer Genomes

Abstract: Identifying cancer-causing mutations from sequenced cancer genomes hold much promise for targeted therapy and precision medicine. “Driver” mutations are primarily responsible for cancer progression, while “passengers” are functionally neutral. Although several computational approaches have been developed for distinguishing between driver and passenger mutations, very few have concentrated on using the raw nucleotide sequences surrounding a particular mutation as potential features for building predictive model… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
2

Relationship

4
1

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 89 publications
0
5
0
Order By: Relevance
“…In the case of whole genome/exome data, the MultiQC report was generated based on the results from tools such as FastQC and SnpEff. Two cancer-related data analysis tools, NBDriver ( 29 ), which uses a machine learning approach to identify the context of mutations, and cTaG ( 30 ), a tool to predict whether a given gene is a tumor suppressor (TSG) or an oncogene (OG), were also incorporated in the final pipeline (Figure 2 ).…”
Section: Methodsmentioning
confidence: 99%
“…In the case of whole genome/exome data, the MultiQC report was generated based on the results from tools such as FastQC and SnpEff. Two cancer-related data analysis tools, NBDriver ( 29 ), which uses a machine learning approach to identify the context of mutations, and cTaG ( 30 ), a tool to predict whether a given gene is a tumor suppressor (TSG) or an oncogene (OG), were also incorporated in the final pipeline (Figure 2 ).…”
Section: Methodsmentioning
confidence: 99%
“…In the case of whole genome/exome data, the MultiQC report was generated based on the results from tools such as FastQC and SnpEff. Two cancer-related data analysis tools, NBDriver [31], which uses a machine learning approach to identify the context of mutations, and cTaG [32], a tool to predict whether a given gene is a tumor suppressor (TSG) or an oncogene (OG), were also incorporated in the final pipeline(Figure 2) .…”
Section: Methodsmentioning
confidence: 99%
“…Despite the dramatic advances in developing predictive algorithms to differentiate between driver and passenger mutations, very few have concentrated on utilizing the local sequence context as potential features for further analysis. To capture this information, we built a robust machine learning model called NBDriver [31], which uses raw nucleotide sequences surrounding cancer-causing mutations as features to build machine learning models.…”
Section: Prediction Of Driver and Passenger Mutations Using Nbdrivermentioning
confidence: 99%
“…Various computational methods exist for identifying driver genes. Tools classify either mutations as driver events ( Mao et al, 2013 ; Tokheim and Karchin, 2019 ; Banerjee et al, 2021 ), or the genes mutated as driver genes ( Tokheim et al, 2016 ). Driver mutation prediction relies on the functional impact or neighborhood sequence.…”
Section: Introductionmentioning
confidence: 99%