2022
DOI: 10.3390/biology11030365
|View full text |Cite
|
Sign up to set email alerts
|

Machine Learning-Based Identification of Colon Cancer Candidate Diagnostics Genes

Abstract: Background: Colorectal cancer (CRC) is the third leading cause of cancer-related death and the fourth most commonly diagnosed cancer worldwide. Due to a lack of diagnostic biomarkers and understanding of the underlying molecular mechanisms, CRC’s mortality rate continues to grow. CRC occurrence and progression are dynamic processes. The expression levels of specific molecules vary at various stages of CRC, rendering its early detection and diagnosis challenging and the need for identifying accurate and meaning… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
2

Relationship

1
8

Authors

Journals

citations
Cited by 28 publications
(14 citation statements)
references
References 75 publications
0
7
0
Order By: Relevance
“…At present, there are also some studies on constructing CRC diagnostic classifiers based on machine learning algorithms. Koppad et al [ 21 ] screened CRC diagnosis-related genes by the random forest algorithm, which has the advantage of avoiding data overfitting and reducing the computational load of the model. We aimed to filter biomarkers that could diagnose cancer.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…At present, there are also some studies on constructing CRC diagnostic classifiers based on machine learning algorithms. Koppad et al [ 21 ] screened CRC diagnosis-related genes by the random forest algorithm, which has the advantage of avoiding data overfitting and reducing the computational load of the model. We aimed to filter biomarkers that could diagnose cancer.…”
Section: Discussionmentioning
confidence: 99%
“…Lian et al [ 20 ] trained medulloblastoma stemness index based on a machine learning method of one-class logistic regression to obtain gene expression-based stemness index and methylation-based stemness index and further identified their corresponding potential drugs, which provides new ideas for the survival of medulloblastoma patients or targeting stem cells. Koppad et al [ 21 ] screened diagnostic candidate genes for CRC based on six methods of machine learning classification including Adaboost, ExtraTrees, logistic regression, Naive Bayes classifier, random forest, and XGBoost. Thus, there is potential for wider application of novel bioinformatics methods to identify novel diagnostic biomarkers based on public databases.…”
Section: Introductionmentioning
confidence: 99%
“…This algorithm shows a balance between prediction performance and explainability, which indicates the ability of machine learning algorithms to explain or justify the results in terms that are understandable by humans (Al' Aref et al 2020, Chen andGuestrin 2016). XGBoost is a common choice for dealing with the classification problem of multiple diseases, such as Parkinson's disease (Gao et al 2018), colon cancer (Koppad et al 2022), and breast cancer (Thalor et al 2022). Due to its simplicity, interpretability, and ability to handle imbalanced datasets, we chose the XGBoost algorithm to construct our IBD classifier (Shorthouse et al 2018).…”
Section: Open Accessmentioning
confidence: 99%
“…40 An example of this is quantifying biomarkers for disease states in tissues. [41][42][43] In the aforementioned diagnostics context, each of the masked pixels taken from the 4 × 4 MM is treated as an instance in our dataset. Each instance has 16 features, with each feature corresponding to one of the 16 channels of the MM.…”
Section: Artificial Intelligence and Machine Learning Techniquesmentioning
confidence: 99%