2019
DOI: 10.3329/jbs.v28i0.44712
|View full text |Cite
|
Sign up to set email alerts
|

Performance evaluation of different machine learning algorithms in presence of outliers using gene expression data

Abstract: Classification of samples into one or more populations is one of the main objectives of gene expression data (GED) analysis. Many machine learning algorithms were employed in several studies to perform this task. However, these studies did not consider the outliers problem. GEDs are often contaminated by outliers due to several steps involve in the data generating process from hybridization of DNA samples to image analysis. Most of the algorithms produce higher false positives and lower accuracies in presence … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 15 publications
0
2
0
Order By: Relevance
“…In various experiments that ignored the outlier problem, (Shahjaman et al, 2020) employed a variety of machine learning techniques to complete the task. The process of generating data from samples for image analysis involves several steps, which can often result in contamination by outliers, thus reducing the accuracy of most algorithms.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In various experiments that ignored the outlier problem, (Shahjaman et al, 2020) employed a variety of machine learning techniques to complete the task. The process of generating data from samples for image analysis involves several steps, which can often result in contamination by outliers, thus reducing the accuracy of most algorithms.…”
Section: Related Workmentioning
confidence: 99%
“…To evaluate the performance of the five machine learning algorithms, we considered three separate outlier rates: 5%, 10%, and 50%. The results show that RF is more suitable than the other four algorithms for randomly taken values (Shahjaman et al, 2020). The ability and viability of six machine learning techniques, including RF, to forecast vertical displacements brought on by tunnel excavation were investigated by .…”
Section: Related Workmentioning
confidence: 99%