2020
DOI: 10.1016/j.jpdc.2019.12.015
|View full text |Cite
|
Sign up to set email alerts
|

A Parallel Multilevel Feature Selection algorithm for improved cancer classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 15 publications
(10 citation statements)
references
References 32 publications
0
10
0
Order By: Relevance
“…The parallelization method designed in this paper mainly relies on Spark's unique data format--Resilient Distributed Datasets (RDD). In traditional parallel methods based on data parallelism, the training data is created as RDD [50]- [52]. In the parallelization method based on parametric parallelism proposed in this paper, the initial population of GA is created as RDD.…”
Section: Input Matrix Lstm Layermentioning
confidence: 99%
“…The parallelization method designed in this paper mainly relies on Spark's unique data format--Resilient Distributed Datasets (RDD). In traditional parallel methods based on data parallelism, the training data is created as RDD [50]- [52]. In the parallelization method based on parametric parallelism proposed in this paper, the initial population of GA is created as RDD.…”
Section: Input Matrix Lstm Layermentioning
confidence: 99%
“…At present, the application of machine learning methods to cancer classification is a significant research field in bioinformatics [4,5]. Many traditional machine learning methods have been successfully applied to the classification analysis of gene expression data [6][7][8][9], such as RF, decision tree, KNN, and neural networks. However, with the increasing amounts and diversification of data, the traditional classification algorithm has been unable to meet the requirements of processing existing data and solving practical problems [10,11].…”
Section: Introductionmentioning
confidence: 99%
“…Therefore, an appropriate method should be selected to eliminate abnormal samples. IF is a famous unsupervised abnormally detection algorithm, which divides the sample space by feature points until all sample points are isolated 11 . In the IF model, normal samples with large similarity with surrounding samples need to be divided several times to be isolated, and abnormal samples with large differences from surrounding samples are easily isolated.…”
Section: Methodsmentioning
confidence: 99%
“…IF is a famous unsupervised abnormally detection algorithm, which divides the sample space by feature points until all sample points are isolated. 11 In the IF model, normal samples with large similarity with surrounding samples need to be divided several times to be isolated, and abnormal samples with large differences from surrounding samples are easily isolated. In addition, the value of decision function is used to represent the abnormal situation of samples in the IF model.…”
Section: Abnormal Detection and Sample Divisionmentioning
confidence: 99%