2022
DOI: 10.1371/journal.pone.0278583
|View full text |Cite
|
Sign up to set email alerts
|

Bi-dimensional principal gene feature selection from big gene expression data

Abstract: Gene expression sample data, which usually contains massive expression profiles of genes, is commonly used for disease related gene analysis. The selection of relevant genes from huge amount of genes is always a fundamental process in applications of gene expression data. As more and more genes have been detected, the size of gene expression data becomes larger and larger; this challenges the computing efficiency for extracting the relevant and important genes from gene expression data. In this paper, we provi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 32 publications
0
2
0
Order By: Relevance
“…Utilizing LUAD RNA-seq data from the Cancer Genome Atlas (TCGA), mutual information (MI) was employed followed by recursive feature elimination (RFE) feature selection procedures along with SVM classifier. A new Bi-dimensional Principal Feature Selection (BPFS) procedure for efficiently extracting critical genes was proposed for high dimensional gene expression datasets [ 19 ]. This procedure utilizes the principal component analysis (PCA) technique on sample and gene domains successively, in order to identify the informative genes and reduce redundancies while losing less information.…”
Section: Introductionmentioning
confidence: 99%
“…Utilizing LUAD RNA-seq data from the Cancer Genome Atlas (TCGA), mutual information (MI) was employed followed by recursive feature elimination (RFE) feature selection procedures along with SVM classifier. A new Bi-dimensional Principal Feature Selection (BPFS) procedure for efficiently extracting critical genes was proposed for high dimensional gene expression datasets [ 19 ]. This procedure utilizes the principal component analysis (PCA) technique on sample and gene domains successively, in order to identify the informative genes and reduce redundancies while losing less information.…”
Section: Introductionmentioning
confidence: 99%
“…In the rapidly evolving landscape of genomics and bioinformatics, the emergence of gene expression data as a prime example of big data presents both opportunities and formidable challenges. Big data, characterized by its immense size and complexity, demands innovative approaches for efficient processing and analysis [1]. Gene expression data, in particular, involves measuring the activity levels of thousands of genes across various biological samples or conditions.…”
Section: Introductionmentioning
confidence: 99%