2022
DOI: 10.1038/s41598-022-21474-z
|View full text |Cite
|
Sign up to set email alerts
|

Adapted tensor decomposition and PCA based unsupervised feature extraction select more biologically reasonable differentially expressed genes than conventional methods

Abstract: Tensor decomposition- and principal component analysis-based unsupervised feature extraction were proposed almost 5 and 10 years ago, respectively; although these methods have been successfully applied to a wide range of genome analyses, including drug repositioning, biomarker identification, and disease-causing genes’ identification, some fundamental problems have been identified: the number of genes identified was too small to assume that there were no false negatives, and the histogram of P values derived w… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
23
0

Year Published

2022
2022
2025
2025

Publication Types

Select...
3
2

Relationship

3
2

Authors

Journals

citations
Cited by 12 publications
(23 citation statements)
references
References 45 publications
0
23
0
Order By: Relevance
“…In contrast to the other methods that must assume some statistical properties on histone modification distribution along the genome, the proposed method assumes only that PCs and SVVs obtained obey Gaussian distributions. Other methods can be successfully applied to gene expression [13] and DNA methylation [14] because target specific methods can be developed. However, since no assumptions about the statistical properties of histone modification can be made, other methods are not robust when applied to identification of histone modification.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…In contrast to the other methods that must assume some statistical properties on histone modification distribution along the genome, the proposed method assumes only that PCs and SVVs obtained obey Gaussian distributions. Other methods can be successfully applied to gene expression [13] and DNA methylation [14] because target specific methods can be developed. However, since no assumptions about the statistical properties of histone modification can be made, other methods are not robust when applied to identification of histone modification.…”
Section: Discussionmentioning
confidence: 99%
“…To fulfill this requirement, the method of tensor de-composition (TD) and principal component analysis (PCA)- based unsupervised feature extraction (FE) with optimized standard deviation (SD) that were successfully applied to gene expression [13] and DNA methylation [14] were tested to determine if histone modification could also be identified. In contrast to two previous studies [13], [14] where PC or singular value vectors (SVVs) obey the Gaussian distribution which is assumed in the null hypothesis after SD optimization, in our method PC and SVVs follow a mixed Gaussian distribution instead. Nevertheless, empirically, SD optimization allows the correct identification of histone modification to some extent, even when compared to various state-of-the-art methods.…”
Section: Introductionmentioning
confidence: 99%
“…A full explanation is provided in our previous paper (Taguchi and Turki, 2022) and our recent book (Taguchi, 2020). In brief, the purpose of the method is to select a limited number of variables associated with the classification; in this study, the classifications were clinical, medical, or biological information, e.g., healthy controls and patients.…”
Section: Pca-based Unsupervised Fe With Optimized Standard Deviationmentioning
confidence: 99%
“…See also Algorithm S2. A full explanation is, again, provided in our previous paper (Taguchi and Turki, 2022) and our recent book (Taguchi, 2020). As TD was only applied to EH1072 here, we assumed that the tensor represented the EH1072 dataset.…”
Section: Td-based Unsupervised Fe With Optimized Standard Deviationmentioning
confidence: 99%
See 1 more Smart Citation