2019
DOI: 10.1016/j.eswa.2018.10.020
|View full text |Cite
|
Sign up to set email alerts
|

Tree-structured multi-stage principal component analysis (TMPCA): Theory and applications

Abstract: A PCA based sequence-to-vector (seq2vec) dimension reduction method for the text classification problem, called the tree-structured multi-stage principal component analysis (TMPCA) is presented in this paper. Theoretical analysis and applicability of TMPCA are demonstrated as an extension to our previous work (Su, Huang, & Kuo, in press). Unlike conventional word-to-vector embedding methods, the TMPCA method conducts dimension reduction at the sequence level without labeled training data. Furthermore, it can p… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
8
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 26 publications
(14 citation statements)
references
References 27 publications
0
14
0
Order By: Relevance
“…Also, we use a linear regression model in this study and find many features are correlated. In our future work, we should apply a dimension reduction technique, such as PCA, to reduce dimensionality on the entire feature space [38,39].…”
Section: Discussionmentioning
confidence: 99%
“…Also, we use a linear regression model in this study and find many features are correlated. In our future work, we should apply a dimension reduction technique, such as PCA, to reduce dimensionality on the entire feature space [38,39].…”
Section: Discussionmentioning
confidence: 99%
“…This analytical approach did better result and the accuracy was 85.23%. The tree-structured multi-linear principal component analysis (TMPCA) [41] proposed for text classification is a novel data processing technique. To facilitate the machine learning task that follows, the TMPCA can effectively decrease the size of the entire sentence data.…”
Section: Related Workmentioning
confidence: 99%
“…Besides, combining neural network architecture with other classification is done to reach maximum accuracy [5]. A hybrid approach using deep learning and other machine learning would be a good interest to reach better accuracy [50][51][52][53][54].…”
Section: Classification In Handwriting Analysismentioning
confidence: 99%