2021
DOI: 10.7717/peerj-cs.677
|View full text |Cite
|
Sign up to set email alerts
|

Topic2features: a novel framework to classify noisy and sparse textual data using LDA topic distributions

Abstract: In supervised machine learning, specifically in classification tasks, selecting and analyzing the feature vector to achieve better results is one of the most important tasks. Traditional methods such as comparing the features’ cosine similarity and exploring the datasets manually to check which feature vector is suitable is relatively time consuming. Many classification tasks failed to achieve better classification results because of poor feature vector selection and sparseness of data. In this paper, we propo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 13 publications
(6 citation statements)
references
References 38 publications
(47 reference statements)
0
6
0
Order By: Relevance
“…The three categories into which sentiment analysis can be divided are the machine learning technique, the lexicon-based approach, and the hybrid strategy that combines the previous two approaches [10]. Nowadays, computational technologies are being used in various domains of life, including healthcare [14], security [15] [21] [25] and also in safety purposes [16], disaster [17], and situational awareness [19] [26] [27] in the educational domain [18] as well. Sentiment analysis is a prominent research topic in demand under the category of NLP [20].…”
Section: Literature Reviewmentioning
confidence: 99%
See 1 more Smart Citation
“…The three categories into which sentiment analysis can be divided are the machine learning technique, the lexicon-based approach, and the hybrid strategy that combines the previous two approaches [10]. Nowadays, computational technologies are being used in various domains of life, including healthcare [14], security [15] [21] [25] and also in safety purposes [16], disaster [17], and situational awareness [19] [26] [27] in the educational domain [18] as well. Sentiment analysis is a prominent research topic in demand under the category of NLP [20].…”
Section: Literature Reviewmentioning
confidence: 99%
“…No methodology to extract sentiments from customer feedback [15] Enhanced text mining to understand subjective and objective knowledge from text…”
Section: Literature Reviewmentioning
confidence: 99%
“…Their model approached an accuracy of 95.97 percent for AM-FED+ Dataset, 94.89 percent for the AFEW dataset, and 91.14 percent for MELD. In the same contrast, deep learning is currently used in most common image recognition tools [22], natural language processing (NLP) [23] and speech recognition software. These tools are starting to appear in applications as diverse as self-driving cars and language translation services.…”
Section: A Local Binary Pattern Approachmentioning
confidence: 99%
“…Few TM alternatives are also available such as latent semantic analysis (LSA), probabilistic LSA (pLSA), or LDA. Given the information and the length of text in these charts, we selected LDA for use as it has greater accuracy and is easy to interpret the results as it provides a more efficient representation of results (30,31).…”
Section: Latent Dirichlet Allocationmentioning
confidence: 99%