2019
DOI: 10.12783/dtcse/iccis2019/31986
|View full text |Cite
|
Sign up to set email alerts
|

Text Classification Based on LDA and Semantic Analysis

Abstract: The quality of text features directly affects the text classification effect, in order to get the text features which have the high contribution to the text classification in class, this paper proposes a text classification method based on LDA model and category semantic similarity method. The method selects text document topic features by the LDA model and calculates the semantic similarity between these features and categories combined with the word vector model. According to the size of similarity, the weig… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 4 publications
0
1
0
Order By: Relevance
“…The advantage of this method is that it can automatically learn without manual intervention, while the disadvantage is that it requires a large amount of data to train the model. Machine learning based text classification methods mainly train models to automatically learn the rules of classification, thereby classifying new texts [6]. The advantage of this method is that it can learn automatically, does not require manual intervention, and can achieve good results when the data volume is sufficient.…”
Section: Introductionmentioning
confidence: 99%
“…The advantage of this method is that it can automatically learn without manual intervention, while the disadvantage is that it requires a large amount of data to train the model. Machine learning based text classification methods mainly train models to automatically learn the rules of classification, thereby classifying new texts [6]. The advantage of this method is that it can learn automatically, does not require manual intervention, and can achieve good results when the data volume is sufficient.…”
Section: Introductionmentioning
confidence: 99%