2020
DOI: 10.30534/ijatcse/2020/162952020
|View full text |Cite
|
Sign up to set email alerts
|

Dimensionality Reduction for Classification of Filipino Text Documents based on Improved BayesianVectorization Technique

Abstract: Dimensionality reduction of feature vector size plays a vital role in enhancing the text processing capabilities to reduce the size of the feature vector used in the mining tasks to achieve a higher classification accuracy. While dimensionality reduction for text classification is becoming a great area of research in most languages, Filipino documents have received little or no attention from researchers. Thus, this paper addresses the issue of dimensionality reduction in representing relevant data from Filipi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 14 publications
0
1
0
Order By: Relevance
“…Documents in the bag of words containing no words were removed as well as their entry labels. A support vector machines (SVM)-based supervised classification model was used using the word frequency counts from the bag-of-words model and the labels [21], [22]. A multiclass linear classifier specifies the counts of the bag-of-words model to be the predictor, and the event type labels to be the response.…”
Section: Thematic Classification and Machine Learningmentioning
confidence: 99%
“…Documents in the bag of words containing no words were removed as well as their entry labels. A support vector machines (SVM)-based supervised classification model was used using the word frequency counts from the bag-of-words model and the labels [21], [22]. A multiclass linear classifier specifies the counts of the bag-of-words model to be the predictor, and the event type labels to be the response.…”
Section: Thematic Classification and Machine Learningmentioning
confidence: 99%