2018
DOI: 10.1155/2018/8909357
|View full text |Cite
|
Sign up to set email alerts
|

Performance Assessment of Multiple Classifiers Based on Ensemble Feature Selection Scheme for Sentiment Analysis

Abstract: Sentiment classification or sentiment analysis has been acknowledged as an open research domain. In recent years, an enormous research work is being performed in these fields by applying various numbers of methodologies. Feature generation and selection are consequent for text mining as the high-dimensional feature set can affect the performance of sentiment analysis. This paper investigates the inability or incompetency of the widely used feature selection methods (IG, Chi-square, and Gini Index) with unigram… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
7
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
3
1

Relationship

0
10

Authors

Journals

citations
Cited by 20 publications
(7 citation statements)
references
References 20 publications
0
7
0
Order By: Relevance
“…The methodology (Figure 2) used the approach for text classification (Ghosh and Sanyal, 2018), it is start from data collection, text preprocessing, exploratory data analysis, and evaluation. After collecting the data, for improving the quality of raw data text in to a clean text dataset, it is necessary to do preprocessing the data (Vijayarani et al 2015) text preprocess activity are removing punctuation, digits, capitalization, extra area -white space, tokenization, and de-pluralization the terms, weed out unnecessarily popular phrases, delete a collection of specific stop words.…”
Section: Methodsmentioning
confidence: 99%
“…The methodology (Figure 2) used the approach for text classification (Ghosh and Sanyal, 2018), it is start from data collection, text preprocessing, exploratory data analysis, and evaluation. After collecting the data, for improving the quality of raw data text in to a clean text dataset, it is necessary to do preprocessing the data (Vijayarani et al 2015) text preprocess activity are removing punctuation, digits, capitalization, extra area -white space, tokenization, and de-pluralization the terms, weed out unnecessarily popular phrases, delete a collection of specific stop words.…”
Section: Methodsmentioning
confidence: 99%
“…Using statistical feature selection methods, Ghosh et al [11] proposed an ensemble feature selection technique to improve the performance of the sentiment analysis process. To find the best feature set, they use information gain, the Gini index, and the Chi-square method.…”
Section: Related Workmentioning
confidence: 99%
“…The objective of the Easy Ensemble method [9,10,11,26] is to overcome the deficiency of information loss introduced in the traditional RUS method. Easy Ensemble can be considered as an unsupervised learning algorithm that explores the majority class data by using independent random sampling with replacement [8].…”
Section: Related Workmentioning
confidence: 99%