2015 IEEE International Conference on Information Reuse and Integration 2015
DOI: 10.1109/iri.2015.39
|View full text |Cite
|
Sign up to set email alerts
|

Using Random Undersampling to Alleviate Class Imbalance on Tweet Sentiment Data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
54
1
1

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
3
2

Relationship

2
7

Authors

Journals

citations
Cited by 117 publications
(56 citation statements)
references
References 9 publications
0
54
1
1
Order By: Relevance
“…Interestingly, they found random under-sampling did make significant improvement compared with not using any technique at all, which is quite different from the conclusion drawn in [49].…”
Section: Single Classifier Based Approachescontrasting
confidence: 65%
See 1 more Smart Citation
“…Interestingly, they found random under-sampling did make significant improvement compared with not using any technique at all, which is quite different from the conclusion drawn in [49].…”
Section: Single Classifier Based Approachescontrasting
confidence: 65%
“…From our observation of the data we acquire via Twitter API, the number of tweets with positive emojis is always greater than the amount with negative Emojis (from the raw data we draw, the size of positive ones could double the negative ones). From paper [49], [91], imbalanced data could introduce some potential problem on the classification result.…”
Section: Data Resamplingmentioning
confidence: 99%
“…Random undersampling is a method of data sampling which randomly selects most of the class instances and removes them until the desired class distribution is attained [14].…”
Section: Samplingmentioning
confidence: 99%
“…The minority class is not modified during this procedure. The random under-sampling technique has been shown to improve model classification performance, as compared to training on the entire dataset without RUS [47,49].…”
Section: Random Under-sampling (Rus)mentioning
confidence: 99%