2018 International Arab Conference on Information Technology (ACIT) 2018
DOI: 10.1109/acit.2018.8672697
|View full text |Cite
|
Sign up to set email alerts
|

The Effects of Natural Language Processing on Big Data Analysis: Sentiment Analysis Case Study

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
15
0
3

Year Published

2019
2019
2024
2024

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 30 publications
(18 citation statements)
references
References 11 publications
0
15
0
3
Order By: Relevance
“…Chi-square method used for feature selection helps to reduce the dimensionality as well as noise in data and increase the accuracy of classifier from 81.5 to 92.3 in TF-IDF, 83 to 90 in FF and 83.1 to 93 in FP [21]. Number of feature selected highly effect the accuracy of result but after a limit increase in size of feature set does not increase classification accuracy [22]- [25].…”
Section: E Feature Selection Methods and Number Of Features Selectedmentioning
confidence: 99%
See 1 more Smart Citation
“…Chi-square method used for feature selection helps to reduce the dimensionality as well as noise in data and increase the accuracy of classifier from 81.5 to 92.3 in TF-IDF, 83 to 90 in FF and 83.1 to 93 in FP [21]. Number of feature selected highly effect the accuracy of result but after a limit increase in size of feature set does not increase classification accuracy [22]- [25].…”
Section: E Feature Selection Methods and Number Of Features Selectedmentioning
confidence: 99%
“…Classification accuracy highly depends on data quality and various researches [18]- [21] show the effect of preprocessing technique on efficiency of classification. Effects of feature selection methods and number of features on classification accuracy are explored in various researches [22]- [28]. All these factors collectively effect overall efficiency and need to be addressed in sentiment analysis.…”
Section: Literature Surveymentioning
confidence: 99%
“…The stopword list which is used is made by itself, refers to the context of words which are often used in hospital services and numbers. Number have no effect on sentiment analysis, and removing them can reduce noise and increase efficiency [27]. An example of a stopword list used is shown in Table 4.…”
Section: Conversation Analysismentioning
confidence: 99%
“…Daftar stopword yang digunakan dibuat sendiri mengacu pada konteks kata yang sering dipakai dalam Dapat dilihat pada Tabel 4, kata "yang" merupakan salah satu daftar stopword yang paling banyak muncul, yaitu sebanyak 214 kali, dan kata "di" yang muncul sebanyak 104 kali. Selain dalam bentuk kata angka juga masuk ke dalam stopword, angka tidak berpengaruh pada analisis sentimen dan dapat dihapus, sehingga dapat mengurangi kebisingan dan meningkatkan efisiensi [26]. Hasil keseluruhan proses prapemrosesan dapat dilihat pada Tabel 5.…”
Section: Analisis Ulasanunclassified