Sentiment analysis of Indonesian hotel reviews: from classical machine learning to deep learning

Kusumaningrum, Retno; Nisa, Iffa Zainan; Nawangsari, Rizka Putri; Wibowo, Adi

doi:10.26555/ijain.v7i3.737

Cited by 8 publications

(6 citation statements)

References 28 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In the text processing process, text data will be processed to remove irrelevant information and improve data quality. In the process of text processing, several actions are performed such as: (Kusumaningrum et al, 2021b) 1) Remove punctuation and make text lowercase 2) Remove stopwords like "the", "and", "a", "to", etc 3) Doing lemmatization by modifying words into basic forms (lemma) such as "relaxed" becomes "relaxed" The results of the text after going through the text processing process are cleaner and have more relevant and concentrated information as shown in the following figure. After the text data cleaning process, the number of sentiments in the resulting data is often unbalanced, where the number of positive or negative sentiments tends to be more than the number of neutral sentiments.…”

Section: Figure 5 Research Variables Examined Text Preprocessingmentioning

confidence: 99%

“…Previously, sentiment analysis of hotel reviews was generally carried out on datasets originating from various sites such as Agoda (Sambas et al, 2022), Traveloka (Cendani et al, 2023, Google Map (Sambas et al, 2022), andTripadvisor (Baskoro et al, 2021). Various algorithms that have been researched include Random Forest (Utami, 2021a), Convolutional Neural Network (Kusumaningrum et al, 2021a), Long Short-Term Memory (LSTM) (Priyantina & Sarno, 2019), dan Reccurent Neural Network (Utami, 2021a). This research itself will use the LSTM algorithm because it has been widely used for processing text data.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Sentiment Analysis Of Hotel Reviews On Tripadvisor With LSTM And ELECTRA

Lin

Livando

Chandra

et al. 2023

SinkrOn

View full text Add to dashboard Cite

This study examines the importance of hotel review data analysis and the use of Natural Language Processing (NLP) technology in predicting hotel review sentiment. In this study, deep learning models such as Long Short-Term Memory (LSTM) and Efficiently Learning an Encoder that Classifies Token Replacements Accurately (ELECTRA) are used to predict hotel review sentiment in Indonesian. Hotel review data was obtained through a data scraping process with webscraper.io from the Tripadvisor website and a total of 977 hotel review data were obtained from Grand Mercure Maha Cipta Medan Angkasa. Before the sentiment prediction process is carried out, hotel review data must go through the text preprocessing stage to remove punctuation marks, capital letters, stopwords, and a lemmatizer process is carried out to facilitate further data processing. In addition, sentiments that were previously unbalanced need to be balanced through the undersampling process. The data that has been cleaned and balanced is then labeled as negative (0), neutral (1) and positive (2) sentiments. The test results show that the ELECTRA model produces better performance than the LSTM with an accuracy of 47% by ELECTRA and 30% by LSTM.

show abstract

Section: Figure 5 Research Variables Examined Text Preprocessingmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Sentiment Analysis Of Hotel Reviews On Tripadvisor With LSTM And ELECTRA

Lin

Livando

Chandra

et al. 2023

SinkrOn

View full text Add to dashboard Cite

show abstract

“…Proses selanjutnya adalah pembersihan data, dimana angka, tanda baca, dan tanda unik dihilangkan. Langkah selanjutnya adalah lower case, di mana semua kata distandarisasi dengan huruf kecil [18]. Setelah semua kata direduksi menjadi huruf kecil, masuk ke proses stopword removal.…”

Section: Data Pre-processingunclassified

“…Dalam tahap ini akan menggunakan confusion matrix, confusion matrix adalah tabulasi dari [17] perhitungan yang didasari pada evaluasi kinerja model klasifikasi berdasarkan jumlah objek penelitian yang diprediksi dengan benar dan salah. Secara singkat confusion matrix memberikan perincian terkait kesalahan klasifikasi [18]. Dalam tahap ini akan dibandingan model mana yang memiliki nilai F1-Score yang paling tinggi.…”

Section: Tahap Conclusionunclassified

Analisis Perbandingan Metode Tf-Idf dan Word2vec pada Klasifikasi Teks Sentimen Masyarakat Terhadap Produk Lokal di Indonesia

Hendrawan Rifky,

Utami,

Hartanto Dwi

2022

smartcomp

View full text Add to dashboard Cite

“…In the condition of determining aspect term keywords, it will certainly cause errors in the text extraction process in determining aspects and opinion terms which will have an impact on inaccurate determination of aspect categories and sentiment polarity. Therefore, a text extraction method is needed to get the right and accurate aspects and opinion terms in hotel reviews [12,13].…”

Section: Introductionmentioning

confidence: 99%

Attention-based Sentence Extraction for Aspect-based Sentiment Analysis with Implicit Aspect Cases in Hotel Review Using Machine Learning Algorithm, Semantic Similarity, and BERT

2023

IJIES

View full text Add to dashboard Cite

The development of the aspect-based sentiment analysis (ABSA) method to work on the case of implicit hotel reviews in depth has not been done much. The problem of extracting aspect and opinion words based on syntaxis and semantics is not only influenced by different of sentence structure types but can also be influenced by word sense disambiguation (WSD) level. So, it needs deep attention to solve these problems. For example, the review "You can't say its cheap because food is cheaper in Chinatown.", where "food is cheaper in Chinatown" is still widely extracted as target terms because there are explicit element of aspect and opinion. In fact, it requires in-depth attention to be able to extract and capture the implicit element "can't say its cheap" as a target term. However, there has been not many research that discusses the details of the ABSA process related to this case. Therefore, we propose an attention-based sentence extraction method for ABSA with implicit aspect cases in hotel review. The method purpose is to improve the ABSA accuracy for hotel reviews based on the cases that have not been solved. First, we develop a pre-processing method to the make the data ready to be processed. Then, we build a set rule-based algorithm to get the word types and the relationship of each word in the sentence. These rules function to identify and mark the candidates of aspect and opinion terms based on the review sentence structure types (simple, compound, complex, compound-complex) and to identify and mark the factors that influence the WSD level (conjunction, punctuation, contrast, intensification) in each sentence. The candidates result of aspect and opinion terms are used as input for the aspect categorization process. The aspect categorization process is carried out using machine learning algorithm, implicit aspect corpus, BERT embedding, and semantic similarity to obtain the aspect categories of each review. Furthermore, the ABSA process is carried out using the BERT sentiment analysis method. Finally, the evaluation process for aspect categorization and ABSA are done with the good result. The evaluation result of aspect categorization obtains 91.31% for accuracy, 91.81% for precision, 89.43% for recall, and 90.61% for f1-measure. Meanwhile, the evaluation result of ABSA obtains 98.10% for accuracy, 98.11% for precision, 96.98% for recall, and 97.54% for f1-measure.

show abstract

Sentiment analysis of Indonesian hotel reviews: from classical machine learning to deep learning

Cited by 8 publications

References 28 publications

Sentiment Analysis Of Hotel Reviews On Tripadvisor With LSTM And ELECTRA

Sentiment Analysis Of Hotel Reviews On Tripadvisor With LSTM And ELECTRA

Analisis Perbandingan Metode Tf-Idf dan Word2vec pada Klasifikasi Teks Sentimen Masyarakat Terhadap Produk Lokal di Indonesia

Attention-based Sentence Extraction for Aspect-based Sentiment Analysis with Implicit Aspect Cases in Hotel Review Using Machine Learning Algorithm, Semantic Similarity, and BERT

Contact Info

Product

Resources

About