Abstract Research on text mining has grown more than ever in various sectors. Public figures have also grown in interest towards the field and have the tendency to get to know more about consumers’ perceptions toward relevant goods and the reputation of an individual in social media. Sentiment analysis is a state-of-the-art technique that can be utilized to evaluate such trends or general views, for instance the reputation of a fashion brand. The dataset is built upon the crawled tweets that are relevant with the required topics which have the purpose to analyze the preferred fashion brand of the public. This study shows that the public leads to a positive notion toward foreign bag brands. The algorithms that are being compared includes Logistic Regression, Multinomial Naïve Bayes, Decision Tree, K-Nearest Neighbors, Random Forest, and Support Vector Machine. Support Vector Machine provides the best model which reaches 69% in accuracy. The Synthetic Minority Oversampling Technique (SMOTE) was also conducted to improve the model. Result shows that the Support Vector Machine model has successfully increased its accuracy by 13%, reaching an accuracy of 82%. Keywords: Sentiment Analysis, Brand, Machine Learning, Classification, SMOTE Abstrak Penelitian mengenai text mining telah mengalami peningkatan dibanding sebelumnya di dalam berbagai sektor. Figur publik juga semakin tertarik terhadap bidang tersebut dan memiliki kecenderungan untuk mengetahui lebih banyak mengenai persepsi konsumen terhadap suatu barang dan mengenai reputasi seseorang di media sosial. Sentimen analisis merupakan sebuah teknik state-of-the-art yang dapat digunakan untuk mengevaluasi suatu tren atau pandangan umum mengenai suatu hal, misalnya reputasi sebuah merek fashion. Sumber himpunan data yang digunakan pada penelitian ini dibuat berdasarkan crawling tweet yang relevan dengan topik yang dibutuhkan, yang bertujuan untuk menganalisis merek fashion yang disukai oleh masyarakat. Penelitian ini menunjukkan bahwa persepsi masyarakat mengarah pada persepsi positif terhadap merek tas luar negeri. Pada penelitian ini, beberapa algoritma digunakan sebagai perbandingan, antara lain Logistic Regression, Multinomial Naïve Bayes, Decision Tree, K-Nearest Neighbors, Random Forest, dan Support Vector Machine. Hasil pengujian model menunjukkan algoritma Support Vector Machine memiliki performa terbaik dengan accuracy sebesar 69%. Kemudian digunakan teknik Synthetic Minority Oversampling Technique (SMOTE) untuk meningkatkan performa dari model. Hasil menunjukkan bahwa model algoritma Support Vector Machine telah berhasil ditingkatkan dengan accuracy sebesar 13%, mencapai accuracy sebesar 82%. Kata kunci: Sentimen Analisis, Merek, Pembelajaran Mesin, Klasifikasi, SMOTE
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.