Domain-specific Stop Words in Malaysian Parliamentary Debates 1959 – 2018

Rahman, Anis Nadiah Che Abdul; Abdullah, Imran Ho; Zainudin, Intan Safinaz; Tiun, Sabrina; Jaludin, Azhar

doi:10.17576/gema-2021-2102-01

Cited by 2 publications

(1 citation statement)

References 13 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Removal of stop words is essential in Natural Language Processing (NLP) tasks and text analysis [ 2 ]. This process can be followed routine using a pre-defined library or using a list of stop words.…”

Section: Proposed Methodologymentioning

confidence: 99%

Performance evaluation of machine learning models on large dataset of android applications reviews

Qureshi

Ahmad

Ullah

et al. 2023

Multimed Tools Appl

View full text Add to dashboard Cite

With an ever-increasing number of mobile users, the development of mobile applications (apps) has become a potential market during the past decade. Billions of users download mobile apps for divergent use from Google Play Store, fulfill tasks and leave comments about their experience. Such reviews are replete with a variety of feedback that serves as a guide for the improvement of existing apps and intuition for novel mobile apps. However, application reviews are challenging and very broad to approach. Such reviews, when segregated into different classes guide the user in the selection of suitable apps. This study proposes a framework for analyzing the sentiment of reviews for apps of eight different categories like shopping, sports, casual, etc. A large dataset is scrapped comprising 251661 user reviews with the help of ‘Regular Expression’ and ‘Beautiful Soup’. The framework follows the use of different machine learning models along with the term frequency-inverse document frequency (TF-IDF) for feature extraction. Extensive experiments are performed using preprocessing steps, as well as, the stats feature of app reviews to evaluate the performance of the models. Results indicate that combining the stats feature with TF-IDF shows better performance and the support vector machine obtains the highest accuracy. Experimental results can potentially be used by other researchers to select appropriate models for the analysis of app reviews. In addition, the provided dataset is large, diverse, and balanced with eight categories and 59 app reviews and provides the opportunity to analyze reviews using state-of-the-art approaches.

show abstract

Section: Proposed Methodologymentioning

confidence: 99%