The complexities and tangles of Arabic dialect in orthography and morphology typically make the sentimental analysis quite challenging. Moreover, most of the classification approaches have addressed this problem based on hand-crafted features. Since the Arabic language has multi-dialects and the language has no word-based order, the extraction process and the classification tasks are more difficult and time consuming. Deep neural network approaches applied to the Arabic language colloquial are very limited. These deep learning approaches typically comprise a structure that is very complex for small quantities of data. The structures are based on wide convolutional networks that are not capable of capturing the entire semantic and sentiment features for Arabic dialects. In this paper, a narrow structure of the convolutional neural network (CNN) has been proposed in order to obtain the tweets representations and classify the Arabic tweets into five, three and two polarities. Sensitivity analysis has been conducted to evaluate the impact of various combination structural properties, such as the number of convolutional filters, pooling size, and filter size on the classification performances. The proposed Arabic narrow convolutional neural network (NCNN) has captured the entire semantic and sentiment information contained in the tweet by maximizing the features of the detector's range. The NCNN performances were estimated to be at its optimum when structured by three convolutional layers, each one followed by the max pooling layer. The model has been developed without using lexicon resources and lexical features or augmented the dataset with extra training data. The narrow model is the first baseline model for Arabic dialects sentiment classifications for a sentence level as it is the first narrow CNN model addressing the Arabic Dialect tweets. NCNN model achieved the lowest macro average mean absolute error (MAE M ) for five polarity and higher Macro average recall (P) for three and two polarities on the SemEval-2017 Arabic dialect Twitter datasets when compared to the other state-of-the-art approaches.INDEX TERMS Arabic tweets sentiment analysis, multilayers convolutional neural network, narrow convolutional neural network.
Limited approaches have been applied to Arabic sentiment analysis for a five-point classification problem. These approaches are based on single task learning with a handcrafted feature, which does not provide robust sentence representation. Recently, hierarchical attention networks have performed outstandingly well. However, when training such models as single-task learning, these models do not exhibit superior performance and robust latent feature representation in the case of a small amount of data, specifically on the Arabic language, which is considered a low-resource language. Moreover, these models are based on single task learning and do not consider the related tasks, such as ternary and binary tasks (cross-task transfer). Centered on these shortcomings, we regard five ternary tasks as relative. We propose a multitask learning model based on hierarchical attention network (MTLHAN) to learn the best sentence representation and model generalization, with shared word encoder and attention network across both tasks, by training three-polarity and five-polarity Arabic sentiment analysis tasks alternately and jointly. Experimental results showed outstanding performance of the proposed model, with high accuracy of 83.98%, 87.68%, and 84.59 on LABR, HARD, and BRAD datasets, respectively, and a minimum macro mean absolute error of 0.632% on the Arabic tweets dataset for five-point Arabic sentiment classification problem.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.