While most text classification studies focus on monolingual documents, in this article, we propose an empirical study of poly-languages text sentiment classification model, based on Convolutional Networks ConvNets. The novel approach consists on feeding the deep neural network with one input text source composed by reviews all written in different languages, without any code-switching indication, or language translation. We construct a multi-lingual opinion corpus combining three languages: English French and Greek all from Restaurants Reviews. Despite the limited contextual information due to relatively compact text content, no prior knowledge is used. The neural networks exploit n-gram level information, and the experimental results achieve high accuracy for sentiment polarity prediction, both positive and negative, which lead us to deduce that ConvNets features extraction is language independent.
In this article we introduce an empirical study of multilingual and multi-topic opinion classification. The particularity relies on the reviews that are written in different languages and refer to different but semantically close topics: Restaurants and Hotels. Our key objective is to emphasize the ability of a deep learning model to establish the sentiment polarity of reviews and topics Classification in a multilingual environment without any prior knowledge. For this work, we use unstructured text data, collected from the web, written in French, English and Greek (a less opinion-present language). The incorporate corpusbased input is raw, used without any pre-processing, translation, annotation nor additional knowledge features. For the machine learning approach, we use two different deep neural networks, Convolutional Neural Networks (CONVNETS) and Recurrent Neural Networks (RNNS). The learning model exploits n-gram level information, and achieves high accuracy for sentiment polarity and topics classification according to the experimental tests and results. From our hypothesis, we argue that the multilingual environment composed of reviews in semantically close domains, does not impact the network performance, and lead us to deduce that semantic features extraction with ConvNets and RNNs are language and context independent. Following these results, we tend to promote the inception of simple yet powerful approach for feeding deep networks in multilingual context.
In the geolocation field where high-level programs and low-level devices coexist, it is often difficult to find a friendly user interface to configure all the parameters. The challenge addressed in this paper is to propose intuitive and simple, thus natural language interfaces to interact with low-level devices. Such interfaces contain natural language processing and fuzzy representations of words that facilitate the elicitation of business-level objectives in our context.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.