In this paper, a framework for mining unstructured documents in the form of Vietnamese text comments about locations is proposed. In the first step, the evaluation of users in the form of text comments will be extracted to produce a set of sentences in Vietnamese standard grammars by web analytic processing. Through the second step, using Skip-gram based model, the similarity between each phrase in sentences will be detected. The core of this research focuses on contributing an approach for word representation that reflects Vietnamese semantic information contexts for analyzing as an input of a machine learning based classification, SVM. The application of this research is applied to the STAAR project's opinion mining system.
Purpose -The growth of online data and services on the Web have have led to the Web become an indispensable tool for the tourist industry. It is not denied that various approaches bring benefits for visitors, in supporting their searching for tourist attractions, such as interesting places for the visit, eating or staying. However, like a coin has two sides, too much information would present a difficulty for people when planning their journeys. Generally, tourists usually have problems when trying to find satisfactory accommodation if references to nearby restaurants, sights or event locations are lacking. In addition, travelers suffer from the information overload when they look for information about potential destinations, events and related services. Providing relevant and up-to-date information for the tourists with different personal interests is still a challenging task for the tourist guide information systems. The purpose of this paper is to propose a semantic approach for searching tourist information and generating travel itinerary. Design/methodology/approach -The paper focus on introducing an ontological model for representation of tourist resources as well as traveler's profile. Based on this model, smart user interfaces facilitating the semantic search have been implemented in the mobile travel guide application. In addition, the authors propose an algorithm for generating travel itinerary which combines semantic matching with ant colony optimization technique. Findings -The Semantic Tourist informAtion Access and Recommending (STAAR) system, which has been implemented to promote the travel activity in Hanoi, reveals the advantages of the semantic approach in the development of smart application with ontology-based, user-friendly interface and high precision information search features. An experiment was conducted to show that the proposed algorithm generates travel itinerary relevant to both criteria of itinerary length and user interest. Originality/value -This paper presents a novel algorithm for planning a travel itinerary that is optimized on the length as well as semantically matching the user interests.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.