Ontology is one of the key components in semantic webs. It contains the core knowledge for an effective search. However, building ontology requires the carefully-collected knowledge which is very domain-sensitive. In this work, we present the practice of ontology construction for a case study of health tourism in Thailand. The whole process follows the METHONTOLOGY approach, which consists of phases: information gathering, corpus study, ontology engineering, evaluation, publishing, and the application construction. Different sources of data such as structure web documents like HTML and other documents are acquired in the information gathering process. The tourism corpora from various tourism texts and standards are explored. The ontology is evaluated in two aspects: automatic reasoning using Pellet, and RacerPro, and the questionnaires, used to evaluate by experts of the domains: tourism domain experts and ontology experts. The ontology usability is demonstrated via the semantic web application and via example axioms. The developed ontology is actually the first health tourism ontology in Thailand with the published application.
In recent years, Bitcoin and other cryptocurrencies have been increasingly considered an investment option for emerging markets. However, its erratic behavior has discouraged some potential investors. To get insights into its behavior and price fluctuation, past studies have discovered the correlation between Twitter sentiments and Bitcoin behavior. Most of them have focused exclusively on their relationships, instead of the Twitter sentiment analysis itself. Finding the most suitable classification algorithms for sentiment analysis for this kind of data is challenging. For enormous data of Twitter, unlabeled data can be time-consuming and expensive for the supervised sentiment analysis approach, which has been studied to be superior to unsupervised ones. As such, we propose HyVADRF: Hybrid VADER -Random Forest and Grey Wolf Optimizer Model. Semantic and rule-based VADER was used to calculate polarity scores and classify sentiments, which overcame the weakness of manual labeling, while Random Forest was utilized as its supervised classifier. Furthermore, considering Twitter's massive size, we collected over 3.6 million tweets and analyzed various dataset sizes as these are related to the model's learning process. Lastly, Grey Wolf Optimizer parameter tuning was conducted to optimize the classifier's performance. The results show that 1) HyVADRF Model returned an accuracy of 75.29 %, precision of 70.22%, recall of 87.70%, and F1-score of 78%. 2) The most ideal percentage of dataset size is 90% of the total collected tweets (n=1,249,060). 3) With standard deviations of 0.0008 for accuracy and F1-score and 0.0011 for precision and recall. Hence, HyVADRF Model consistently delivers stable results.
Resource Description Framework (RDF) data represents information linkage around the Internet. It uses Internationalized Resources Identifier (IRI) which can be referred to external information. Typically, an RDF data is serialized as a large text file which contains millions of relationships. In this work, we propose a framework based on TripleID-Q, for query processing of large RDF data in a GPU. The key elements of the framework are 1) a compact representation suitable for a Graphics Processing Unit (GPU) and 2) its simple representation conversion method which optimizes the preprocessing overhead. Together with the framework, we propose parallel algorithms which utilize thousands of GPU threads to look for specific data for a given query as well as to perform basic query operations such as union, join, and filter. The TripleID representation is smaller than the original representation 3-4 times. Querying from TripleID using a GPU is up to 108 times faster than using the traditional RDF tool. The speedup can be more than 1,000 times over the traditional RDF store when processing a complex query with union and join of many subqueries.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.