In this paper, we propose a refined term frequency inversed document frequency (TF-IDF) algorithm called TA TF-IDF to find hot terms, based on time distribution information and user attention. We also put forward a method to generate new terms and combined terms, which are split by the Chinese word segmentation algorithm. Then, we extract hot news according to the hot terms, grouping them into K-means clusters so as to realize the detection of hot topics in news. The experimental results indicated that our method based on the refined TF-IDF algorithm can find hot topics effectively.
Abstract-Software as a service (SaaS) is an emerging software framework in which business data and logic typically integrate with other applications. It requires a unified subscriber to describe SaaS to make for easy integration; however, SaaS provides services to different tenants by running only one instance. In order to satisfy personalized needs from different tenants, the business logic becomes correspondingly complex. As this logic is cumbersome to reveal to every individual tenant, we propose the use of Web Services Conversation Language (WSCL) to express the views of tenant and provider separately. To overcome deficiencies in WCSL for expressing heterogeneous data, process rules, and business rules, we extend the syntax of WSCL. We also put forward a new modeling method for constructing SaaS Service, describing the modeling process and the algorithm for obtaining the tenant model from the business model. In conclusion, we describe the modeling tools and validation methods.
Based on the statistical features, short text messages published by different gender users are different in terms of the words and semantics used. In this paper, two new features are constructed after constructing a gender-specific thesaurus. A new classification model is constructed by combining the traditional statistical features and the improved text implicitness feature. The experimental evaluation performed on the Sina Weibo dataset demonstrated the effectiveness of gender-specific thesaurus-based features, and the improved text implicitness feature improved the accuracy of gender classification to 84.7%.
Service chain discovery and recommendation are significant in services composition. A complex network module based algorithm using services invocable relations is proposed to search useful service chains on the network. Furthermore, a new scheme for discovering composite services processes automatically and recommending service chains by ranking their QoS is provided. Simulations are carried out and the results indicate that some useful service chains in the dataset provided by the WSC2009 can be found by the new algorithm.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.