Purpose -This study aims to present a new web page recommendation system that can help users to reduce navigational time on the internet. Design/methodology/approach -The proposed design is based on the primacy effect of browsing behavior, that users prefer top ranking items in search results. This approach is intuitive and requires no training data at all. Findings -A user study showed that users are more satisfied with the proposed search methods than with general search engines using hot keywords. Moreover, two performance measures confirmed that the proposed search methods out-perform other metasearch and search engines.Research limitations/implications -The research has limitations and future work is planned along several directions. First, the search methods implemented are primarily based on the keyword match between the contents of web pages and the user query items. Using the semantic web to recommend concepts and items relevant to the user query might be very helpful in finding the exact contents that users want, particularly when the users do not have enough knowledge about the domains in which they are searching. Second, offering a mechanism that groups search results to improve the way search results are segmented and displayed also assists users to locate the contents they need. Finally, more user feedback is needed to fine-tune the search parameters including a and b to improve the performance. Practical implications -The proposed model can be used to improve the search performance of any search engine. Originality/value -First, compared with the democratic voting procedure used by metasearch engines, search engine vector voting (SVV) enables a specific combination of search parameters, denoted as a and b, to be applied to a voted search engine, so that users can either narrow or expand their search results to meet their search preferences. Second, unlike page quality analysis, the hyperlink prediction (HLP) determines qualified pages by simply measuring their user behavior function (UBF) values, and thus takes less computing power. Finally, the advantages of HLP over statistical analysis are that it does not need training data, and it can target both multi-site and site-specific analysis.
Purpose -Web-snippet clustering has recently attracted a lot of attention as a means to provide users with a succinct overview of relevant results compared with traditional search results. This paper seeks to research the building of a web-snippet clustering system, based on a mixed clustering method. Design/methodology/approach -This paper proposes a mixed clustering method to organise all returned snippets into a hierarchical tree. The method accomplishes two main tasks: one is to construct the cluster labels and the other is to build a hierarchical tree. Findings -Five measures were used to measure the quality of clustering results. Based on the results of the experiments, it was concluded that the performance of the system is better than current commercial and academic systems. Originality/value -A high performance system is presented, based on the clustering method. A divisive hierarchical clustering algorithm is also developed to organise all returned snippets into a hierarchical tree.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.