No abstract
Social bookmarking is the process through which users share tags for online resources like blogs with others. Such collaborative tags provide valuable metadata for retrieval systems. While the successes of collaborative tagging systems have been demonstrated by popular websites like Del.icio.us, these sites cover only a small fraction of the available blogs on the web. The vast majority of the blogs are not available on any collaborative tagging system and are often tagged only by the authors. This lack of coverage of collaborative tags is a considerable roadblock in using the tag metadata in a web scale information retrieval system. To solve this problem we propose and implement a system to automatically recommend collaborative tags for a blog. The automatically generated tags will help to surface the blogs by making them available on social book marking sites and allow them to be easily discovered and potentially further tagged by a wider population.
Abstract-The nature of the Blogosphere determines that the majority of bloggers are only connected with a small number of fellow bloggers, and similar bloggers can be largely disconnected from each other. Aggregating them allows for cost-effective personalized services, targeted marketing, and exploration of new business opportunities. As most bloggers have only a small number of adjacent bloggers, the problem of aggregating similar bloggers presents challenges that demand novel algorithms of connecting the non-adjacent due to the fragmented distributions of bloggers. In this work, we define the problem, delineate its challenges, and present an approach that uses innovative ways to employ contextual information and collective wisdom to aggregate similar bloggers. A real-world blog directory is used for experiments. We demonstrate the efficacy of our approach, report findings, and discuss related issues and future work.
Combining multiple data sources, each with its own features, to achieve optimal inference has received a lot of attention in recent years. In inference from multiple data sources, each source can be thought of as providing one view of the underlying object. In general, different views may provide complementary information for the inference task. However, often not all the views are available all the time for the available instances in an application. In this paper, we propose a view completion approach based on canonical correlation analysis that heuristically predicts the missing views and further ranks all within-view features, through learning the intrinsic correlation among the views from training set. We evaluate our approach and compare it with existing approaches in the literature, using web page classification and photo tag recommendation as case studies. Experiments demonstrate the improved performance of the proposed approach. The results suggest that the work has great potential for inference problems with multiple information sources.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.