A topical (or focused) crawler is a web crawler aiming to search and retrieve web pages from the World Wide Web, which are related to a specific topic. Rather than downloading all accessible Web pages, a topical crawler analyzes the frontier of the crawled region to visit only the portion of the web that contains relevant web pages, and at the same time, try to skip irrelevant regions. This leads to significant savings in both computation and communication resources.
In this paper,we present an integrated topical crawler: Au-ToCrawler. The main features of AuToCrawler consist of a user interest specification module that mediates between users and search engines to identify target examples and keywords that together specify the topic of their interest, and a URL ordering strategy that combines features of several previous approaches and achieves significant improvement.It also provides a graphic user interface such that users can evaluate and visualize the crawling results that can be used as feedback to reconfigure the crawler.
A computationally efficient two-stage greedy capacity maximization (GCM) relay-and-antenna joint selection is proposed for a dual-hop nonregenerative amplify-and-forward (AF) multiple-input multiple-output (MIMO) multiple-relay system with multiple antennas equipped at each node in correlated fading channels. This modified GCM (MGCM) antenna selection strategy selects a subset of antenna pairs from available relays based on the concept of channel capacity maximization subject to an optimal power allocation constraint across the activated antenna pairs. In order to reduce system hardware complexity, antenna selection schemes are performed at the destination node as well. Finally, simulations are conducted to compare the channel capacity of the proposed two-layered antenna selection technique with other existing antenna selection algorithms for half-duplex AF-MIMO multiple-relay systems.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.