With the explosive growth of information in the Web, users face difficulties while finding their desired information. Search engine helps the user by retrieving useful information from this huge collection based on his/her search query and presents a list of relevant web pages as a search result. However, without proper ranking of pages in the result through the relevancy of pages to the search query, the user may need to explore the whole list for discovering the appropriate page(s), thereby involving huge search time. Although a number of ranking algorithms such as HITS, PageRank, Weighted PageRank and etc., are developed to assist the search engine, but none of them provides page ranking with high accuracy. In this paper, we propose a scorebased web page ranking algorithm involving web content mining and usage information of the pages. Our algorithm considers both syntactical and semantic matches of the search query to the pages. For a web page, syntactical score is calculated based on the total number of exact matches of the search words in the page. Besides, semantic score is measured using synonym matches of the search words. Moreover, we incorporate the usage information of the pages as page popularity in order to comprise the user interest in the ranking order. The total relevant score of each page is calculated using the summation of the syntactical and semantic scores of the page and its page popularity. Finally, the pages are ranked according to their total relevant score. Based on several performance evaluation measures, experimental results show considerable improvement in the page ranking using our algorithm as compared to other known approaches.
General TermsSearch Engine, Ranking Algorithm, Web content mining, Web usage mining.