2016
DOI: 10.4236/cs.2016.79230
|View full text |Cite
|
Sign up to set email alerts
|

An Approach for Content Retrieval from Web Pages Using Clustering Techniques

Abstract: Mining the content from an information database provides challenging solutions to the industry experts and researchers, due to the overcrowded information in huge data. In web searching, the information retrieved is not an appropriate, because it gives ambiguous information for the user query, and the user cannot get relevant information within the stipulated time. To overcome these issues, we propose a new methodology for information retrieval EPCRR by providing the top most exact information to the user, by … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 19 publications
0
3
0
Order By: Relevance
“…The synergy between these approaches creates a stable foundation, forming the cornerstone for subsequent enhancements [64]. The consistent reliance on collaborative filtering and content-based filtering reflects a strategic choice made by researchers to build upon a strong foundation while incorporating additional algorithms to fine-tune and optimize system performance [64], [65], [66], [67]. This strategic combination, as observed in the majority of instances, reflects an efficient approach to building recommendation systems that have successfully navigated a spectrum of challenges [66].…”
Section: Interpretation Of Results For Rq3mentioning
confidence: 99%
“…The synergy between these approaches creates a stable foundation, forming the cornerstone for subsequent enhancements [64]. The consistent reliance on collaborative filtering and content-based filtering reflects a strategic choice made by researchers to build upon a strong foundation while incorporating additional algorithms to fine-tune and optimize system performance [64], [65], [66], [67]. This strategic combination, as observed in the majority of instances, reflects an efficient approach to building recommendation systems that have successfully navigated a spectrum of challenges [66].…”
Section: Interpretation Of Results For Rq3mentioning
confidence: 99%
“…The design for identifying the web text is analyzing the characteristic of text, single pass, and web article text extraction [12]. Automatic annotation of query result from web database is classified into automatic annotation, decorative tag detector, cluster based shifting and Simple probabilistic , Tag Path(TP), Combining tag and vale Similarity (CTV'S), Local interface Schema (LIS), New Combining tag and vale Similarity (NCTV'S), Integrated Interface Schema(IIS) [8], [9] and [12]. Automatic annotation and wrapper generation is based on six annotators namely schema value, table, query based, frequency based, in-text prefix/suffix, common knowledge annotator.…”
Section: Literature Review : Surveymentioning
confidence: 99%
“…Support Vector Machine (SVM) is used to identify the classifying of non-content blocks. Hierarchical clustering is used to find the similarity measure value of DOM tree [9]. There are two kinds of automatic page annotation search results namely tabling annotator and query based annotator [13].…”
Section: Literature Review : Surveymentioning
confidence: 99%