2024
DOI: 10.21203/rs.3.rs-4392630/v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A DOM-structural Cohesion Analysis Approach for Segmentation of Modern Web Pages

Hieu Huynh,
Tri Le,
Vu Nguyen
et al.

Abstract: Web page segmentation is a fundamental technique applied in information retrieval systems to enhance web crawling tasks and information extraction. Its purpose is to gain deep insights from crawling results and extract the main content of a webpage by disregarding the irrelevant regions. Over time, several solutions have been proposed to address the segmentation problem using different approaches and learning strategies. Among these, the structural cue, which is a characteristic of the DOM tree, is widely util… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 29 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?