2021
DOI: 10.13052/jwe1540-9589.2042
|View full text |Cite
|
Sign up to set email alerts
|

A New Semantic Approach to Improve Webpage Segmentation

Abstract: Webpage analysis is carried out for various purposes such as webpage segmentation. The goal of webpage segmentation is to divide a page into blocks that have similar elements. A fusion approach that combines different analyses is required in order to obtain high segmentation accuracy. In this paper, we propose a new fusion model for webpage segmentation, where we (1) merge webpage content into basic-blocks by simulating human perception; and, (2) identify similar blocks using semantic text similarity and regro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 33 publications
0
1
0
Order By: Relevance
“…It is that the task of pixel by pixel classification. Semantic segmentation has a wide range of application scenarios in daily life, such as automatic driving, medical diagnosis and augmented reality [1,2]. Image semantic segmentation refers to the technology of recognising and segmenting each object in an image at the pixel level.…”
Section: Introductionmentioning
confidence: 99%
“…It is that the task of pixel by pixel classification. Semantic segmentation has a wide range of application scenarios in daily life, such as automatic driving, medical diagnosis and augmented reality [1,2]. Image semantic segmentation refers to the technology of recognising and segmenting each object in an image at the pixel level.…”
Section: Introductionmentioning
confidence: 99%
“…Fusion-Block [28] is Gestalt law-inspired and subsequential re-segmentation, which uses semantic text similarity.…”
mentioning
confidence: 99%
“…Integrated-Block [28] uses DOM structure, is vision-based, and uses text-based similarity metrics analysis based on web page segmentation.…”
mentioning
confidence: 99%