2015 2nd International Conference on Knowledge-Based Engineering and Innovation (KBEI) 2015
DOI: 10.1109/kbei.2015.7436183
|View full text |Cite
|
Sign up to set email alerts
|

Web content extraction using contextual rules

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2017
2017
2019
2019

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 13 publications
0
2
0
Order By: Relevance
“…Other important fields where template extraction is particularly useful are boilerplate removal [12,16,46], wrapper generation [32,43,60], wrapper induction [40,59], wrapper maintenance [29,39], and automated data extraction (see, e.g., [16,27,31]).…”
Section: Introductionmentioning
confidence: 99%
“…Other important fields where template extraction is particularly useful are boilerplate removal [12,16,46], wrapper generation [32,43,60], wrapper induction [40,59], wrapper maintenance [29,39], and automated data extraction (see, e.g., [16,27,31]).…”
Section: Introductionmentioning
confidence: 99%
“…Methods based on blocks mainly contain these kinds of algorithms: document object model (DOM) based page segmentation [5][6][7][8], vision-based page segmentation [9,10], specific tag based page segmentation [11,12], hybrid methods [13], and semantic based page segmentation. DOM based page segmentation uses hierarchical relations in tags to extract the main content [5,14]. Xpath can be used to locate content nodes in html where DOM is a kind of XML [15].…”
Section: Introductionmentioning
confidence: 99%