2013
DOI: 10.1007/s00778-013-0323-0
|View full text |Cite
|
Sign up to set email alerts
|

The ontological key: automatically understanding and integrating forms to access the deep Web

Abstract: Forms are our gates to the web. They enable us to access the deep content of web sites. Automatic form understanding provides applications, ranging from crawlers over meta-search engines to service integrators, with a key to this content. Yet, it has received little attention other than as component in specific applications such as crawlers or meta-search engines. No comprehensive approach to form understanding exists, let alone one that produces rich models for semantic services or integration with linked ope… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0
1

Year Published

2014
2014
2019
2019

Publication Types

Select...
5
3
1

Relationship

2
7

Authors

Journals

citations
Cited by 22 publications
(15 citation statements)
references
References 33 publications
0
14
0
1
Order By: Relevance
“…Also, the ability of ROSEANN to annotate different sections of the DOM with different annotation pools, together with its reconciliation capabilities, reduce the noise in the annotations that is the main source of errors in annotationdriven wrapper inducers such as [3]. Figure 6 shows the use of ROSEANN within DIADEM, in particular, for the unsupervised segmentation of classified listings on the web [6] and understanding of forms [5].…”
Section: Applications Of Roseannmentioning
confidence: 98%
“…Also, the ability of ROSEANN to annotate different sections of the DOM with different annotation pools, together with its reconciliation capabilities, reduce the noise in the annotations that is the main source of errors in annotationdriven wrapper inducers such as [3]. Figure 6 shows the use of ROSEANN within DIADEM, in particular, for the unsupervised segmentation of classified listings on the web [6] and understanding of forms [5].…”
Section: Applications Of Roseannmentioning
confidence: 98%
“…The data wrangling functionality, for example for mapping generation or format transformation, is implemented as a collection of loosely coupled components that build on the concept of a relational transducer (or simply transducer). Transducers were introduced by Abiteboul et al [5], and have been successfully applied and extended in a variety of applications [10], including web data extraction [11].…”
Section: Transducersmentioning
confidence: 99%
“…• δ guard [11] are Vadalog rules that describe whether a transducer is ready to be executed. • δ scopes [11] are Vadalog rules that describe the scope of the transducer (i.e., parts of the knowledge base and external sources that the transducers depend on). • δ map are Vadalog rules that describe the mapping between the knowledge base and other external schemata, and the internal schema of the transducer.…”
Section: Transducersmentioning
confidence: 99%
“…Another interesting method in the same family is the Ontology-based web Pattern Analysis with Logic (OPAL) method (Furche, 2011) (Furche, 2013). This method does not take in entry a set of deep web sources URL but only the domain name of interest.…”
Section: The Form Integration Approachmentioning
confidence: 99%