1999
DOI: 10.1006/jnca.1999.0095

The Web-OEM approach to Web information extraction

Abstract: The enormous amount of information available through the World Wide Web requires the development of effective tools for extracting and summarizing relevant data from Web sources. In this article we present a data model for representing Web documents and an associated SQL-like query language. Our framework provides an easy-to-use and well-formalized method for automatic generation of wrappers extracting data from Web documents.

Cited by 2 publications (1 citation statement)
References 13 publications
“…Prior research studied document management problems either in general or from a particular viewpoint such as document retrieval or document manipulation [Chin, 2001, Foo and Lim, 1997, Jones and Morrison, 1993, Lambrix and Padgham, 2000, Zantout and Marir, 1999]. This research focuses on e-document exchange and information extraction, which is an important topic of document management [Chang et al, 2003, Hao et al, 1996, Iocchi, 1999]. This paper describes an application developed for the National Natural Science Foundation of China (NSFC) to process research grant applications.…”
Section: Introduction
Citation type: mentioning
confidence: 99%