Fifth IEEE International Workshop on Web Site Evolution, 2003. Theme: Architecture. Proceedings.
DOI: 10.1109/wse.2003.1234003
|View full text |Cite
|
Sign up to set email alerts
|

A tool-supported method to extract data and schema from Web sites

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Publication Types

Select...
4
2
2

Relationship

1
7

Authors

Journals

citations
Cited by 19 publications
(8 citation statements)
references
References 6 publications
0
8
0
Order By: Relevance
“…On the other hand, in a top-down approach, high-level blocks are first declared before their inner (leaf) content. In a previous work [8], we show that this approach can be optimally used in the context of Web sites re-engineering (e.g., Web data migration towards a database) or when complex data structures need to be declared. According to the exploitation of the extracted data, one approach will always be preferred to the other but we are working on the integration of both views to get a multipurpose environment.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…On the other hand, in a top-down approach, high-level blocks are first declared before their inner (leaf) content. In a previous work [8], we show that this approach can be optimally used in the context of Web sites re-engineering (e.g., Web data migration towards a database) or when complex data structures need to be declared. According to the exploitation of the extracted data, one approach will always be preferred to the other but we are working on the integration of both views to get a multipurpose environment.…”
Section: Discussionmentioning
confidence: 99%
“…Finally, we suggested, in a previous paper [8], a complementary approach for Web data extraction and schema generation. In this work, the mapping between HTML and XML was realized by means of a META file, i.e., an XML representation of page clusters based on the source HTML structure.…”
Section: Related Workmentioning
confidence: 99%
“…Lixto [28] is a wrapper generation tool that is well suitable for building HTML/XML wrappers. Moreira et al [29] propose an approach to integrating WWW information, which is based on the development of a canonical domain model in XML and the wrapping of existing WWW applications with wrappers capable of communicating about entities in this common model with the applications and with an intermediary mediator.…”
Section: Resource Oriented Software Evolutionmentioning
confidence: 99%
“…In practice, most conceptual schemes of information systems and databases are developed essentially from zero. However, over the last decade, several approaches have emerged, with the objective of maintenance Web oriented applications based on the reverse engineering process [1]; [2]; [3]; [4]; [5]; [6]; [7].…”
Section: Introductionmentioning
confidence: 99%