A generic load/extract utility for data transfer between XML documents and relational databases

Bourret, R.; Bornhövd, Christof; Buchmann, A.

doi:10.1109/wecwis.2000.853868

Cited by 42 publications

(17 citation statements)

References 4 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…In this paper [21], there is described an algorithm for lossless schema mapping to generate a database schema from a DTD, which makes several improvements over existing algorithms. Also the other strategies for mapping XML to the relation table have been proposed by researchers such as [22] [23] or [24].…”

Section: Analysis and Design Of Solutionmentioning

confidence: 99%

Evaluation of XPath Queries Over XML Documents Using SparkSQL Framework

Hricov

Šenk

Kroha

et al. 2017

Beyond Databases, Architectures and Structures. Towards Efficient Solutions for Data Analysis and Knowledge Representation

View full text Add to dashboard Cite

Validity:Until the end of winter semester 2016/17 InstructionsSparkSQL framework enables distributed and parallel data processing of various formats using SQL-like query language. The main goal of the master thesis is to use the SparkSQL framework to implement a subset of expressions from the XPath query language, which is used for querying XML data.1. Get acquainted with the Apache Spark engine, mainly focus on its SparkSQL framework. 2. Study the works related to the process of mapping the XML database technology (XML documents) to the relational database technology. 3. Based on your knowledge, design a query engine that will be able to evaluate XPath queries over XML documents. 4. Implement a prototype of the designed solution using the SparkSQL framework. 5. Perform suitable testing on the implemented prototype, primarily aim on its functional properties. 6. Create a summary of the performed testing and assess the possibility of its deployment in a highly distributed environment. ReferencesWill be provided by the supervisor. DeclarationI hereby declare that the presented thesis is my own work and that I have cited all sources of information in accordance with the Guideline for adhering to ethical principles when elaborating an academic final thesis.I acknowledge that my thesis is subject to the rights and obligations stipulated by the Act No. 121/2000 Coll., the Copyright Act, as amended, in particular that the Czech Technical University in Prague has the right to conclude a license agreement on the utilization of this thesis as school work under the provisions of Article 60 (1) Citation of this thesis AbstractThe main goal of this thesis is to use Spark SQL framework to implement a subset of expressions from XPath query language. The first part of this thesis is focused on introducing the Apache Spark project. The second part covers analysis of mapping XML documents into the tabular form using an encoding of nodes that keeps a document order. Also the approach to the solution that uses Spark's features is described in the second part. The third part of the thesis is focused on implementation and testing of designed solution.Keywords XML, XPath, SQL, Spark, Spark SQL, DataFrame, Dewey order encoding AbstraktCieľom tejto práce je implementovať podmnožinu výrazov jazyka XPath pomocou systému Spark SQL. Prvá časť práce je zameraná na predstavenie projektu Apache Spark. Druhá časť pokrýva analýzu možnosti mapovania ix XML dokumentov do formy tabuľky použitím kódovania prvkov, ktoré zachováva ich poradie v rámci dokumentu. V druhej časti je taktiež popísaných niekoľko spôsobov riešenia, ktoré využívajú funkcie systému Spark. Tretia časť tejto práce je zameraná na implementáciu a testovanie navrhnutého riešenia.

show abstract

Section: Analysis and Design Of Solutionmentioning

confidence: 99%

Evaluation of XPath Queries Over XML Documents Using SparkSQL Framework

Hricov

Šenk

Kroha

et al. 2017

Beyond Databases, Architectures and Structures. Towards Efficient Solutions for Data Analysis and Knowledge Representation

View full text Add to dashboard Cite

show abstract

“…Bourret et al [4] developed XML-DBMS, a generic tool for loading XML documents into relational tables. Although similar to ShreX in motivation, the mappings supported by this tool are limited to the basic, shared, and hybrid techniques described in [10].…”

Section: Related Workmentioning

confidence: 99%

ShreXManaging XML Documents in Relational Databases

DU¹,

AMERYAHIA²,

FREIRE³

2004

Proceedings 2004 VLDB Conference

View full text Add to dashboard Cite

We describe ShreX, a freely-available system for shredding, loading and querying XML documents in relational databases. ShreX supports all mapping strategies proposed in the literature as well as strategies available in commercial RDBMSs. It provides generic (mapping-independent) functions for loading shredded documents into relations and for translating XML queries into SQL. ShreX is portable and can be used with any relational database backend.

show abstract

“…For example, Bourret et al [3] introduced an XML-RDB mapping language to specify transformation rules for generating an RDB schema from an existing XML DTD (Document Type Definition).…”

Section: Introductionmentioning

confidence: 99%

An UML-XML-RDB Model Mapping Solution for Facilitating Information Standardization and Sharing in Construction Industry

Wu¹,

Hsieh

2002

Proceedings of the 19th International Symposium on Automation and Robotics in Construction (ISARC)

View full text Add to dashboard Cite

Abstract:To facilitate information standardization and sharing in Construction Industry, this paper presents a simple but effective approach that maps the UML (Unified Modeling Language) object-oriented information model related to a construction project to an XML schema, then to a Relational DataBase (RDB) schema. First of all, the mapping between UML model and XML schema is discussed since UML has been a popular tool to model the static structure and dynamic behaviors of the information and processes in a construction project, while XML has become a de-facto standard for information sharing and exchange. Then, a set of consistent rules for mapping from XML schema to RDB's Entity-Relational (E-R) model are studied and established since RDB has been the most popular choice for information management. The present study focuses on making the set of rules simple and easy-to-implement for most applications in construction industry. Finally, a mapping tool for automatically generating RDB schemas from XML Schemas is developed.

show abstract

A generic load/extract utility for data transfer between XML documents and relational databases

Cited by 42 publications

References 4 publications

Evaluation of XPath Queries Over XML Documents Using SparkSQL Framework

Evaluation of XPath Queries Over XML Documents Using SparkSQL Framework

ShreXManaging XML Documents in Relational Databases

An UML-XML-RDB Model Mapping Solution for Facilitating Information Standardization and Sharing in Construction Industry

Contact Info

Product

Resources

About