Background

As more and more data is made available through the Web, mediation of information from heterogeneous sources becomes a crucial task for future Web information systems. We describe the features of our information mediator Mix (Mediation of Information using XML), which is being developed as part of a joint project between SDSC and UCSD. Like its predecessor TSIMMIS [PAGM96], Mix relies on the well-known mediator architecture [Wie92] to provide the user with an integrated view of the underlying sources. To facilitate a uniform and flexible representation of arbitrary source data, Mix employs XML, which is not only a document mark-up language, but also a semistructured data model. However, the flexibility of the semistructured model can become its own stumbling block when schema information is "buried" within the actual data and the user is faced with the problem of formulating meaningful queries against the semistructured database.

As a solution to this problem, we use XML DTDs as a structural description (in effect, a "schema") of the data exchanged by the components of the mediator architecture. More precisely, we focus on valid XML documents, i.e., documents which conform to an associated DTD. The schema provided by a DTD is more versatile than relational schemas, and at the same time provides more structure than the plain semistructured model of existing approaches like TSIMMIS. Given the central role of DTDs in our approach, semi-automatic inference of view DTDs becomes an important issue. The DTD inference task is to infer the DTD of the mediator view, given the mediator view definition and the source DTDs; see [PV99] for Mix's DTD inference algorithm.

The novel features of the Mix system include:

Data exchange and integration rely solely on XML, i.e., instance and schema information are represented by XML documents and XML DTDs, respectively.

XML queries are written in a high-level, declarative query language, Xmas, which builds upon ideas of languages like XML-QL, Yat, MSL, and UnQL [XML98, CDSS98, PAGM96, BDFS97]. For example, Xmas allows object fusion and pattern matching on the input XML data. Additionally, Xmas features powerful grouping and order constructs for generating new integrated XML "objects" from existing ones.

The graphical user interface Bbq (Blended Browsing and Querying) is completely driven by the mediator view DTD and integrates browsing and querying of XML data. Complex queries can be constructed in an intuitive way, which resembles QBE. Due to the nested nature of XML data and DTDs, Bbq employs a novel graphical way to specify the nesting and grouping of query results.

In Section 2, we briefly discuss the overall architecture of the system. A closer look at the corresponding modules is given in Section 3 using a concrete example. Section 4 summarizes the main points of the prototype and its demonstration.
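To make the notion of a valid XML document more concrete, the following is a minimal Python sketch of checking a document against a DTD-like schema. It is not Mix's implementation: the names `TOY_DTD` and `is_valid`, the element names, and the encoding of content models as regular expressions over child element names are all assumptions made for illustration, and the check ignores attributes and text content.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical, simplified "DTD": each element name maps to a regular
# expression over the space-terminated sequence of its child element names.
# A real DTD uses <!ELEMENT ...> declarations; this dict only mimics
# their content models.
TOY_DTD = {
    "homes": r"(home )*",                    # zero or more home elements
    "home": r"address price (school )*",     # address, price, then schools
    "address": r"",                          # leaf elements: no child elements
    "price": r"",
    "school": r"",
}


def is_valid(element, dtd):
    """Return True if element and all its descendants match their content model."""
    model = dtd.get(element.tag)
    if model is None:
        return False  # element is not declared in the schema
    # Encode the children as "tag1 tag2 ... " so the regex can match them.
    children = "".join(child.tag + " " for child in element)
    if re.fullmatch(model, children) is None:
        return False
    return all(is_valid(child, dtd) for child in element)


valid_doc = ET.fromstring(
    "<homes><home><address>La Jolla</address><price>500000</price>"
    "<school>UCSD</school></home></homes>"
)
invalid_doc = ET.fromstring(
    "<homes><home><price>500000</price></home></homes>"  # address is missing
)

print(is_valid(valid_doc, TOY_DTD))    # True
print(is_valid(invalid_doc, TOY_DTD))  # False
```

Because the schema is separate from the data, a user or a query interface can consult it to learn which elements may occur and how they nest before writing a query, which is the role the text assigns to DTDs.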