A group of senior database researchers gathers every few years to assess the state of database research and to point out problem areas that deserve additional focus. This report summarizes the discussion and conclusions of the sixth ad-hoc meeting held May 4-6, 2003 in Lowell, Mass. It observes that information management continues to be a critical component of most complex software systems. It recommends that database researchers increase focus on: integration of text, data, code, and streams; fusion of information from heterogeneous data sources; reasoning about uncertain data; unsupervised data mining for interesting correlations; information privacy; and self-adaptation and repair.
Provenance for transactional updates is critical for many applications such
as auditing and debugging of transactions. Recently, we have introduced
MV-semirings, an extension of the semiring provenance model that supports
updates and transactions. Furthermore, we have proposed reenactment, a
declarative form of replay with provenance capture, as an efficient and
non-invasive method for computing this type of provenance. However, this
approach is limited to the snapshot isolation (SI) concurrency control protocol
while many real world applications apply the read committed version of snapshot
isolation (RC-SI) to improve performance at the cost of consistency. We present
non-trivial extensions of the model and reenactment approach to be able to
compute provenance of RC-SI transactions efficiently. In addition, we develop
techniques for applying reenactment across multiple RC-SI transactions. Our
experiments demonstrate that our implementation in the GProM system supports
efficient re-construction and querying of provenance.Comment: long versions of CIKM pape
The existence of the variety of data models and their associated data processing technologies make data management extremely complex. In this paper, we envision a single Multi-Model DataBase Management Systems (MMDBMS) providing declarative accesses to a variety of data models. We briefly review the history of the evolution of the DBMS technology to derive requirements of MMDBMSs and then we illustrate our ideas of building MMDBMSs satisfying those requirements. Since the relational algebra is not powerful enough to provide a mathematical foundation for MMDBMSs, we promote the category theory as a new theoretical foundation, which is a generalization of the set theory. We also suggest a set of shared data infrastructure services among data models to support "Just-In-Time" multi-model data access autonomously.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.