2018
DOI: 10.1016/j.is.2018.02.007
|View full text |Cite
|
Sign up to set email alerts
|

Schema profiling of document-oriented databases

Abstract: In document-oriented databases, schema is a soft concept and the documents in a collection can be stored using different local schemata. This gives designers and implementers augmented flexibility; however, it requires an extra effort to understand the rules that drove the use of alternative schemata when sets of documents with different -and possibly conflicting-schemata are to be analyzed or integrated. In this paper we propose a technique, called schema profiling, to explain the schema variants within a col… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
40
0
1

Year Published

2018
2018
2020
2020

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 56 publications
(41 citation statements)
references
References 26 publications
0
40
0
1
Order By: Relevance
“…[4] also identifies the smallest set of core attributes, but their approach is more complex and computationally expensive than the one we present here. Finally, [28] goes a step further by not finding a common schema, but trying to explain the different variants found in documents by means of association rules.…”
Section: Related Workmentioning
confidence: 99%
“…[4] also identifies the smallest set of core attributes, but their approach is more complex and computationally expensive than the one we present here. Finally, [28] goes a step further by not finding a common schema, but trying to explain the different variants found in documents by means of association rules.…”
Section: Related Workmentioning
confidence: 99%
“…Studying the inference of RE(&) has several practical motivations, such as schema inference. The presence of a schema for XML documents has many advantages, such as for query processing and optimization, data integration and exchange [11,30]. However, many XML documents in practice are not accompanied by a valid schema [16], making schema inference an attractive research topic [2,3,10,14,31].…”
Section: Introductionmentioning
confidence: 99%
“…Recent years have witnessed an erosion of the relational DBMS predominance to the benefit of DBMSs based on alternative representation models (e.g., document-oriented and graph-based) which adopt a schemaless representation for data. Schemaless databases are preferred to relational ones for storing heterogeneous data with variable schemas and structural forms; typical schema variants within a collection consist in missing or additional fields, in different names or types for an field, and in different structures for instances [1]. The absence of a unique schema grants flexibility to operational applications but adds complexity to analytical applications, in which a single analysis often involves large sets of data with different schemas.…”
Section: Introductionmentioning
confidence: 99%
“…In this paper we propose an original approach to multidimensional querying and OLAP on schemaless sources, in particular on collections stored in document-oriented databases (DODs) such as MongoDB 1 . The basic idea is to stop fighting against data heterogeneity and schema variety, and welcome it as an inherent source of information wealth in schemaless sources.…”
Section: Introductionmentioning
confidence: 99%