2016 IEEE International Conference on Big Data (Big Data) 2016
DOI: 10.1109/bigdata.2016.7840924
|View full text |Cite
|
Sign up to set email alerts
|

NoSQL schema evolution and big data migration at scale

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
12
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
4
3
1
1

Relationship

2
7

Authors

Journals

citations
Cited by 24 publications
(18 citation statements)
references
References 7 publications
0
12
0
Order By: Relevance
“…For the algorithm used by Baazizi et al ẇe used the kind equivalence relation, as we do with JSONoid. As noted previously, we are also able to produce a schema that contains the same information as the reduced structure identification graph of Klettke et al [21]. However, since their approach cannot scale beyond main memory, we exclude it from our comparative analysis.…”
Section: Runtimementioning
confidence: 99%
“…For the algorithm used by Baazizi et al ẇe used the kind equivalence relation, as we do with JSONoid. As noted previously, we are also able to produce a schema that contains the same information as the reduced structure identification graph of Klettke et al [21]. However, since their approach cannot scale beyond main memory, we exclude it from our comparative analysis.…”
Section: Runtimementioning
confidence: 99%
“…Some approaches suggest lazy data migration for very large NoSQL databases, for instance, in Scherzinger et al [13], and in Saur et al [12], the overhead of lazy migration is discussed in terms of NoSQL databases. The foundations of different data migration strategies like incremental and predictive migration have been introduced in [6] and investigated in Klettke et al [4]. Although workload monitoring can be applied for different tasks like automated database tuning [11], we are not aware of any approaches for automated selection and adaptation of these data migration approaches as has been presented in this paper.…”
Section: Related Workmentioning
confidence: 99%
“…The estimates of [15] are based on discrete-event simulation using workload and structure models taken from logs, as well as the schema of the tobe-migrated database, whereas we investigate a range of migration scenario characteristics in a Monte Carlo approach. Since the traditional approach of eager migration can become very expensiveespecially in a cloud environment [15,22]-other approaches to data migration such as lazy [27,35] and proactive [22] approaches have been proposed. To our knowledge, there is no related study on the effects of the various migration strategies, comparable to ours in its systematics.…”
Section: Related Workmentioning
confidence: 99%
“…If, on the other hand, saving costs is the most important criterion, then a lazy migration strategy should be applied which minimizes migration costs, as data remains unchanged in the event of a release. In this case, if a legacy entity is accessed, it is migrated individually and on-the-fly, then being in congruence with the latest schema version, yet introducing a considerable runtime overhead [22,27,35]. These two metrics, migration costs and data access latency, are in fact competitors in a tradeoff, which is schematically depicted in Figure 1.…”
Section: Introductionmentioning
confidence: 99%