2017
DOI: 10.1007/s11227-017-2037-3
|View full text |Cite
|
Sign up to set email alerts
|

Atrak: a MapReduce-based data warehouse for big data

Abstract: As warehouse data volumes expand, single-node solutions can no longer analyze the immense volume of data. Therefore, it is necessary to use shared nothing architectures such as MapReduce. Inter-node data segmentation in MapReduce creates node connectivity issues, network congestion, improper use of node memory capacity and inefficient processing power. In addition, it is not possible to change dimensions and measures without changing previously stored data and big dimension management. In this paper, a method … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
6
1

Relationship

2
5

Authors

Journals

citations
Cited by 10 publications
(9 citation statements)
references
References 23 publications
0
9
0
Order By: Relevance
“…Data format unification can be applied to problems in other fields. This method can be used in data warehouse like Aras and Atrak methods [22,23], graph processing [24], integrating multidimensional data sources [25] and specific problems like finding patient similarity [26]. For future works, this method can also be used for interactive query processing, online data mining and stream processing.…”
Section: Resultsmentioning
confidence: 99%
“…Data format unification can be applied to problems in other fields. This method can be used in data warehouse like Aras and Atrak methods [22,23], graph processing [24], integrating multidimensional data sources [25] and specific problems like finding patient similarity [26]. For future works, this method can also be used for interactive query processing, online data mining and stream processing.…”
Section: Resultsmentioning
confidence: 99%
“…Historical data is stored in warehouse system in huge volume of data comparing to OLTP, Online Transaction Processing [8]. Historical data can be useful in helping to predict the future when conducting predictive analyses [43].…”
Section: Historical Databasementioning
confidence: 99%
“…Data is managed through several ways to ensure data integrity and its consistency considering its fault tolerance, so several NewSQL databases provide horizontal and vertical scalability to ensure previous features and others. Horizontal scaling concerns of increasing the commodity nodes (hardware) whereas vertical scaling considers of the CPU and RAM power enhancement into current nodes/commodities [8]. Figure 12: Taxonomy 4 reveals mechanism and technologies categories sup-ported in NewSQL.…”
Section: Horizontal and Vertical Scalability Of Distributed System Gementioning
confidence: 99%
See 1 more Smart Citation
“…Flink supports various concepts in time-based windows such as event-based processing, timebased processing, and row count-based processing. Aras [40], Atrak [41] and Hengam [42] use data unification and in-Memory database to achieve higher performance on data warehouse query execution.…”
Section: Related Workmentioning
confidence: 99%