2011 International Conference on Parallel Processing 2011
DOI: 10.1109/icpp.2011.54
|View full text |Cite
|
Sign up to set email alerts
|

SQLMR : A Scalable Database Management System for Cloud Computing

Abstract: Abstract-As the size of data set in cloud increases rapidly, how to process large amount of data efficiently has become a critical issue. MapReduce provides a framework for large data processing and is shown to be scalable and fault-tolerant on commondity machines. However, it has higher learning curve than SQL-like language and the codes are hard to maintain and reuse. On the other hand, traditional SQL-based data processing is familiar to user but is limited in scalability. In this paper, we propose a hybrid… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2013
2013
2022
2022

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 21 publications
(7 citation statements)
references
References 8 publications
0
7
0
Order By: Relevance
“…The architecture of SLMR consists of: (a) a SQL-MapReduce compiler, which converts SQL statements to sequential MapReduce jobs, (b) a query result manager, which searches the log to find if similar query results are available in the cache, (c) a database partitioning and indexing manager, which is responsible for managing data files, partitioning the new data, and creating indexes, and (d) an optimized Hadoop, which is responsible for the generation of optimized MapReduce jobs. Hsieh et al [116] conducted several experiments to illustrate the scalability of data and system with respect to increasing data sizes. The approach suffers from the network load unbalancing because of the random placement of reducers, causing the reducers to become stragglers on a busy rack.…”
Section: Data Management Approaches and Systemsmentioning
confidence: 99%
See 2 more Smart Citations
“…The architecture of SLMR consists of: (a) a SQL-MapReduce compiler, which converts SQL statements to sequential MapReduce jobs, (b) a query result manager, which searches the log to find if similar query results are available in the cache, (c) a database partitioning and indexing manager, which is responsible for managing data files, partitioning the new data, and creating indexes, and (d) an optimized Hadoop, which is responsible for the generation of optimized MapReduce jobs. Hsieh et al [116] conducted several experiments to illustrate the scalability of data and system with respect to increasing data sizes. The approach suffers from the network load unbalancing because of the random placement of reducers, causing the reducers to become stragglers on a busy rack.…”
Section: Data Management Approaches and Systemsmentioning
confidence: 99%
“…Hsieh et al [116] proposed SQLMR, which is a data management system for the cloud. SQLMR combines SQL and MapReduce.…”
Section: Data Management Approaches and Systemsmentioning
confidence: 99%
See 1 more Smart Citation
“…A lot of time is consumed in pre-partitioning phase. Hsieh et al (2011) Implemented one system model named "SQLMR", which is a hybrid approach to fill the gap between SQL-based and MapReduce data processing. With effective part partitioning and B tree indexing, low overhead file construction, optimized rack awareness algorithm, query result cache mechanism the system produced best results as compare to HadoopDB.…”
Section: Related Workmentioning
confidence: 99%
“…These techniques can achieve automatic scalability by changing environments and loads. In [3], a hybrid solution, called SQLMR was proposed. The SQLMR combines the programming advantage of SQL with the fault tolerant, heterogeneous cluster, scalable capabilities of MapReduce.…”
Section: Introductionmentioning
confidence: 99%