2018
DOI: 10.1007/s10723-018-9431-9
|View full text |Cite
|
Sign up to set email alerts
|

iHOME: Index-Based JOIN Query Optimization for Limited Big Data Storage

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
9

Relationship

2
7

Authors

Journals

citations
Cited by 16 publications
(6 citation statements)
references
References 32 publications
0
6
0
Order By: Relevance
“…This protocol shows that SQL systems are not practical for single-patient queries since response times are slower. The query optimization problem was addressed in [22], where the authors addressed the issue of the efficient execution of JOIN queries in the Hadoop query language, Hive, over limited big data storages.…”
Section: Related Workmentioning
confidence: 99%
“…This protocol shows that SQL systems are not practical for single-patient queries since response times are slower. The query optimization problem was addressed in [22], where the authors addressed the issue of the efficient execution of JOIN queries in the Hadoop query language, Hive, over limited big data storages.…”
Section: Related Workmentioning
confidence: 99%
“…In particular, the BlockJoin algorithm is based on two concepts, index-join and late materialization, which are known in the context of parallel dataflow engines. In addition, an index-based system for reusing data called indexing HiveQL Optimization for join over Multisession Big Data Environment (iHOME) was presented [26]. e proposed iHOME system addresses eight cases of join queries which are classified into three groups: Similar-to-iHOME, Compute-on-iHOME, and Filter-of-iHOME.…”
Section: Join Optimization Different Mapreduce Join Strategiesmentioning
confidence: 99%
“…Substantially, the shuffle step is considered expensive since it needs to sort and join all tuples. erefore, the shuffling operations need to be optimized to improve the join performance and reduce the total intermediate data size of join query [26]. However, exploiting sharing opportunities including loading, sorting, and joining data among multiple join queries is a challenging task.…”
Section: Introductionmentioning
confidence: 99%
“…All algorithms are implemented and tested in Hadoop (HDFS) and HBase, utilizing different queries on tables of various sizes and different score-attribute distributions. The query optimization problem is addressed in [20], where authors deal with the efficiently execution of JOIN queries on top of Hadoop query language, Hive, over limited Big Data storages. A novel data integration methodology to query data individually from different relational and NoSQL database systems is proposed in [21].…”
Section: Background and Related Workmentioning
confidence: 99%