2011 IEEE International Conference on Cluster Computing 2011
DOI: 10.1109/cluster.2011.86
|View full text |Cite
|
Sign up to set email alerts
|

FastQuery: A Parallel Indexing System for Scientific Data

Abstract: Modern scientific datasets present numerous data management and analysis challenges. State-of-theart index and query technologies such as FastBit can significantly improve accesses to these datasets by augmenting the user data with indexes and other secondary information. However, a challenge is that the indexes assume the relational data model but the scientific data generally follows the array data model. To match the two data models, we design a generic mapping mechanism and implement an efficient input and… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
30
0

Year Published

2012
2012
2021
2021

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 45 publications
(30 citation statements)
references
References 21 publications
0
30
0
Order By: Relevance
“…In contrast, PARLO addresses heterogeneous access patterns induced by a range of general query types. Unlike prior post-processing approaches [5], [25], PARLO is integrated with parallel I/O middleware to achieve efficient run-time in-memory layout optimization and index building.…”
Section: Run-time Layout Optimization Performance Evaluationmentioning
confidence: 99%
See 1 more Smart Citation
“…In contrast, PARLO addresses heterogeneous access patterns induced by a range of general query types. Unlike prior post-processing approaches [5], [25], PARLO is integrated with parallel I/O middleware to achieve efficient run-time in-memory layout optimization and index building.…”
Section: Run-time Layout Optimization Performance Evaluationmentioning
confidence: 99%
“…For instance, SciDB [2] and work on space-filling curves (SFC) [17] focus on spatial LO. Likewise, FastBit [5], [25] and ISABELA-QA [16] explore value-based LO methods. However, systems optimized for only a single access pattern cannot address the mix of access patterns observed in practice.…”
Section: Introductionmentioning
confidence: 99%
“…In this work, we use FastQuery [8], [6], [7] to accelerate the data analysis process of the trillion particle dataset. Here, we briefly recap the salient features of FastQuery, and elaborate on the new hybrid parallel implementation.…”
Section: B Indexing/querying With Hybrid Parallel Fastquerymentioning
confidence: 99%
“…Supposing a floating-point number with a very extensive domain, the use of inequalities can be more suitable to generate indexes (less number of bitmap columns) and query certain intervals (of values) of the attribute indexed (search space limited by the generated indexes). FastBit tool, FastQuery [29] and SDS/Q framework are examples of related work that employ bitmap indexing. However, none of these solutions can manage data element through dataflow generation.…”
Section: Indexing Raw Data From Filesmentioning
confidence: 99%