2010
DOI: 10.14778/1920841.1920991
|View full text |Cite
|
Sign up to set email alerts
|

Efficient B-tree based indexing for cloud data processing

Abstract: A Cloud may be seen as a type of flexible computing infrastructure consisting of many compute nodes, where resizable computing capacities can be provided to different customers. To fully harness the power of the Cloud, efficient data management is needed to handle huge volumes of data and support a large number of concurrent end users. To achieve that, a scalable and high-throughput indexing scheme is generally required. Such an indexing scheme must not only incur a low maintenance cost but also support parall… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
62
0
1

Year Published

2012
2012
2021
2021

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 132 publications
(63 citation statements)
references
References 18 publications
0
62
0
1
Order By: Relevance
“…Several studies [15][16][17][18][19][20][21][22][23] focusing on efficient indexes in cloud storage systems have been conducted. The study in [15] proposed a Trojan index to improve runtime performance.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Several studies [15][16][17][18][19][20][21][22][23] focusing on efficient indexes in cloud storage systems have been conducted. The study in [15] proposed a Trojan index to improve runtime performance.…”
Section: Related Workmentioning
confidence: 99%
“…However, it consumes considerable memory space to cache index information in the client, and it is unsuitable for processing multidimensional queries. The studies in [18,19] proposed an improved B+ tree index. This solution adopt a doublelayer index framework.…”
Section: Related Workmentioning
confidence: 99%
“…However, none of these methods provides real-time OLAP functionality. There are various publications on distributed B-trees for cloud platforms such as [29]. However, these method only supports 1-dimensional indices which are insufficient for OLAP queries.…”
Section: Related Workmentioning
confidence: 99%
“…Therefore, the hash function becomes the main index of data, and the required data can be quickly accessed according to the hash value of keys [5,6,7]. However, in addition to the data query via keys, users also turn to other properties for point search or range search [8]. For example, in an online video system (such as Youtube [9]), each video contains a variety of information, including video ID, program name, upload time, times of plays.…”
Section: Introductionmentioning
confidence: 99%
“…At present, in the cloud computing environment, inverted index, the commonly used secondary index, can scan all storage nodes by multiple MapReduce [10] processes and generate inverted files. Inverted index is an off-line batch process, and it cannot realize timely query of newly inserted data [8]. For example, the record inserted into Google Base cannot be accessed by users until it is re-indexed next time (maybe one day later).…”
Section: Introductionmentioning
confidence: 99%