2014 IEEE International Conference on Big Data (Big Data)
DOI: 10.1109/bigdata.2014.7004238

Virtual chunks: On supporting random accesses to scientific data in compressible storage systems

Abstract: Data compression could ameliorate the I/O pressure of scientific applications on high-performance computing systems. Unfortunately, naively applying data compression to whole files or to individual blocks forces a dilemma between efficient random accesses and high compression ratios. File-level compression can barely support efficient random accesses to the compressed data: any retrieval request must trigger decompression from the beginning of the compressed file. Block-level compression…
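The tradeoff the abstract describes can be illustrated with a minimal sketch (not the paper's virtual-chunk design): compressing fixed-size chunks independently and keeping an offset index lets a reader decompress only the chunk containing the requested byte, instead of decompressing from the start of the file as whole-file compression requires. The chunk size and helper names below are illustrative assumptions.

```python
import zlib

CHUNK = 4096  # hypothetical chunk size; smaller chunks mean faster
              # random access but a lower overall compression ratio

def compress_chunked(data: bytes):
    """Compress each fixed-size chunk independently; return the
    concatenated compressed payload plus an index of
    (offset, compressed_length) pairs, one per chunk."""
    payload, index = bytearray(), []
    for i in range(0, len(data), CHUNK):
        c = zlib.compress(data[i:i + CHUNK])
        index.append((len(payload), len(c)))
        payload += c
    return bytes(payload), index

def read_at(payload: bytes, index, pos: int) -> int:
    """Read one byte at logical position `pos` by decompressing only
    the chunk that contains it, not the whole stream."""
    off, length = index[pos // CHUNK]
    chunk = zlib.decompress(payload[off:off + length])
    return chunk[pos % CHUNK]

data = bytes(range(256)) * 64          # 16 KiB of sample data
payload, index = compress_chunked(data)
assert read_at(payload, index, 5000) == data[5000]
```

With file-level compression the same lookup would require inflating every byte up to position 5000; here only one 4 KiB chunk is decompressed.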


Cited by 14 publications (6 citation statements)
References 31 publications
“…In the future, we will integrate the proposed caching mechanism into other systems such as file compression [62,63], data provenance [64,65], and job scheduling [66]. We plan to further investigate the tradeoff between performance (for example, GPU acceleration [67]) and cost (for example, scientific applications on EC2 [68]) with the introduction of a memory-class cache, and explore the viability of extending the current approach into incremental mechanisms [69-71].…”
Section: Discussion
Mentioning (confidence: 99%)
“…This section presents some real systems that have adopted ZHT as a building block. It has also led to additional publications.…”
Section: ZHT as a Building Block for Distributed Systems
Mentioning (confidence: 99%)
“…It is not uncommon to deal with large objects in a distributed system. In scientific computing, for example, as our prior work showed, datasets were so large that they were usually compressed before being serialized onto the hard disk. As another example, in our previous work on implementing a distributed file system, we showed that a large directory composed of thousands of small files could result in a huge metadata blob stored in a distributed key-value store.…”
Section: Design and Analysis
Mentioning (confidence: 99%)