2013 International Conference on Advanced Cloud and Big Data 2013
DOI: 10.1109/cbd.2013.34

Superset: A Non-uniform Replica Placement Strategy towards High-Performance and Cost-Effective Distributed Storage Service

Abstract: Load balance and power proportionality are both important aspects in constructing high-performance and cost-effective distributed storage systems. However, traditional replica placement strategies towards load balance usually produce scattered replica layouts which disable power proportionality, while recent strategies towards power proportionality are typically based on uniform replication which compromises the ability of load balance. In this article, we introduce Superset (an organized non-uniform replica pl…

Citations: cited by 7 publications (8 citation statements)
References: 18 publications
“…In such systems, such as Green-HDFS [22], the ultimate goal is to efficiently distribute the machines between these different areas, maximizing the overall performance thanks to improvements in the hot zone, minimizing the overall energy consumption thanks to improvements in the cold zone, increasing the time response as little as possible when reading files from machines switched off (in GreenHDFS, only 2.1% of the readings were affected by this temporary penalty due to switching on the machine at the time of the reading), thereby significantly reducing the energy consumption of servers: 24% in the case of GreenHDFS [23]. • Dynamic replication: Other solutions, such as Superset [27], take the above strategies as a starting point, but also take into account the "temperature" of the data above a threshold, not only to power on/off machines, but also to increase or decrease the number of copies of stored data, thereby preserving the availability of data and reducing overall energy consumption thanks to the switching on/off policies and improved performance. This is achieved by transferring storage space and computing power from the cold files that are not frequently used, to those files that need these resources, i.e, the hottest files.…”
Section: Problem Analysis (mentioning)
confidence: 99%
“…The authors propose an energy-aware scheduling policy based on Dynamic Voltage and Frequency Scaling (DVFS) in [32]. Moreover, various approaches look for the reduction of the energy consumption by applying energy-proportionality models based on power-proportional distributed file systems in [33]-[35] which generally aim to switch storage-servers off when the replicated data is not needed.…”
Section: Energy Efficiency In Data Centres (mentioning)
confidence: 99%
“…For Data Replication and Placement to improve energy proportionality, many solutions [3,91,106,145] have been proposed. In [3], authors describe a power-proportional distributed file system which stores copies of data on non-overlapping subsets of resources.…”
Section: Software Solutions For Infrastructure Efficiency (mentioning)
confidence: 99%