Design, Automation &Amp; Test in Europe Conference &Amp; Exhibition (DATE), 2017 2017
DOI: 10.23919/date.2017.7927007
|View full text |Cite
|
Sign up to set email alerts
|

Evaluating impact of human errors on the availability of data storage systems

Abstract: In this paper, we investigate the effect of incorrect disk replacement service on the availability of data storage systems. To this end, we first conduct Monte Carlo simulations to evaluate the availability of disk subsystem by considering disk failures and incorrect disk replacement service. We also propose a Markov model that corroborates the Monte Carlo simulation results. We further extend the proposed model to consider the effect of automatic disk fail-over policy. The results obtained by the proposed mod… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
8
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
2

Relationship

4
2

Authors

Journals

citations
Cited by 6 publications
(8 citation statements)
references
References 6 publications
0
8
0
Order By: Relevance
“…A large body of research has investigated and tried to improve the reliability of disk arrays [45], [53], [54], [55], [56], [57], [58], [59], [60], [35], [61]. For the sake of brevity, here we focus on the studies concentrating on SSD arrays.…”
Section: B Analysis and Modeling Of Ssd Array Reliabilitymentioning
confidence: 99%
“…A large body of research has investigated and tried to improve the reliability of disk arrays [45], [53], [54], [55], [56], [57], [58], [59], [60], [35], [61]. For the sake of brevity, here we focus on the studies concentrating on SSD arrays.…”
Section: B Analysis and Modeling Of Ssd Array Reliabilitymentioning
confidence: 99%
“…Increasing number of I/O intensive applications such as Online Transaction Processing (OLTP), High Performance Computing (HPC), web, and email applications arises the demand in data-centers for high-performance storage systems. The most common approach to improving the performance of storage systems is to employ Solid-State Drives (SSDs) [1] in the caching layer of the disk subsystems [2], [3], [4], [5], [6], which are mainly built upon low-performance and lowreliable Hard Disk Drives (HDD) [7], [8], [9] or mid-range SSDs (as shown in Fig. 1).…”
Section: Introductionmentioning
confidence: 99%
“…The availability and reliability of Information systems is seriously affected by human errors [1], [2], [3], [4] where some field studies report human errors as the cause of 19% of system failures [5], [3]. Large datacenters with Exa-Byte (EB) storage capacity (by employing millions of disks drives) are expected to face at least a disk failure per hour.…”
Section: Introductionmentioning
confidence: 99%
“…To this end, we analyze the possible combinations of operational 3 A task that removes LSEs by periodically reading the disk data and checking it with its parity, correcting the corrupted data using the parity and moving it to a new location, and mapping out the damaged sectors. 4 An event in which the whole data of RAID5 array is lost, due to the consecutive failure of two disks. 5 While the incorrect repair service can have many different roots and happen in many different conditions, in this work we focus on IDRS.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation