2015 IEEE 34th Symposium on Reliable Distributed Systems Workshop (SRDSW) 2015
DOI: 10.1109/srdsw.2015.15
|View full text |Cite
|
Sign up to set email alerts
|

Hard Drive Failure Prediction Using Big Data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
8
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 31 publications
(8 citation statements)
references
References 6 publications
0
8
0
Order By: Relevance
“…The time to failure or censoring of each hard drive and its related smart stats values are obtained using SQL queries. According to Yang et al 26 . and Klein 27 smart stats given in Table 6 can be considered informational:…”
Section: Fitting the Backblaze Hard Drive Datamentioning
confidence: 99%
See 1 more Smart Citation
“…The time to failure or censoring of each hard drive and its related smart stats values are obtained using SQL queries. According to Yang et al 26 . and Klein 27 smart stats given in Table 6 can be considered informational:…”
Section: Fitting the Backblaze Hard Drive Datamentioning
confidence: 99%
“…The logistic regression is performed assuming a Gompertz link function, which provides more flexibility than Logit and Probit functions (Trexler et al., 34 XLSTAT 33 ). To optimize the logistic regression, smart stats are discretized, assuming equal frequencies (Yang et al 26 …”
Section: Fitting the Backblaze Hard Drive Datamentioning
confidence: 99%
“…Through the years, various ML and reasoning approaches based on algorithms such as Decision Trees (DT) [ 8 ], k-Nearest Neighbour [ 9 ], Classification Trees (CT) [ 10 ], and Regression Trees (RT) [ 11 ] have been adopted successfully for PdM (i.e., the failure prediction accuracy rates have improved) [ 12 , 13 , 14 ]. State of the art solutions such as [ 7 , 15 , 16 , 17 , 18 ] have shown to achieve more than 90% accuracy in failure prediction. However, ML on its own lacks context awareness [ 19 ] and therefore, lacks the benefits that comes with it.…”
Section: Introductionmentioning
confidence: 99%
“…This can cost in the form of server down, erased backup, lower reliability, unavailability of Internet and fiasco to fetch the latest data and compromised data storage in the data centers. To minimize the problem, HDDs conditions are monitored which helps in the detection of soon to fail drives [Strom, Lee, Tyndall et al (2007); Ma, Traylor, Douglis et al (2015); Yang, Hu, Liu et al (2015)]. The conditions of HDDs are supervised using sensors like acoustic emission, accelerometers, counters and thermal sensors.…”
Section: Introductionmentioning
confidence: 99%