Date
December 20, 2009
Author
Vipul Agrawal, Chiranjib Bhattacharyya, Thirumale Niranjan, and Sai Susarla
In this paper, we consider events recorded from live disks and show that it is possible to construct decision support systems that can detect hard-disk failures.
Detecting impending failure of hard disks is an important prediction task which might help computer systems to prevent loss of data and performance degradation. Currently most of the hard drive vendors support self-monitoring, analysis and reporting technology (SMART) which are often considered unreliable for such tasks. The problem of finding alternatives to SMART for predicting disk failure is an area of active research. In this paper, we consider events recorded from live disks and show that it is possible to construct decision support systems which can detect such failures. It is desired that any such prediction methodology should have high accuracy and ease of interpretability. Black box models can deliver highly accurate solutions but do not provide an understanding of events which explains the decision given by it. To this end we explore rule based classifiers for predicting hard disk failures from various disk events. We show that it is possible to learn easy to understand rules, from disk events, which have extremely low false alarm rates on real world data.
In Proceedings of the International Conference on Machine Learning and Applications 2009 (ICMLA ’09)
Resources
The author's version of this paper is attached to this posting. Please observe the following copyright:© 2009 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.The definitive version of the paper can be found at: https://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=5382104.disk_failure_prediction_icmla.pdf