Hard disk failure data in a data centre


Backblaze is a cloud storage service that's astonishingly easy and low-cost, with 200 million GB stored and 10 billion files recovered.

Backblaze regularly publishes studies and observations of the hard drives in its environment, including hard drive failure rates and the effect of temperature on hard drive performance.

Overview of the Hard Drive Data

Each day in the Backblaze data center, a snapshot of each operational hard drive is taken. This snapshot includes basic drive information along with the S.M.A.R.T. statistics reported by that drive. The daily snapshot of one drive is one record, or row, of data. All of the drive snapshots for a given day are collected into a single CSV (comma-separated values) file with one row for each active hard drive. Each daily file is named in the format YYYY-MM-DD.csv, for example, 2013-04-10.csv.
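As a small illustration of the naming scheme, the sketch below maps a calendar date to its snapshot file name and back. The helper names `snapshot_filename` and `snapshot_date` are hypothetical and exist only for this example.

```python
from datetime import date, datetime
from pathlib import Path


def snapshot_filename(day: date) -> str:
    """Return the daily snapshot file name for a date (YYYY-MM-DD.csv)."""
    return f"{day.isoformat()}.csv"


def snapshot_date(filename: str) -> date:
    """Recover the date from a file name of the form YYYY-MM-DD.csv."""
    stem = Path(filename).stem  # drops the ".csv" extension
    return datetime.strptime(stem, "%Y-%m-%d").date()


print(snapshot_filename(date(2013, 4, 10)))
print(snapshot_date("2013-04-10.csv"))
```

Parsing the date back out of the name is handy when looping over a folder of daily files in chronological order.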

The first row of each file contains the column names; the remaining rows are the actual data. The columns are as follows:

  • Date – The date of the file in yyyy-mm-dd format.
  • Serial Number – The manufacturer-assigned serial number of the drive.
  • Model – The manufacturer-assigned model number of the drive.
  • Capacity – The drive capacity in bytes.
  • Failure – Contains a “0” if the drive is OK. Contains a “1” if this is the last day the drive was operational before failing.
  • 2013-2014 SMART Stats – 80 columns of data holding the Raw and Normalized values for 40 different SMART stats, as reported by the given drive. Each value is the number reported by the drive.
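The column layout above can be sketched with Python's standard `csv` module. The header spellings used here (`date`, `serial_number`, `model`, `capacity_bytes`, `failure`) are assumptions about how the columns are named in the file, and the two rows are made-up placeholders, not real drive records.

```python
import csv
import io

# Placeholder stand-in for one daily file; all values are invented for illustration.
SAMPLE = """date,serial_number,model,capacity_bytes,failure
2013-04-10,SN-EXAMPLE-1,MODEL-A,3000592982016,0
2013-04-10,SN-EXAMPLE-2,MODEL-B,2000398934016,1
"""


def load_snapshot(handle):
    """Parse one daily snapshot into a list of dicts, one per drive."""
    rows = []
    for row in csv.DictReader(handle):  # first line supplies the column names
        row["capacity_bytes"] = int(row["capacity_bytes"])
        row["failure"] = int(row["failure"])  # 1 = last operational day before failing
        rows.append(row)
    return rows


rows = load_snapshot(io.StringIO(SAMPLE))
failed = [r["serial_number"] for r in rows if r["failure"] == 1]
print(len(rows), "drives,", len(failed), "failed:", failed)
```

For a real daily file, the same function could be called with `open("2013-04-10.csv", newline="")` in place of the in-memory sample.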

What are S.M.A.R.T. attributes?

S.M.A.R.T. (Self-Monitoring, Analysis and Reporting Technology; often written as SMART) is a monitoring system included in computer hard disk drives (HDDs) and solid-state drives (SSDs) that detects and reports on various indicators of drive reliability, with the intent of enabling the anticipation of hardware failures.

From this dataset, we use only five S.M.A.R.T. features:

  • SMART_1 : Read Error Rate
  • SMART_5 : Reallocated Sectors Count
  • SMART_9 : Power-On Hours
  • SMART_194 : Temperature Celsius
  • SMART_197 : Current Pending Sector Count
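One way to narrow a record down to these five attributes is sketched below. It assumes the SMART columns are named `smart_<id>_raw` and `smart_<id>_normalized`, which is an assumption about the file headers rather than something stated above; the helper names and the example row are likewise hypothetical.

```python
# The five SMART attributes used here, keyed by attribute ID.
SMART_FEATURES = {
    1: "Read Error Rate",
    5: "Reallocated Sectors Count",
    9: "Power-On Hours",
    194: "Temperature Celsius",
    197: "Current Pending Sector Count",
}


def smart_columns(kind="raw"):
    """Build the assumed column names for the selected attributes."""
    return [f"smart_{attr_id}_{kind}" for attr_id in SMART_FEATURES]


def select_features(row, kind="raw"):
    """Keep identifying fields plus the five SMART columns; missing keys become None."""
    keep = ["date", "serial_number", "failure"] + smart_columns(kind)
    return {name: row.get(name) for name in keep}


# Made-up record for illustration only, including one column that should be dropped.
example = {
    "date": "2013-04-10", "serial_number": "SN-EXAMPLE-1", "failure": "0",
    "smart_1_raw": "0", "smart_5_raw": "0", "smart_9_raw": "1200",
    "smart_194_raw": "25", "smart_197_raw": "0", "smart_3_raw": "500",
}
print(select_features(example))
```

Passing `kind="normalized"` selects the normalized values instead of the raw ones.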
