Advantages of RAID6 over RAID5 For Video Surveillance

By Carl Lindgren, Published on Apr 15, 2009

For large-scale video surveillance deployments, such as casinos, the enhanced redundancy that RAID6 provides over RAID5 is critical to minimizing video loss and ensuring system performance. [Note: If you are not familiar with RAID, view a RAID tutorial and a general comparison between RAID5 and RAID6 [link no longer available]]

Background

We have been recording all of our cameras using an NVR system since late 2003.  Our original system consisted of 28 servers, each recording up to 32 cameras.  The servers originally used 16-bay RAIDs with 250GB drives in a RAID 5 configuration.  The majority of the RAIDs were SCSI/PATA, which means they used standard IDE desktop drives in the RAID enclosure.  These drives were not designed to handle continuous video recording and began to fail at an alarming rate within a year.  Our drive vendor replaced these with RAID Edition drives in early 2005, which resolved some of the issues.  At the time, we had a bit over 830 drives in use.

Drive Failures

Even after replacing all 830 drives, we still experienced drive failures.  This is normal for any large system.  It has been estimated that approximately 1% of installed hard drives will fail in the first year of operation, with that rate climbing as the drives age.  Hard drives can fail in many ways, and RAID systems can recover from most failures by rebuilding the RAID set using the parity information that is striped across the drives.
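To put the 1% figure in perspective, here is a rough back-of-the-envelope calculation in Python. It assumes drive failures are independent and uses a flat 1% first-year rate; both are simplifications, and the function names are purely illustrative.

```python
def expected_failures(drive_count: int, annual_failure_rate: float) -> float:
    """Expected number of drive failures in one year."""
    return drive_count * annual_failure_rate


def prob_at_least_one_failure(drive_count: int, annual_failure_rate: float) -> float:
    """Chance that at least one of the drives fails in one year,
    assuming failures are independent of one another."""
    return 1.0 - (1.0 - annual_failure_rate) ** drive_count


# Whole installation: a bit over 830 drives at roughly 1% per year.
fleet_failures = expected_failures(830, 0.01)

# A single 16-bay enclosure.
enclosure_risk = prob_at_least_one_failure(16, 0.01)

print(f"Expected failures per year across 830 drives: {fleet_failures:.1f}")
print(f"Chance a 16-bay enclosure sees a failure in a year: {enclosure_risk:.1%}")
```

Even at the optimistic first-year rate, a system of this size should expect several failures every year, so the rebuild process gets exercised regularly.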

RAID 5 uses one parity stripe to store data that can be used to reconstruct the contents of a failed drive onto a replacement drive.  That is why most RAID manufacturers recommend installing at least one global hot spare in each RAID chassis.  When a RAID encounters an error with a hard drive, it “rebuilds” the data that was on the failed drive onto the spare using the parity data.  The failed drive can then be replaced with a new drive, which is designated as the new hot spare.  This process can be repeated as drives fail and, in theory, will keep the RAID storage operating continuously with no data lost.
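The rebuild idea can be illustrated with a minimal Python sketch. It assumes the usual XOR parity used by RAID 5 and models a single stripe as a handful of small byte strings; real controllers work on much larger blocks and rotate parity across drives, which is not shown here. The `xor_blocks` helper is an illustrative name.

```python
from functools import reduce


def xor_blocks(blocks):
    """XOR a list of equal-length byte blocks together, byte by byte."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


# One stripe: three data blocks, each on a different drive.
data = [b"CAM1", b"CAM2", b"CAM3"]

# The parity block is the XOR of all data blocks in the stripe.
parity = xor_blocks(data)

# Drive 1 fails. Its block is rebuilt from the surviving data plus parity.
survivors = [data[0], data[2], parity]
rebuilt = xor_blocks(survivors)

print(rebuilt == data[1])
```

Note that the rebuild needs every surviving block in the stripe to be readable. That requirement is what makes the failure scenario described later so dangerous.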

Unfortunately, there are drive failure scenarios that cannot be accommodated by most RAID storage systems that are used for recording video.  This issue is unique to video recording and seldom surfaces in RAID systems used by other applications.  The key is that for most applications, written data is “verified” during the write process.  This means that after a piece of data is written, it is read and compared to the original data before the next piece is written.  If the compare process fails, the area of the disk that failed is marked bad by the drive and the data is re-written to another area of the disk reserved for that purpose.

This process works well when the system has the time to verify the write and repair any errors encountered.  For most applications, there is no requirement to write data continuously and the computer’s operating system can wait the relatively short period required to verify each write and relocate data if an error is encountered.
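The write-verify-relocate cycle described above can be sketched in Python. This is a toy model, not how any particular drive or controller implements it: `FakeDisk`, its `bad` and `spares` fields, and `write_with_verify` are all hypothetical names invented for illustration.

```python
class FakeDisk:
    """Toy disk: sectors in `bad` silently store corrupted data."""

    def __init__(self, bad_sectors=(), spare_sectors=()):
        self.sectors = {}
        self.bad = set(bad_sectors)
        self.spares = list(spare_sectors)
        self.remap = {}  # logical sector -> spare sector

    def _physical(self, lba):
        return self.remap.get(lba, lba)

    def raw_write(self, lba, data):
        phys = self._physical(lba)
        if phys in self.bad:
            # Model a bad area: what lands on disk is not what was sent.
            data = bytes(b ^ 0xFF for b in data)
        self.sectors[phys] = data

    def raw_read(self, lba):
        return self.sectors.get(self._physical(lba))


def write_with_verify(disk, lba, data):
    """Write, read back, compare; relocate to a spare sector on mismatch."""
    disk.raw_write(lba, data)
    if disk.raw_read(lba) == data:
        return True
    if not disk.spares:
        return False
    disk.remap[lba] = disk.spares.pop(0)  # mark bad and relocate
    disk.raw_write(lba, data)
    return disk.raw_read(lba) == data


disk = FakeDisk(bad_sectors={7}, spare_sectors=[1000])
ok = write_with_verify(disk, 7, b"frame-0001")
print(ok, disk.remap)
```

The extra read after every write is what catches the bad sector. It is also the cost that a recorder writing video continuously cannot easily afford.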

Video recording is a completely different animal.  It has been estimated that CCTV video recording is 90% write versus 10% read.  I am of the opinion that this is a conservative estimate.  An analysis of our system leads me to estimate that the ratio is somewhere between 99% write to 1% read and 99.9% write to 0.1% read.  RAID systems set up for video recording are seldom, if ever, set up to verify the data as it is written.

This sets up a possibly fatal scenario.  One of the failure modes of computer hard drives is something called “Read Element Failure”.  The best definition I can find is that the drive is unable to read all or part of the data written to it.  This could be the result of a complete failure of one of the read heads, or just a bad area of a disk that has not been relocated by the drive’s automatic systems.

Since the drives in a video recording system don’t normally automatically read the data after it is written and the system operators only play back a very small fraction of the video being recorded, a drive could happily chug along writing data that is unreadable for a long time.  Neither the system nor the operators would ever know that there is a problem.  That is, until a drive fails with a problem that is recognized by the RAID system.

When the RAID system encounters a drive failure that it recognizes, it will attempt to rebuild the RAID set using the parity data recorded across all of the drives.  That is where the problem becomes acute.  If the RAID system also contains a drive that has a Read Element Failure, it is very possible that bad area contains parity data.  If it does, the rebuild will fail.

On a RAID 5 system, if a rebuild fails because the parity data is corrupt or unreadable, the system now has two bad drives and the RAID set is lost.  This happened to us at least six times during the three years that we used our original RAID 5 systems.
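The failed-rebuild scenario can be sketched in Python as follows. The sketch assumes XOR parity and models an unreadable block as `None`; `UnreadableBlock` and `rebuild_stripe` are illustrative names, not part of any RAID product.

```python
from functools import reduce


class UnreadableBlock(Exception):
    """Raised when a surviving drive cannot return a block needed for rebuild."""


def xor_blocks(blocks):
    """XOR equal-length byte blocks together, byte by byte."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


def rebuild_stripe(stripe, failed_index):
    """Rebuild the block of the failed drive from every other block in the stripe.

    `stripe` holds one block per drive (data and parity); None marks a block
    that was written but can no longer be read.
    """
    survivors = []
    for index, block in enumerate(stripe):
        if index == failed_index:
            continue
        if block is None:
            raise UnreadableBlock(f"drive {index} cannot be read during rebuild")
        survivors.append(block)
    return xor_blocks(survivors)


data = [b"AAAA", b"BBBB", b"CCCC"]
healthy = data + [xor_blocks(data)]          # last entry is parity
print(rebuild_stripe(healthy, failed_index=1) == b"BBBB")

# Same stripe, but the parity block sits on a drive with a silent read failure.
latent = data + [None]
try:
    rebuild_stripe(latent, failed_index=1)
    rebuild_failed = False
except UnreadableBlock as err:
    rebuild_failed = True
    print("Rebuild failed:", err)
```

With single parity there is nothing left to fall back on once a second block in the same stripe is unreadable, which matches the lost RAID sets described above.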

RAID 6

RAID 6 works a bit differently than RAID 5.  Although it can encounter the same drive failure scenarios as RAID 5, its ability to recover from them is greatly enhanced by the way RAID 6 records parity data.  Instead of writing one parity stripe across all drives in a RAID set, RAID 6 writes two completely independent parity stripes.  There are two advantages to this: RAID 6 is able to recover from the simultaneous failure of two drives in the enclosure, and its two parity stripes are in different areas, allowing the system to read parity even through multiple failures.
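To show how two independent parities allow two lost blocks to be recovered, here is a minimal Python sketch. It assumes the common P+Q scheme (P is plain XOR, Q is a Reed-Solomon style sum over GF(2^8) with polynomial 0x11D and generator 2). The article does not say which scheme our controllers used, so treat this as one plausible illustration; all function names are hypothetical.

```python
def gf_mul(a, b):
    """Multiply two bytes in GF(2^8), reducing by the polynomial 0x11D."""
    result = 0
    while b:
        if b & 1:
            result ^= a
        a <<= 1
        if a & 0x100:
            a ^= 0x11D
        b >>= 1
    return result


def gf_pow(a, n):
    """Raise a to the n-th power in GF(2^8) by repeated multiplication."""
    result = 1
    for _ in range(n):
        result = gf_mul(result, a)
    return result


def gf_inv(a):
    """Multiplicative inverse of a nonzero byte: a^254 in GF(2^8)."""
    return gf_pow(a, 254)


def compute_pq(data):
    """Return (P, Q) for a stripe: P = XOR of blocks, Q = XOR of 2^i * block i."""
    p = bytearray(len(data[0]))
    q = bytearray(len(data[0]))
    for i, block in enumerate(data):
        coeff = gf_pow(2, i)
        for j, byte in enumerate(block):
            p[j] ^= byte
            q[j] ^= gf_mul(coeff, byte)
    return bytes(p), bytes(q)


def recover_two(data, x, y, p, q):
    """Recover data blocks x and y (both None in `data`) from P and Q."""
    zeroed = [b if b is not None else bytes(len(p)) for b in data]
    p_partial, q_partial = compute_pq(zeroed)
    gx, gy = gf_pow(2, x), gf_pow(2, y)
    denom_inv = gf_inv(gx ^ gy)
    dx, dy = bytearray(len(p)), bytearray(len(p))
    for j in range(len(p)):
        p_delta = p[j] ^ p_partial[j]   # equals Dx ^ Dy
        q_delta = q[j] ^ q_partial[j]   # equals gx*Dx ^ gy*Dy
        dx[j] = gf_mul(denom_inv, q_delta ^ gf_mul(gy, p_delta))
        dy[j] = p_delta ^ dx[j]
    return bytes(dx), bytes(dy)


data = [b"CAM1", b"CAM2", b"CAM3", b"CAM4"]
p, q = compute_pq(data)
damaged = [data[0], None, data[2], None]      # two drives lost at once
recovered = recover_two(damaged, 1, 3, p, q)
print(recovered == (data[1], data[3]))
```

Because P and Q give two independent equations per byte, two unknown blocks can be solved for. That is the property that lets a rebuild survive a second failure partway through.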

We have proven this in our own recording environment.  In 2006, we replaced all of our servers and RAIDs.  Our new RAIDs were set up, at our insistence, as RAID 6.  Although we have experienced at least three instances where two drives failed in an enclosure, including at least two instances where the second drive failed during the rebuild process, we have never lost any data.  The systems rebuilt both failed drives and continued to run flawlessly.

Conclusion

For these reasons, I would never recommend using RAID 5 in a critical video recording environment.  The risks of data loss are too great.

 
