| RAID Alerting |
|
|
|
| Written by Tom Hirt | ||||
| Thursday, 18 June 2009 12:25 | ||||
|
Linux RAID AlertingIn this KB, we will show you how to use mdadm to notify you should a device in your Linux Software RAID array fail. Monitoring your RAID array is important because you must replace a failed device as soon as possible. This is because different levels of RAID have varying abilities to sustain device loss (see our Linux RAID How-to for a description of the different RAID levels and their sustainability with failed devices) so it's important you treat failures with urgency and replace the failed devices as quickly as possible. We have seen in some of our other RAID KB's (Removing Failed RAID Devices and Staring and Stopping RAID Arrays) some examples of commands that can be issued from the shell to get the status of the array ('cat /proc/mdstat' and 'mdadm --detail /dev/md0'.) However these commands won't help you detect or send an alert should a failure occur. In the following examples, we will show you how to be proactive and configure mdadm for alert notification. One final word of advice before we begin, this is a quick and dirty way to monitor RAID (not to mention the lack of scalability in the enterprise.) There are plenty of open source and commercial tools that can also be used to monitor RAID in the enterprise. If you plan to have several systems running software RAID, I would highly encourage the use of an external monitoring package. That said, Let's begin!
I'll leave it to you to figure out how best to setup alerting on your system. Hopefully these examples helped! Best of luck and thanks for reading!
|
||||
| Last Updated on Thursday, 18 June 2009 15:39 |