NetApp Monitoring

List of Instance Names

Jun 3, 2016 1 min read Check NetApp-ZAPI NetApp Monitoring

Today I was asked if it were possible to add a switch to multi-instance checks, such as the check for volumes or aggregates, that returns a list of instance names which could then be used for other purposes (e.g. creating checks using the op5 API). I am leaning towards creating a separate script for this feature. $ …

Testing/Fixing Release 3.4.1_01

Mar 16, 2016 1 min read Check NetApp-ZAPI DiskCount LunSize NetApp Monitoring ShelfEnvironment Usage spare-disk

3.4.1_01 FIXES USAGE/SHELF-ENVIRONMENT, DISKCOUNT, WORKLOAD, LUNSIZE Released 2016-03-18 Fixed Usage-Check: Handling of volumes with no state-attribute shelf-environment-object in cm-getter needs a node-name (sometimes) New and improved new DiskCount check (counts spare-disks per type or storage-pool) new …

No instances should be ok?!

Mar 9, 2016 2 min read Check NetApp-ZAPI Alerting NetApp Monitoring

The --no_instances=OK switch has been available for some time now. It prevents for example a LUN check that can’t find any LUNs with a not always matching UNKNOWN not to seek any unneeded attention. Recently, we received an increasing number of requests that express the desire for such a switch for collectors …

-timeout Bug

Mar 8, 2016 1 min read Check NetApp-ZAPI NetApp Monitoring v3.5

Current versions of check_netapp_pro.pl ignore the --timeout|-t parameter (which limits the runtime of the check). This only affects checks listed with check_netapp_pro.pl. Getter and setter and stand-alone checks are not affected. Since** check_netapp_pro.pl reads data from the local stores, timeouts very rarely …

Snapshot Age Warning

Jan 8, 2016 1 min read Check NetApp-ZAPI Alerting NetApp Monitoring Snapshots

Snapshots can be used not only to monitor volume usage by snapshots, but also the number of snapshots that meet a certain criteria, e.g snapshots that exceed or fall bellow a certain age or have a specific name. Finding outdated snapshots In order to receive alarms for orphaned and outdated snapshots, one would use the …

Configuring the Raidstatus Check properly

Nov 13, 2015 1 min read Check NetApp-ZAPI NetApp Monitoring

Raidstatus monitors the redundancy or the RAID status of one or more aggregates. It is important to make sure that the patterns viewed as OK are as accurate as possible. An example for an OK pattern is the raid_dp status, not OK would be raid_dp, degraded. However, the following configuration --ok_state=raid_dp is …

Testing-Release 3.2.0_01: 7-Mode Checks and Disk-Inventory Info

Sep 10, 2015 1 min read Check NetApp-ZAPI NetApp Monitoring

The Testing-Release 3.2.0_01 will be available 14.09.2015. Should you be interested in receiving this non-stable release for testing purposes, please send us a message at Distribution . from the CHANGES: New and improved SnapMirror has a getter for 7-mode New Check for 7-mode: check_netapp7_snapvault New Check for …

Micro delta - Questions and Answers

Jul 7, 2015 2 min read Check NetApp-ZAPI NetApp Monitoring

I received the following question: If we understand correctly, the blog entry and the microdelta_example.cfg explain that the parameter –delta=10 [–tolerance=5] overrides the interval (e.g 3 mins) set by the getter by collecting the data twice during the 3 minutes?! That is not entirely true. The switch …

Check for Offline or Failed Disks

Jan 23, 2015 2 min read Check NetApp-ZAPI failed disk Monitoring Nagios Plugins for NetApp NetApp NetApp Monitoring offline disks

The new **Disk **check monitors the disks of a NetApp Filer. Depending on the settings, the check sends an error message when offline or failed disks are found. An example using a 7-Mode Filer: $ **./check_netapp_pro.pl Disk -H sim812 ‑‑what=failed** NETAPP_PRO DISK CRITICAL - 14 disks checked, 1 critical and 0 …

Performance Tuning

Oct 6, 2014 1 min read Check NetApp-ZAPI NetApp Monitoring

First signs that Clustermode collectors are reaching their limits in larger environments are emerging. In their current form, the collectors are not able to finish collecting and storing all the data within 60 seconds. Therefore, we have optimized the collectors for volume and _col_snapshot _for experimental purposes. …