Today I was asked if it were possible to add a switch to multi-instance checks, such as the check for volumes or aggregates, that returns a list of instance names which could then be used for other purposes (e.g. creating checks using the op5 API). I am leaning towards creating a separate script for this feature. $ …
The --no_instances=OK switch has been available for some time now. It prevents for example a LUN check that can’t find any LUNs with a not always matching UNKNOWN not to seek any unneeded attention. Recently, we received an increasing number of requests that express the desire for such a switch for collectors …
Current versions of check_netapp_pro.pl ignore the --timeout|-t parameter (which limits the runtime of the check). This only affects checks listed with check_netapp_pro.pl. Getter and setter and stand-alone checks are not affected. Since** check_netapp_pro.pl reads data from the local stores, timeouts very rarely …
Snapshots can be used not only to monitor volume usage by snapshots, but also the number of snapshots that meet a certain criteria, e.g snapshots that exceed or fall bellow a certain age or have a specific name. Finding outdated snapshots In order to receive alarms for orphaned and outdated snapshots, one would use the …
Raidstatus monitors the redundancy or the RAID status of one or more aggregates. It is important to make sure that the patterns viewed as OK are as accurate as possible. An example for an OK pattern is the raid_dp status, not OK would be raid_dp, degraded. However, the following configuration --ok_state=raid_dp is …
I received the following question: If we understand correctly, the blog entry and the microdelta_example.cfg explain that the parameter –delta=10 [–tolerance=5] overrides the interval (e.g 3 mins) set by the getter by collecting the data twice during the 3 minutes?! That is not entirely true. The switch …
First signs that Clustermode collectors are reaching their limits in larger environments are emerging. In their current form, the collectors are not able to finish collecting and storing all the data within 60 seconds. Therefore, we have optimized the collectors for volume and _col_snapshot _for experimental purposes. …