NetApp Monitoring

Checking for runaway processes

Feb 23, 2018 2 min read Check NetApp-ZAPI NetApp Monitoring process-check v3.10.2

We are having a new check proposed by one of our customers who had an issue with a single process eating up all the CPU time on a filer. It’s easy to identify the culprit once you are on the command-line of the filer (priv-mode) by issuing the ps command. To automate that sort of monitoring and getting an alarm …

Update-Mode shows performance gain by collector-architecture

Feb 21, 2018 1 min read Check NetApp-ZAPI NetApp Monitoring update mode v3.10.2

The recently implemented Update-Mode for the getters lead to a reduction in the monitoring systems CPU load between 30 and 50% reported one of our customer, a large automotive company from Germany. The background of this significant performance gain is their unusual configuration system which run the getter for every …

Update function and Grafana compatibility

Feb 17, 2018 2 min read Check NetApp-ZAPI NetApp Monitoring update mode v3.10.2

I can recommend the unstable release 3.10.1_10 to all experimenting monitoring admins. Above all, this has the character of a technology preview. Included are two major innovations: The update mode for all getters The option to output Grafana compatible performance data even for status checks. Update Mode The update …

Detecting unused LUNs

Jan 22, 2018 1 min read Check NetApp-ZAPI LUN NetApp Monitoring

Mapped and online LUNs can remain unused in a filer because a connected initiator is missing. With the NetApp-CLI, such LUNS can be detected by following these steps: First, we look for initiator groups for which **all **initiators are not logged in. node01::> lun igroup show -igroup \*group06\* -instance Vserver …

How to deal with changing causes

Nov 29, 2017 4 min read Check NetApp-ZAPI NetApp Monitoring rm_ack usp

One of the most innovative features of check_netapp_pro is probably the option -rm_ack, which solves the problem of errors not being alarmed for confirmed overall checks. These errors will not be alarmed actively and can therefore be easily overlooked. This switch might soon be replaced by another, more sophisticated …

Monitoring Latency and Transfer Rate per Aggregate

Nov 15, 2017 1 min read Check NetApp-ZAPI aggregate aggregate latency aggregate throughput aggregate transferrate netapp aggregate performance NetApp Monitoring v3.10.0

Our new check PerfAggregate measures various performance counters per aggregate. Currently, the following counters can be monitored: aggr_throughput (B/s) latency (µs) read_data (B/s) total_transfers (/s) total_transfers_hdd (/s) total_transfers_ssd (/s) transmit_failure (-) user_read_blocks (/s) user_read_latency (us) …

Checking the Metro-Cluster Configuration

Nov 7, 2017 1 min read Check NetApp-ZAPI check sfo check storage failover check_netapp_takeover metrocluster configuration NetApp Monitoring sfo storage failover v3.10.0

The newly published check_netapp_takeover has been enhanced to check the metro-cluster configuration too. All of these checks are done in one service-check. Example:``` $ check_netapp_takeover.pl -H filer01f NETAPP_TAKEOVER OK node filer0101 connected (ha mode): The the storage failover facility is enabled. Takeover of …

SnapDrive troubles with Snapshots Check

Nov 7, 2017 1 min read Check NetApp-ZAPI NetApp Monitoring Snapshots v3.10.0

If you are checking your filer with the Snapshots plugin, you may get an error like: Can't call method "attr" on an undefined value at /opt/.../lib/nagios/plugins/check_netapp_pro/checks/Snapshots.pm line 289. The reason for this is, that the SnapDrive application creates flexclones, which are created, …

Trending the Inode-Usage

Oct 23, 2017 1 min read Check NetApp-ZAPI inode monitoring inode-usage NetApp Monitoring trend UsageTrend v3.10.0

The Check UsageTrend can be used to interpolate the usage of a volume or aggregate into the future and send an alarm if it would get filled up within a given time. Until now this check measured the size in bytes. From now on we have an additional argument ‑‑what=size|inodes. Setting this argument to inodes tells the …

Monitoring “inodes-full”

Oct 20, 2017 2 min read Check NetApp-ZAPI en free inodes inode alerting inode monitoring NetApp Monitoring Usage used inodes

Obviously, one of the first plugins that we implemented was designed to monitor volumes and aggregates and their respective usage (“how many bytes are still available”). This plugin is called Usage and from the get-go it could monitor disk space usage in bytes as well as the number available (or used) …