The first stable release (v1.1.0) of our brand-new Check E-Series product brings an automated health check for E-Series nodes.
check_eseries_health will run the health checks regularly and automatically report any errors found.
Example
$ ./check_eseries_health --host=10.1.1.125 --system-id 1 [...]
NETAPP ESERIES HEALTH CRITICAL - 16 health checks checked, 1 CRITICAL
StorageGRID-XG-102.missingVolumes: notCompleted (CRITICAL)
StorageGRID-XG-102.integratedHealthCheck: ok
StorageGRID-XG-102.dbSubRecordsValidation: ok
StorageGRID-XG-102.melEventCheck: ok
StorageGRID-XG-102.validPassword: ok
StorageGRID-XG-102.failedDrivesPresent: ok
StorageGRID-XG-102.exclusiveOperations: ok
StorageGRID-XG-102.driveCheck: ok
StorageGRID-XG-102.nvsramDisableCfwDownloads: ok
StorageGRID-XG-102.hotSparesInUse: ok
StorageGRID-XG-102.controllerStatusOptimal: ok
StorageGRID-XG-102.volumeGroupsComplete: ok
StorageGRID-XG-102.objectGraphSyncCheck: ok
StorageGRID-XG-102.configurationDatabaseCheck: ok
StorageGRID-XG-102.spmDatabaseVerification: ok
StorageGRID-XG-102.storageDeviceAccessible: ok
Technical Details
Check E-Series is a monitoring plugin for Nagios-compatible system-monitoring frameworks such as Nagios XI, Nagios Core, Icinga, OP5 Monitor, or Shinken. It is written in Go and distributed as a ready-to-run, precompiled binary with no external dependencies.
Update
We released an important bugfix, v1.1.1, on the 25th of August.