4 * If the option -I or --htmlinfo was used, the OK output would be
5 printed two times. This has been fixed so the OK output is now
6 correct for HTML output.
7 * A bug was fixed for config file parsing, if the plugin was used in
8 local mode (i.e. no hostname specified). Reported by David Jones.
13 * Added a new option '--hide-servicetag' to censor the servicetag in
14 the plugin output. A corresponding config file option
15 'output_hide_servicetag' was created. Thanks to Sebastian Ahndorf
17 * SNMP: Fixed bug in amperage probes perfdata output when one or more
18 PSUs has lost power, which could cause garbled graphs.
23 * Added new blacklisting keyword 'pdisk_foreign' to suppress warnings
24 about foreign physical disks
25 * SNMP: Get the controller number right when reporting issues with the
26 controller cache battery
27 * Various minor tweaks and bugfixes in the PNP4Nagios template
32 * Major overhaul of the perfdata code. This includes API breakage and
34 - Probes were not sorted correctly
35 - Voltage data was not included
36 * The PNP template check_openmanage.php has been redone to work with
37 the changes in perfdata output from the plugin
38 * A new option '--legacy-perfdata' will make the the plugin output the
39 performance data in the old format
40 * Added support for a configuration file
41 * Added manual page for the configuration file
42 * If using html output, URLs will now open in a new window
43 * Added a compatibility fix for OMSA 6.5.0, related to performance
44 data for amperage probes when the plugin is used in local
45 mode. Thanks to Benedikt Meyer for a patch.
50 * Added workaround for a rare condition in which blade detection fails
51 because the chassis IDs for the blade and interconnect board have
52 switched places in the BaseBoardType SNMP table.
57 * A regression wrt. non-certified drives were fixed. The plugin failed
58 to identify non-certified physical drives via SNMP.
59 * Added the ability to blacklist non-certified drives with the
60 'pdisk_cert' blacklisting keyword.
65 * Fixed typo in help output
66 * SD card check is now included if the parameter '--only chassis' is
68 * The plugin will issue a proper warning if a physical drive is
69 uncertified, instead of an unspecified warning. One or more
70 uncertified drives will make the controller go into a non-critical
72 * Slightly improved reporting of fan status
73 * Exit with value 3 (unknown) if printing debug, help or version
74 info. This is considered best practice for Nagios plugins.
75 * Workaround added for logical SAS connectors to external storage
76 enclosures, when using check_openmanage in local mode with OMSA
77 6.4.0 or later versions. The output from omreport could contain
78 lines that the plugin was unable to parse, which would lead to
84 * Fix counting of components when blacklisting is used. Components
85 should be counted even if blacklisted
86 * Added some unsupported vdisk types to the list. The OMSA MIB
87 identifies these, but lists them as unsupported.
88 * Added option '-B' or '--show-blacklist' to show any blacklistings in
90 * Fixed a bug for checking voltage probes, if the reading is missing
92 * Fixed a regression bug for a power monitoring corner case.
97 * Added more robustness wrt. values from OMSA obtained via SNMP, to
98 avoid internal errors where non-important values are missing.
103 * A few compatibility fixes for OMSA 6.4.0 were added.
108 * Added support for IPv6 when checking via SNMP. IPv6 can be turned on
109 with the option '-6' or '--ipv6'. The default is IPv4 if the option
111 * Added support for TCP when checking vis SNMP. The option '--tcp' can
112 be used to turn on TCP. The default transport protocol is UDP if the
113 option is not present.
114 * The mode of operation (local or SNMP) is shown in the debug
115 output. If SNMP is used, the debug output will also show the SNMP
116 protocol version, IP version and transport protocol (UDP or TCP).
117 * Amperage probe status via SNMP is of type "probe status", not
118 regular status. This has been fixed.
119 * Massive overall robustness improvements to handle OMSA bugs where
120 some information from OMSA is missing.
121 * Memory module enumeration via SNMP changed somewhat to reflect
122 enumeration provided by omreport. This ensures that the plugin's
123 output is identical in SNMP or local mode wrt. dimms IDs.
124 * Fan enumeration via SNMP changed somewhat to reflect enumeration
125 provided by omreport. This ensures that the plugin's output is
126 identical in SNMP or local mode wrt. fan IDs.
131 * Included new check for SD cards. Newer servers such as the R710 can
132 have SD cards installed, these should be monitored. The SD card
133 check is on by default. A new blacklisting keyword 'sd' has been
134 added. The SD card check can be turned off with '--check sdcard=0'.
135 * Handle special cases where power monitoring capability is disabled
136 due to non-redundant and/or non-instrumented power supplies.
137 * For physical disks probed via SNMP, check that values for vendor,
138 product ID and capacity is available before attempting to display
140 * If a physical disk is in sufficiently bad condition, the vendor
141 field reported by OMSA may be empty. The plugin now handles this
142 situation without throwing an internal error.
148 * Storage is no longer allowed to be absent. If the plugin doesn't
149 find a storage controller, it will give an alert. For diskless
150 systems or servers without a Dell controller that OMSA recognizes
151 you will now have to specify '--no-storage' or '--check storage=0'
153 * Report the system revision (if applicable) wherever the model name
154 is printed. E.g. "PowerEdge 2950 III" instead of "PowerEdge 2950".
155 * Small change in search path for omreport: The new location for OMSA
156 6.2.0 and later on Linux will be attempted first.
157 * Small bugfix for the '--check' parameter, if the argument is a
158 filename. The file could not contain a linebreak, this has been
160 * Added an option '--no-storage', which is equivalent to the general
161 option '--check storage=0'.
167 * If a physical disk is a hot spare, display this information in
169 * Report the bus protocol (e.g. SAS, SATA) and media type (e.g. HDD,
170 SDD) for physical disks in the debug output, if applicable
171 * Minor fix for 100GB physical disks, write "100GB" instead of "99GB"
172 * SNMP: Use new features of OMSA 6.3.0 to display occupied and total
173 slots in storage enclosures, if applicable. This information is not
174 available with omreport and check_openmanage will not display this
176 * SNMP: Added new processor IDs from the OMSA 6.3.0 MIBs
177 * SNMP: Use connection tables in a proper way to determine controller
178 and enclosure IDs, for use with physical disks and enclosure
179 components (fan, temp sensors etc.). This fixes a long standing bug
180 for servers with more than one controller, if checked via SNMP.
181 * SNMP: Use the nexus ID as last resort to find the controller for
182 physical disks. Workaround for older, broken OMSA versions.
183 * SNMP: Identify enclosures (e.g. '2:0:0') properly so that the
184 reporting with SNMP corresponds to the same report with omreport.
185 * SNMP: added a couple of workarounds for pre-historic OMSA versions
191 * More fine-grained reporting of temperature warnings for enclosure
193 * Max/min temperature limits for enclosure temp probes are reported in
195 * Report enclosure temperature probes that are "Inactive" as ok
196 * Don't try to print out the reading of enclosure temperature probes
197 if the reading doesn't exist or is not an integer
198 * Report enclosure EMMs that are "Not Installed" as ok, instead of
200 * Corrected typo in the PNP4Nagios template
206 * Remove reporting of which controller a logical drive is "attached"
207 to, since this information can't be reliably extracted via SNMP.
208 * Avoid collecting Lun ID via SNMP for virtual disks, we don't use it
209 * Report total memory and number of dimms in the ok output
210 * Difference in reporting if amperage probes have discrete readings
211 * Workaround for broken amperage probes
212 * Added workaround for bad temperature probes that yields no reading
214 * Get OMSA version via SNMP slightly more efficiently
220 * Added robustness for received SNMP values that are not defined in
221 the MIB. Instead of throwing a perl warning when this happens, the
222 plugin will not report the undefined value.
223 * Defined "Replacing" as a defined state for physical disks in SNMP
224 mode, even though this state is not defined in the MIB. It is
225 reported as such by omreport.
226 * Physical disk brand/model is now reported when the state of the disk
227 is "Rebuilding" or "Replacing".
228 * The state of a physical disk is reported in parentheses when
229 predictive failure is detected. It is useful to know if a disk is
230 online, offline, spare or even failed when predictive failure is
232 * Handling of physical disk predictive failure has been improved
234 * Refactoring of the perfdata code. In conformance with the plugin
235 development guidelines, the UOM (unit of measure) previously
236 reported in the perfdata output has been removed.
237 * The -p or --perfdata option now
238 takes an optional agrument 'minimal', which triggers shorter names
239 for the perfomance data labels. This shortens the output and is a
240 workaround for systems where the amount of output exceeds the 1024
242 * The PNP4Nagios template has been updated. Users of check_openmanage
243 and PNP4Nagios are advised to upgrade. This version of
244 check_openmanage needs the new template.
245 * Lots of other small improvements and updates.
251 * New option '--use-get_table' is added as a workaround for SNMPv3 on
252 Windows using net-snmp. This option will make check_openmanage use
253 the Net::SNMP function get_table() instead of get_entries() to
254 collect information via SNMP.
255 * Include a blacklisting option 'ctrl_pdisk' which takes the
256 controller number as argument. This blacklisting option only works
257 with omreport and is a workaround for broken disk firmwares that
258 contain illegal XML characters. These characters makes openmanage
259 barf and exit with an error. Patch from Bas Couwenberg.
260 * If the blacklisting keyword "all" is supplied for a component type,
261 that component type is not checked at all, i.e. the commands are
262 never executed. This will make check_openmanage execute faster if
263 blacklisting is heavily used.
264 * Option '--htmlinfo' now has a shorter equivalent '-I'
265 * The option '--short-state' now has a shorter equivalent '-S'
271 * Fixed an SNMP bug where the plugin didn't handle OID indexes that
272 were not sequential. Thanks to Gianluca Varenni for reporting.
273 * Fixed an SNMP bug when checking old hardware such as the PE 2650 and
274 PE 750. The controller id for physical drives were collected and
275 displayed incorrectly. This release uses an additional OID to fetch
276 this info, which would otherwise be unavailable. Thanks to Gianluca
277 Varenni for reporting this bug.
278 * Should use %snmp_probestatus, not %snmp_status when checking the
279 status of voltage probes. Thanks to Ken McKinlay for a patch.
280 * Fix when identifying blades via SNMP with very old OMSA
281 versions. Patch from Ken McKinlay.
282 * Better way of finding the ID of physical drives via SNMP
288 * Added support for storport driver version for controllers, only
289 applicable on servers running Windows. A new blacklisting keyword
290 for suppressing storport driver messages was added.
291 * The "all" keyword in blacklisting is now case insensitive.
292 * More fine-grained reporting in the rare case where a controller
293 battery fails during learning and charging states.
294 * New improved way of reporting perl warnings during execution of the
301 * Fix for path to omreport on Linux with OMSA 6.2.0
302 * A couple of other small fixes
308 * Fix for undefined device name for logical drives (thanks to Pontus
310 * Fixed a bug in the PNP4Nagios template, that prevented the template
311 from working with PNP4Nagios 0.6. Thanks to the PNP4Nagios team for
318 * CPU type, family etc. are now reported in case of a CPU failure (and
320 * The debug output now reports Openmanage version and plugin version
326 * New option '-a' or '--all' turns on checking of everything
327 * The manual page (POD info) is removed from the script and is now in
328 a separate file, to make check_openmanage fully ePN compatible
329 * ePN is no longer disabled by default, check_openmanage no longer has
330 an opinion on whether it should run under ePN or not
331 * The '-m' or '--man' option is no longer available
332 * The option '-v' or '--verbose' is renamed to '-d' or '--debug',
333 which makes more sense wrt. its usage
334 * The '-g' or '--global' option is removed. Checking the global health
335 status is now default if applicable
336 * Checking intrusion detection is now turned on by default
337 * The obsolete option '--snmp' is removed
338 * The option '--state' now has a shorter equivalent '-s'
339 * The basename stuff and options '--only-critical' and
340 '--only-warning' are now replaced by an option '--only'
341 * If plugin is run by Nagios, redirect stderr to stdout
342 * Added option '--omreport', that lets the user specify the full path
343 to the omreport binary
344 * Added non-8bit-legacy default search paths for omreport.exe for
346 * Minor changes to the plugin output, for consistency
347 * New blacklisting keyword 'bat_charge' disables warning messages
348 related to controller cache battery charging. Thanks to Robert
349 Heinzmann for a patch.
350 * For blacklisting, the component ID kan now be "ALL", in which all
351 components of that type is blacklisted.
352 * Man page is moved to manual section 8
358 * Fixed a bug that could cause errors and weird results when checking
359 cooling devices (fans) via SNMP. Thanks to Ken McKinlay for spotting
360 this bug and reporting it.
365 * For failed physical drives, check_openmanage will now output the
366 drive's vendor, model and size in GB or TB.
371 * The -s|--snmp option was redundant and no longer does anything. SNMP
372 is triggered automatically if the -H|--hostname option is
373 present. The -s|--snmp option is kept for compatibility, but has no
379 * Added support for performance data (temperatures) from attached
380 storage enclosures such as the MD1000
385 * Fixed a regression in the --htmlinfo option when it is not supplied
391 * New option --htmlinfo adds clickable HTML links in the plugin's
397 * Fixed a regression bug in CPU and power supply reporting that only
398 affects verbose output
399 * If blacklisting is used, the global health check (via the --global
400 option) is now negated. Checking the global health doesn't make
401 sense when one or more components is blacklisted. Thanks to Rene
402 Beaulieu for reporting this bug
403 * The PNP4Nagios template is now included in the tarball and zip
409 * Improved memory error reporting, when using omreport
410 * Collect performance data from pwrmonitoring (amperage probes) that
411 were previously ignored when using omreport
416 * Improved memory error reporting, when using SNMP
417 * Other small ehnancements
422 * The plugin is now compatible with the Nagios embedded Perl
423 interpreter (ePN) in theory. However, the plugin will not not use
424 ePN by default. We don't want any "accidents".
425 * License is now GPLv3, previously only specified as "GPL"
426 * New options --only-critical and --only-warning. With these options
427 the plugin will only print critical or warning alerts, respectively.
428 * Bugfixes and speed enhancements in the storage section, when
429 checking enclosure components via omreport
430 * The --okinfo option is now less verbose and more to the point
431 * Lots of code refactoring for readability, maintainability and
437 * Fixed a bug in the storage section, when checking controllers. This
438 is an obscure bug that only manifests itself in the odd case where a
439 server has multiple controllers, and one of the controllers are
440 missing some of the OIDs, in which case these OIDs will be missing
441 for the other controllers as well. The change is minor and only
442 includes using get_table() instead of get_entries() to collect the
443 SNMP result. Thanks to Stephan Bovet for reporting this bug.
448 * The --perfdata option can now optionally take an argument
449 "multiline", which makes the plugin produce multiline performance
450 data output in a Nagios 3.x way. Not really needed, but the plugin
452 * Added comment within the 10 first lines to disable the nagios
453 embedded perl (ePN) interpreter by default for Nagios 3.x
454 * Improvements in the performance data output. Units are now included
459 * Added --global option, which turns on checking of everything. If
460 used with SNMP, the global system health status is also probed, to
461 protect the user against bugs in the plugin. If used with omreport,
462 the overall chassis health is used.
463 * Support for SNMP version 3
464 * New check added: esmhealth. This checks the overall health of the
465 ESM log, i.e. the fill grade. More than 80% means a warning message
466 * Fixed alert log reporting to use the same format as for the ESM log
467 * Output messages are now sorted by severity
468 * Minor changes in how out-of-date controller firmware/driver is
470 * Code refactoring and cleanup
475 * Use "omreport about" to collect OMSA version. Slightly faster than
476 "omreport system version". This should give a small speedup in
477 certain configurations
478 * Fixed typo in output when a logical drive is rebuilding. Thanks to
479 Andreas Olsson for reporting
480 * Improved reporting of ESM log content
481 * Added omreport.sh as alternate omreport path
482 * Lots of other small fixes and enhancements
484 Plus: A few changes to make the plugin work with old PowerEdge models
485 (e.g. 2550, 2450) and/or old OMSA versions (e.g. version 4.5):
487 * Use the chassisModelName OID to determine if SNMP works (instead of
489 * No longer require a response when checking baseboard type via
490 SNMP. If there is no response, we assume that we're not dealing with
493 Thanks to Christian McHugh for help with testing and debugging this
499 * Use 'omreport system operatingsystem' to collect OS info, instead of
500 'omreport system' version which is incredibly slow. This should speed
501 things up in certain configurations.
502 * A few speedups, don't collect information that isn't needed
508 * New option --linebreak to specify the separator between line in case
510 * Added support for 64bit Windows. Thanks to Patrick Hemmen for a patch
511 * [Patrick Hemmen] Added install.bat for Windows installation
512 * [Patrick Hemmen] Improvements on install.sh. Will now install in
513 /usr/lib64 for x86_64
514 * RPMs are now architecture dependent, because of different libdir
519 * New option -o|--ok-info to display extra information when everything
520 is ok. The plugin can now display storage firmware and driver info,
521 DRAC and BMC firmware, and OMSA version
522 * Support for setting custom minimum temperature thresholds via the
523 -c|--critical and -w|--warning options
524 * Better and more detailed temperature error reporting
525 * Bugfix in the amperage report (including performance data). The
526 plugin now takes into account the correct unit and measurement for
527 amperage probes (other than watts)
528 * New option --port lets the user specify the remote SNMP port number
533 * Regression fix: Use the older Processor Device SNMP OIDs for older
534 PowerEdge models, that don't have the new Processor Device Status
535 OIDs. Thanks to Nicole Hähnel for reporting this bug.
536 * Default output (when there are no alerts) now shows RAC firmware,
537 BMC firmware, info about controllers and enclosures (firmware,
543 * Regression fix: Ignore unoccupied CPU slots with SNMP probing. This
544 fixes a bug introduced in versjon 3.2.1, which would output
545 something like this if one or more CPU slots were empty: CPU 1 needs
551 * Use Processor Device Status Table OIDs instead of Processor Device
552 Table when checking CPUs via SNMP
553 * Bugfix: don't report throttled CPUs as warnings when checking via
554 SNMP (same as for checking locally)
559 * New options --state and --short-state for displaying service state
561 * Lots of small fixes for code readability and maintainability
566 * Support for running natively on Windows (using omreport.exe). Thanks
567 to Peter Jestico for a patch.
568 * Support for compiled Windows version, i.e. check_openmanage.exe is
569 now a legal script name.
570 * Exit with error if script basename is illegal/unknown
571 * Various small fixes
576 * Use of alternate basenames for checking only one class of components
577 * Added support for checking the ESM log via SNMP
578 * Code refactoring for robustness and maintainability
579 * Numerous small fixes and enhancements
580 * Added install script in distribution tarball
585 * The script no longer aborts if it can't get system information via
586 SNMP. Give a warning instead, as this is not a critical error
587 * Increased robustness when checking controllers
592 * Man page fix in the 'check' section. Thanks to Ansgar Dahlen for
594 * Allow invalid command error from 'omreport chassis pwrmonitoring'
595 * Various small fixes
600 * Use unique IDs for storage components with regard to blacklisting,
601 which means that the blacklisting API has changed
602 * Added checks for storage components: connectors (channels),
603 enclosures, enclosure fans, enclosure power supplies, enclosure
604 temperature probes and enclosure management modules (EMMs)
605 * Improved verbose output
606 * New option -t|--timeout for setting the plugin timeout
607 * New option -w|--warning for setting custom temperature warning
609 * New option -c|--critical for setting custom temperature critical
611 * Option --check can no longer be specified in its short form (-c)
612 * Code cleanup and improvements
617 * The workaround for the OMSA bug introduced in OMSA 5.5.0 didn't take
618 multiple controllers into account. This has been fixed.
623 * New option -i|--info prefixes all alerts with the service tag
624 * New option -e|--extinfo gives and extra line of output in case of an
625 alert (model and service tag)
626 * New option --postmsg lets the user specify a post message string,
627 with info such as model, service tag etc.
628 * Options -b|--blacklist and -c|--check can now be specified multiple
629 times (actually quite useful)
634 * Slightly improved output for alerts on logical drives (vdisks)
635 * Now shows a rebuilding physical disk as a warning, as this is
636 usually accompanied by a degraded vdisk. Previous versions didn't
637 show this at all (omreport classifies it as "OK").
642 * Slightly improved output for charging controller batteries
647 * Bugfix for reporting physical drives with predictive failure (both
653 * Fix bug in option handling (ambiguous options)
654 * Slightly improved output if checking the storage subsystem is turned
656 * Don't complain if there are no logical drives. This is OK. Thanks to
657 Jamie Henderson for reporting this
662 * Fix bug in SNMP status level table
667 * Added workaround for a BUG introduced in OpenManage 5.5.0. OM
668 sometimes adds a newline in the controller driver version name,
669 which leads to problems parsing the output. Thanks to Hiren Patel
670 for bringing this to my attention.
675 * (snmp) Improved handling of cases where OM is not working properly
680 * Fixed issue where controller number for physical disks can't be
681 established via SNMP (now identifies as controller no. -1)
686 * Correctly identifies and reports error condition in which OpenManage
687 has stopped working (it happens)
693 * Same options for checking, blacklisting etc. supported with SNMP
694 * Same output with SNMP as with NRPE
699 * Collects performance data if the option '-p' or '--perfdata' is
701 * New blacklisting directives ctrl_fw and ctrl_driver
702 added. Suppresses the "special" warning messages concerning outdated
703 controller firmware and driver. Useful if you can't or won't
709 * Fix getting system model and serial number for newer blades
714 * Three new checks added:
715 - System battery probes (typical CMOS battery). Newer poweredge
717 - Power consumption monitoring (if the server supports it)
718 - ESM log, with same functionality as the alert log check. Disabled
724 * Internal refactoring: use ssv-formatted output from openmanage,
725 resulting in slightly faster execution and increased robustness.
726 * If /usr/bin/omreport doesn't exist, try
727 /opt/dell/srvadmin/oma/bin/omreport.
728 * Allow for no instrumented/redundant power supplies. Needed on
729 low-end poweredge models and blades.
734 * Openmanage reports non-critical warning about throttled CPUs on new
735 hardware models. Most og us use ondemand CPU frequency scaling (with
736 throttled CPUs as a result). This specific non-critical warning (CPU
737 Throttled) is ignored from now on.
738 * Remove superfluous Celcius sign when reporting temperatures.
743 * Accommodate blade systems with no fans or powersupplies, i.e. accept
744 errors from omreport when trying to check fans or powersupplies on
746 * Accommodate newer hardware with slightly different omreport
747 options. Use the newer options if they exist. Not really necessary
748 yet, but deprecated options may be removed in future versions of
754 * When everything is OK, check_openmanage now outputs the same info as
755 Gerhard Lausser's excellent check_hpasm plugin does for HP servers:
757 OK - System: 'poweredge 2850', S/N: 'XXXXXXX', ROM: 'A06
758 10/03/2006', hardware working fine, 2 logical drives, 4 physical