]> git.uio.no Git - check_openmanage.git/blame - CHANGES
jau
[check_openmanage.git] / CHANGES
CommitLineData
c9ffa70e 13.6.3 2010-12-13
2------------------
3
4* A few compatibility fixes for OMSA 6.4.0 were added.
5
63.6.2 2010-11-25
d57c3cb0 7------------------
8
9* Added support for IPv6 when checking via SNMP. IPv6 can be turned on
10 with the option '-6' or '--ipv6'. The default is IPv4 if the option
11 is not present.
12* Added support for TCP when checking vis SNMP. The option '--tcp' can
13 be used to turn on TCP. The default transport protocol is UDP if the
14 option is not present.
df17fc3c 15* The mode of operation (local or SNMP) is shown in the debug
16 output. If SNMP is used, the debug output will also show the SNMP
17 protocol version, IP version and transport protocol (UDP or TCP).
c9ffa70e 18* Amperage probe status via SNMP is of type "probe status", not
19 regular status. This has been fixed.
20* Massive overall robustness improvements to handle OMSA bugs where
21 some information from OMSA is missing.
22* Memory module enumeration via SNMP changed somewhat to reflect
23 enumeration provided by omreport. This ensures that the plugin's
24 output is identical in SNMP or local mode wrt. dimms IDs.
25* Fan enumeration via SNMP changed somewhat to reflect enumeration
26 provided by omreport. This ensures that the plugin's output is
27 identical in SNMP or local mode wrt. fan IDs.
d57c3cb0 28
b49f8fda 293.6.1 2010-11-02
2254d3c5 30------------------
31
32* Included new check for SD cards. Newer servers such as the R710 can
33 have SD cards installed, these should be monitored. The SD card
34 check is on by default. A new blacklisting keyword 'sd' has been
35 added. The SD card check can be turned off with '--check sdcard=0'.
36* Handle special cases where power monitoring capability is disabled
37 due to non-redundant and/or non-instrumented power supplies.
38* For physical disks probed via SNMP, check that values for vendor,
39 product ID and capacity is available before attempting to display
40 those values.
41* If a physical disk is in sufficiently bad condition, the vendor
42 field reported by OMSA may be empty. The plugin now handles this
43 situation without throwing an internal error.
44
45
84079ac7 463.6.0 2010-08-30
47------------------
48
49* Storage is no longer allowed to be absent. If the plugin doesn't
50 find a storage controller, it will give an alert. For diskless
51 systems or servers without a Dell controller that OMSA recognizes
52 you will now have to specify '--no-storage' or '--check storage=0'
53 to work around this.
54* Report the system revision (if applicable) wherever the model name
55 is printed. E.g. "PowerEdge 2950 III" instead of "PowerEdge 2950".
56* Small change in search path for omreport: The new location for OMSA
57 6.2.0 and later on Linux will be attempted first.
58* Small bugfix for the '--check' parameter, if the argument is a
59 filename. The file could not contain a linebreak, this has been
60 fixed.
61* Added an option '--no-storage', which is equivalent to the general
62 option '--check storage=0'.
63
64
5ee885c1 653.5.10 2010-07-14
66------------------
67
68* If a physical disk is a hot spare, display this information in
69 the debug output
70* Report the bus protocol (e.g. SAS, SATA) and media type (e.g. HDD,
71 SDD) for physical disks in the debug output, if applicable
72* Minor fix for 100GB physical disks, write "100GB" instead of "99GB"
73* SNMP: Use new features of OMSA 6.3.0 to display occupied and total
74 slots in storage enclosures, if applicable. This information is not
75 available with omreport and check_openmanage will not display this
76 info in local mode.
77* SNMP: Added new processor IDs from the OMSA 6.3.0 MIBs
78* SNMP: Use connection tables in a proper way to determine controller
79 and enclosure IDs, for use with physical disks and enclosure
80 components (fan, temp sensors etc.). This fixes a long standing bug
81 for servers with more than one controller, if checked via SNMP.
82* SNMP: Use the nexus ID as last resort to find the controller for
83 physical disks. Workaround for older, broken OMSA versions.
84* SNMP: Identify enclosures (e.g. '2:0:0') properly so that the
85 reporting with SNMP corresponds to the same report with omreport.
86* SNMP: added a couple of workarounds for pre-historic OMSA versions
87
88
9e56606a 893.5.9 2010-06-29
90------------------
91
92* More fine-grained reporting of temperature warnings for enclosure
93 temperature probes.
94* Max/min temperature limits for enclosure temp probes are reported in
95 the debug output
96* Report enclosure temperature probes that are "Inactive" as ok
97* Don't try to print out the reading of enclosure temperature probes
98 if the reading doesn't exist or is not an integer
99* Report enclosure EMMs that are "Not Installed" as ok, instead of
100 critical
101* Corrected typo in the PNP4Nagios template
102
103
49958a98 1043.5.8 2010-06-17
105------------------
106
107* Remove reporting of which controller a logical drive is "attached"
108 to, since this information can't be reliably extracted via SNMP.
9e56606a 109* Avoid collecting Lun ID via SNMP for virtual disks, we don't use it
110* Report total memory and number of dimms in the ok output
111* Difference in reporting if amperage probes have discrete readings
112* Workaround for broken amperage probes
113* Added workaround for bad temperature probes that yields no reading
49958a98 114 in SNMP mode
9e56606a 115* Get OMSA version via SNMP slightly more efficiently
49958a98 116
117
4723fc86 1183.5.7 2010-03-19
88d0e634 119------------------
120
121* Added robustness for received SNMP values that are not defined in
122 the MIB. Instead of throwing a perl warning when this happens, the
123 plugin will not report the undefined value.
124* Defined "Replacing" as a defined state for physical disks in SNMP
125 mode, even though this state is not defined in the MIB. It is
126 reported as such by omreport.
127* Physical disk brand/model is now reported when the state of the disk
128 is "Rebuilding" or "Replacing".
4f5eca49 129* The state of a physical disk is reported in parentheses when
88d0e634 130 predictive failure is detected. It is useful to know if a disk is
131 online, offline, spare or even failed when predictive failure is
132 reported.
133* Handling of physical disk predictive failure has been improved
134 overall.
135* Refactoring of the perfdata code. In conformance with the plugin
136 development guidelines, the UOM (unit of measure) previously
137 reported in the perfdata output has been removed.
138* The -p or --perfdata option now
139 takes an optional agrument 'minimal', which triggers shorter names
140 for the perfomance data labels. This shortens the output and is a
141 workaround for systems where the amount of output exceeds the 1024
142 char limit of NRPE.
143* The PNP4Nagios template has been updated. Users of check_openmanage
144 and PNP4Nagios are advised to upgrade. This version of
145 check_openmanage needs the new template.
88f7c874 146* Lots of other small improvements and updates.
88d0e634 147
148
77ae1787 1493.5.6 2010-02-23
150------------------
151
152* New option '--use-get_table' is added as a workaround for SNMPv3 on
153 Windows using net-snmp. This option will make check_openmanage use
154 the Net::SNMP function get_table() instead of get_entries() to
155 collect information via SNMP.
156* Include a blacklisting option 'ctrl_pdisk' which takes the
157 controller number as argument. This blacklisting option only works
158 with omreport and is a workaround for broken disk firmwares that
159 contain illegal XML characters. These characters makes openmanage
160 barf and exit with an error. Patch from Bas Couwenberg.
161* If the blacklisting keyword "all" is supplied for a component type,
162 that component type is not checked at all, i.e. the commands are
163 never executed. This will make check_openmanage execute faster if
164 blacklisting is heavily used.
165* Option '--htmlinfo' now has a shorter equivalent '-I'
166* The option '--short-state' now has a shorter equivalent '-S'
167
168
a37a38e4 1693.5.5 2010-01-22
75c0ce0f 170------------------
171
172* Fixed an SNMP bug where the plugin didn't handle OID indexes that
a37a38e4 173 were not sequential. Thanks to Gianluca Varenni for reporting.
75c0ce0f 174* Fixed an SNMP bug when checking old hardware such as the PE 2650 and
175 PE 750. The controller id for physical drives were collected and
176 displayed incorrectly. This release uses an additional OID to fetch
a37a38e4 177 this info, which would otherwise be unavailable. Thanks to Gianluca
178 Varenni for reporting this bug.
179* Should use %snmp_probestatus, not %snmp_status when checking the
180 status of voltage probes. Thanks to Ken McKinlay for a patch.
181* Fix when identifying blades via SNMP with very old OMSA
182 versions. Patch from Ken McKinlay.
183* Better way of finding the ID of physical drives via SNMP
184
75c0ce0f 185
ba53af74 1863.5.4 2010-01-13
187------------------
188
189* Added support for storport driver version for controllers, only
190 applicable on servers running Windows. A new blacklisting keyword
191 for suppressing storport driver messages was added.
879782a8 192* The "all" keyword in blacklisting is now case insensitive.
ba53af74 193* More fine-grained reporting in the rare case where a controller
07acfb60 194 battery fails during learning and charging states.
ba53af74 195* New improved way of reporting perl warnings during execution of the
196 plugin.
197
198
1993.5.3 2009-12-17
200------------------
201
202* Fix for path to omreport on Linux with OMSA 6.2.0
203* A couple of other small fixes
204
205
cd34f3e7 2063.5.2 2009-11-17
207------------------
208
209* Fix for undefined device name for logical drives (thanks to Pontus
210 Fuchs for a patch)
211* Fixed a bug in the PNP4Nagios template, that prevented the template
212 from working with PNP4Nagios 0.6. Thanks to the PNP4Nagios team for
213 the fix.
214* Other small fixes
215
c9cde537 2163.5.1 2009-10-22
d4c29457 217------------------
218
219* CPU type, family etc. are now reported in case of a CPU failure (and
220 in the debug output)
221* The debug output now reports Openmanage version and plugin version
222
223
c10216b6 2243.5.0 2009-10-13
a4e4d852 225------------------
226
227* New option '-a' or '--all' turns on checking of everything
228* The manual page (POD info) is removed from the script and is now in
229 a separate file, to make check_openmanage fully ePN compatible
230* ePN is no longer disabled by default, check_openmanage no longer has
231 an opinion on whether it should run under ePN or not
232* The '-m' or '--man' option is no longer available
02babec6 233* The option '-v' or '--verbose' is renamed to '-d' or '--debug',
234 which makes more sense wrt. its usage
a4e4d852 235* The '-g' or '--global' option is removed. Checking the global health
236 status is now default if applicable
237* Checking intrusion detection is now turned on by default
5cb5973b 238* The obsolete option '--snmp' is removed
a4e4d852 239* The option '--state' now has a shorter equivalent '-s'
240* The basename stuff and options '--only-critical' and
241 '--only-warning' are now replaced by an option '--only'
242* If plugin is run by Nagios, redirect stderr to stdout
02babec6 243* Added option '--omreport', that lets the user specify the full path
244 to the omreport binary
5cb5973b 245* Added non-8bit-legacy default search paths for omreport.exe for
246 Windows boxen
247* Minor changes to the plugin output, for consistency
2fdf5f00 248* New blacklisting keyword 'bat_charge' disables warning messages
ca98fb52 249 related to controller cache battery charging. Thanks to Robert
250 Heinzmann for a patch.
ce662ba2 251* For blacklisting, the component ID kan now be "ALL", in which all
252 components of that type is blacklisted.
945f527c 253* Man page is moved to manual section 8
254
a4e4d852 255
2563.4.9 2009-08-07
257------------------
258
259* Fixed a bug that could cause errors and weird results when checking
260 cooling devices (fans) via SNMP. Thanks to Ken McKinlay for spotting
261 this bug and reporting it.
262
2633.4.8 2009-07-31
264------------------
265
266* For failed physical drives, check_openmanage will now output the
267 drive's vendor, model and size in GB or TB.
268
2693.4.7 2009-07-24
270------------------
271
272* The -s|--snmp option was redundant and no longer does anything. SNMP
273 is triggered automatically if the -H|--hostname option is
274 present. The -s|--snmp option is kept for compatibility, but has no
275 effect.
276
2773.4.6 2009-07-07
278------------------
279
280* Added support for performance data (temperatures) from attached
281 storage enclosures such as the MD1000
282
2833.4.5 2009-06-22
284------------------
285
286* Fixed a regression in the --htmlinfo option when it is not supplied
287 with an argument
288
2893.4.4 2009-06-22
290------------------
291
292* New option --htmlinfo adds clickable HTML links in the plugin's
293 output
294
2953.4.3 2009-06-11
296------------------
297
298* Fixed a regression bug in CPU and power supply reporting that only
299 affects verbose output
300* If blacklisting is used, the global health check (via the --global
301 option) is now negated. Checking the global health doesn't make
302 sense when one or more components is blacklisted. Thanks to Rene
303 Beaulieu for reporting this bug
304* The PNP4Nagios template is now included in the tarball and zip
305 archive
306
3073.4.2 2009-06-03
308------------------
309
310* Improved memory error reporting, when using omreport
311* Collect performance data from pwrmonitoring (amperage probes) that
312 were previously ignored when using omreport
313
3143.4.1 2009-05-25
315------------------
316
317* Improved memory error reporting, when using SNMP
318* Other small ehnancements
319
3203.4.0 2009-05-25
321------------------
322
323* The plugin is now compatible with the Nagios embedded Perl
324 interpreter (ePN) in theory. However, the plugin will not not use
325 ePN by default. We don't want any "accidents".
326* License is now GPLv3, previously only specified as "GPL"
327* New options --only-critical and --only-warning. With these options
328 the plugin will only print critical or warning alerts, respectively.
329* Bugfixes and speed enhancements in the storage section, when
330 checking enclosure components via omreport
331* The --okinfo option is now less verbose and more to the point
332* Lots of code refactoring for readability, maintainability and
333 robustness
334
3353.3.2 2009-05-05
336------------------
337
338* Fixed a bug in the storage section, when checking controllers. This
339 is an obscure bug that only manifests itself in the odd case where a
340 server has multiple controllers, and one of the controllers are
341 missing some of the OIDs, in which case these OIDs will be missing
342 for the other controllers as well. The change is minor and only
343 includes using get_table() instead of get_entries() to collect the
344 SNMP result. Thanks to Stephan Bovet for reporting this bug.
345
3463.3.1 2009-04-28
347------------------
348
349* The --perfdata option can now optionally take an argument
350 "multiline", which makes the plugin produce multiline performance
351 data output in a Nagios 3.x way. Not really needed, but the plugin
352 output is prettier.
353* Added comment within the 10 first lines to disable the nagios
354 embedded perl (ePN) interpreter by default for Nagios 3.x
355* Improvements in the performance data output. Units are now included
356
3573.3.0 2009-04-07
358------------------
359
360* Added --global option, which turns on checking of everything. If
361 used with SNMP, the global system health status is also probed, to
362 protect the user against bugs in the plugin. If used with omreport,
363 the overall chassis health is used.
364* Support for SNMP version 3
365* New check added: esmhealth. This checks the overall health of the
366 ESM log, i.e. the fill grade. More than 80% means a warning message
367* Fixed alert log reporting to use the same format as for the ESM log
368* Output messages are now sorted by severity
369* Minor changes in how out-of-date controller firmware/driver is
370 reported
371* Code refactoring and cleanup
372
3733.2.7 2009-03-29
374------------------
375
376* Use "omreport about" to collect OMSA version. Slightly faster than
377 "omreport system version". This should give a small speedup in
378 certain configurations
379* Fixed typo in output when a logical drive is rebuilding. Thanks to
380 Andreas Olsson for reporting
381* Improved reporting of ESM log content
382* Added omreport.sh as alternate omreport path
383* Lots of other small fixes and enhancements
384
385Plus: A few changes to make the plugin work with old PowerEdge models
386(e.g. 2550, 2450) and/or old OMSA versions (e.g. version 4.5):
387
388* Use the chassisModelName OID to determine if SNMP works (instead of
389 BaseboardType)
390* No longer require a response when checking baseboard type via
391 SNMP. If there is no response, we assume that we're not dealing with
392 a blade server
393
394Thanks to Christian McHugh for help with testing and debugging this
395stuff
396
3973.2.6 2009-03-05
398------------------
399
400* Use 'omreport system operatingsystem' to collect OS info, instead of
401 'omreport system' version which is incredibly slow. This should speed
402 things up in certain configurations.
403* A few speedups, don't collect information that isn't needed
404* Man page fixes
405
4063.2.5 2009-02-24
407------------------
408
409* New option --linebreak to specify the separator between line in case
410 of multiline output
411* Added support for 64bit Windows. Thanks to Patrick Hemmen for a patch
412* [Patrick Hemmen] Added install.bat for Windows installation
413* [Patrick Hemmen] Improvements on install.sh. Will now install in
414 /usr/lib64 for x86_64
415* RPMs are now architecture dependent, because of different libdir
416
4173.2.4 2009-02-17
418------------------
419
420* New option -o|--ok-info to display extra information when everything
421 is ok. The plugin can now display storage firmware and driver info,
422 DRAC and BMC firmware, and OMSA version
423* Support for setting custom minimum temperature thresholds via the
424 -c|--critical and -w|--warning options
425* Better and more detailed temperature error reporting
426* Bugfix in the amperage report (including performance data). The
427 plugin now takes into account the correct unit and measurement for
428 amperage probes (other than watts)
429* New option --port lets the user specify the remote SNMP port number
430
4313.2.3 2009-02-09
432------------------
433
434* Regression fix: Use the older Processor Device SNMP OIDs for older
435 PowerEdge models, that don't have the new Processor Device Status
436 OIDs. Thanks to Nicole Hähnel for reporting this bug.
437* Default output (when there are no alerts) now shows RAC firmware,
438 BMC firmware, info about controllers and enclosures (firmware,
439 driver).
440
4413.2.2 2009-02-03
442------------------
443
444* Regression fix: Ignore unoccupied CPU slots with SNMP probing. This
445 fixes a bug introduced in versjon 3.2.1, which would output
446 something like this if one or more CPU slots were empty: CPU 1 needs
447 attention ()
448
4493.2.1 2009-02-03
450------------------
451
452* Use Processor Device Status Table OIDs instead of Processor Device
453 Table when checking CPUs via SNMP
454* Bugfix: don't report throttled CPUs as warnings when checking via
455 SNMP (same as for checking locally)
456
4573.2.0 2009-01-27
458------------------
459
460* New options --state and --short-state for displaying service state
461 along with the alert
462* Lots of small fixes for code readability and maintainability
463
4643.1.1 2009-01-12
465------------------
466
467* Support for running natively on Windows (using omreport.exe). Thanks
468 to Peter Jestico for a patch.
469* Support for compiled Windows version, i.e. check_openmanage.exe is
470 now a legal script name.
471* Exit with error if script basename is illegal/unknown
472* Various small fixes
473
4743.1.0 2008-12-26
475------------------
476
477* Use of alternate basenames for checking only one class of components
478* Added support for checking the ESM log via SNMP
479* Code refactoring for robustness and maintainability
480* Numerous small fixes and enhancements
481* Added install script in distribution tarball
482
4833.0.2 2008-12-20
484------------------
485
486* The script no longer aborts if it can't get system information via
487 SNMP. Give a warning instead, as this is not a critical error
488* Increased robustness when checking controllers
489
4903.0.1 2008-12-11
491------------------
492
493* Man page fix in the 'check' section. Thanks to Ansgar Dahlen for
494 reporting this.
495* Allow invalid command error from 'omreport chassis pwrmonitoring'
496* Various small fixes
497
4983.0.0 2008-12-04
499------------------
500
501* Use unique IDs for storage components with regard to blacklisting,
502 which means that the blacklisting API has changed
503* Added checks for storage components: connectors (channels),
504 enclosures, enclosure fans, enclosure power supplies, enclosure
505 temperature probes and enclosure management modules (EMMs)
506* Improved verbose output
507* New option -t|--timeout for setting the plugin timeout
508* New option -w|--warning for setting custom temperature warning
509 thresholds
510* New option -c|--critical for setting custom temperature critical
511 thresholds
512* Option --check can no longer be specified in its short form (-c)
513* Code cleanup and improvements
514
5152.1.1 2008-11-24
516------------------
517
518* The workaround for the OMSA bug introduced in OMSA 5.5.0 didn't take
519 multiple controllers into account. This has been fixed.
520
5212.1.0 2008-11-19
522------------------
523
524* New option -i|--info prefixes all alerts with the service tag
525* New option -e|--extinfo gives and extra line of output in case of an
526 alert (model and service tag)
527* New option --postmsg lets the user specify a post message string,
528 with info such as model, service tag etc.
529* Options -b|--blacklist and -c|--check can now be specified multiple
530 times (actually quite useful)
531
5322.0.9 2008-11-17
533------------------
534
535* Slightly improved output for alerts on logical drives (vdisks)
536* Now shows a rebuilding physical disk as a warning, as this is
537 usually accompanied by a degraded vdisk. Previous versions didn't
538 show this at all (omreport classifies it as "OK").
539
5402.0.8 2008-11-14
541------------------
542
543* Slightly improved output for charging controller batteries
544
5452.0.7 2008-11-12
546------------------
547
548* Bugfix for reporting physical drives with predictive failure (both
549 via NRPE and SNMP)
550
5512.0.6 2008-10-30
552------------------
553
554* Fix bug in option handling (ambiguous options)
555* Slightly improved output if checking the storage subsystem is turned
556 off
557* Don't complain if there are no logical drives. This is OK. Thanks to
558 Jamie Henderson for reporting this
559
5602.0.5 2008-10-29
561------------------
562
563* Fix bug in SNMP status level table
564
5652.0.4 2008-10-29
566------------------
567
568* Added workaround for a BUG introduced in OpenManage 5.5.0. OM
569 sometimes adds a newline in the controller driver version name,
570 which leads to problems parsing the output. Thanks to Hiren Patel
571 for bringing this to my attention.
572
5732.0.3 2008-10-28
574------------------
575
576* (snmp) Improved handling of cases where OM is not working properly
577
5782.0.2 2008-10-27
579------------------
580
581* Fixed issue where controller number for physical disks can't be
582 established via SNMP (now identifies as controller no. -1)
583
5842.0.1 2008-10-23
585------------------
586
587* Correctly identifies and reports error condition in which OpenManage
588 has stopped working (it happens)
589
5902.0.0 2008-10-23
591------------------
592
593* SNMP support
594* Same options for checking, blacklisting etc. supported with SNMP
595* Same output with SNMP as with NRPE
596
5971.2.1 2008-09-25
598------------------
599
600* Collects performance data if the option '-p' or '--perfdata' is
601 supplied.
602* New blacklisting directives ctrl_fw and ctrl_driver
603 added. Suppresses the "special" warning messages concerning outdated
604 controller firmware and driver. Useful if you can't or won't
605 upgrade.
606
6071.1.2 2008-08-06
608------------------
609
610* Fix getting system model and serial number for newer blades
611
6121.1.1 2008-08-06
613------------------
614
615* Three new checks added:
616 - System battery probes (typical CMOS battery). Newer poweredge
617 models have these
618 - Power consumption monitoring (if the server supports it)
619 - ESM log, with same functionality as the alert log check. Disabled
620 by default.
621
6221.1.0 2008-08-04
623------------------
624
625* Internal refactoring: use ssv-formatted output from openmanage,
626 resulting in slightly faster execution and increased robustness.
627* If /usr/bin/omreport doesn't exist, try
628 /opt/dell/srvadmin/oma/bin/omreport.
629* Allow for no instrumented/redundant power supplies. Needed on
630 low-end poweredge models and blades.
631
6321.0.3 2008-07-25
633------------------
634
635* Openmanage reports non-critical warning about throttled CPUs on new
636 hardware models. Most og us use ondemand CPU frequency scaling (with
637 throttled CPUs as a result). This specific non-critical warning (CPU
638 Throttled) is ignored from now on.
639* Remove superfluous Celcius sign when reporting temperatures.
640
6411.0.2 2008-07-25
642------------------
643
644* Accommodate blade systems with no fans or powersupplies, i.e. accept
645 errors from omreport when trying to check fans or powersupplies on
646 blade servers.
647* Accommodate newer hardware with slightly different omreport
648 options. Use the newer options if they exist. Not really necessary
649 yet, but deprecated options may be removed in future versions of
650 Dell OpenManage.
651
6521.0.1 2008-07-18
653------------------
654
655* When everything is OK, check_openmanage now outputs the same info as
656 Gerhard Lausser's excellent check_hpasm plugin does for HP servers:
657
658 OK - System: 'poweredge 2850', S/N: 'XXXXXXX', ROM: 'A06
659 10/03/2006', hardware working fine, 2 logical drives, 4 physical
660 drives
661
6621.0.0 2008-07-15
663------------------
664
665* Initial release