Create a tactical overview that separates important from unimportant problems
A general showstopper in many environments is that the Icinga web interface does not separate important from unimportant problems in its overviews. At least one general overview should be available that accomplishes this.
- Important problems are problems no one has taken care of yet (e.g. no acknowledgement set, no downtime set)
- Unimportant problems are problems someone has already taken care of (e.g. acknowledgement set, downtime set)
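The classification above can be sketched roughly as follows. This is only an illustration of the rule, not the actual logic from cgi/tac.c; the `Problem` class and its fields (`acknowledged`, `in_downtime`) are hypothetical names chosen for this example.

```python
from dataclasses import dataclass


@dataclass
class Problem:
    # Hypothetical record for one host or service problem.
    name: str
    state: str                  # e.g. "WARNING" or "CRITICAL"
    acknowledged: bool = False  # someone has acknowledged the problem
    in_downtime: bool = False   # a downtime has been set


def is_handled(problem: Problem) -> bool:
    """A problem is handled (unimportant) once it is acknowledged or in downtime."""
    return problem.acknowledged or problem.in_downtime


def split_problems(problems):
    """Return (important, unimportant) lists according to the rule above."""
    important = [p for p in problems if not is_handled(p)]
    unimportant = [p for p in problems if is_handled(p)]
    return important, unimportant
```

For example, a critical service with an acknowledgement set would land in the second list, while the same service without acknowledgement or downtime would land in the first.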
Both the Nagios core and the Icinga core already have the logic needed to implement such a classification in web GUIs, and it is a widely used feature in production environments. See cgi/tac.c for how it is implemented in the classic UI (the CSS in the classic UI currently does not display this correctly, but that is another issue, see #868 for reference). Also, the "Open Problems" tab seems to use this logic already.
Ideally an overview would look like the generic "To Hostgroup" view, except that "critical" hosts and services are separated into important and unimportant criticals. Warnings should also be included separately in this view.
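As a rough sketch of the data such a view would need, the counts per hostgroup could be collected like this. The input format, a list of (hostgroup, state, handled) tuples, is an assumption made for this example and not something Icinga provides in this form.

```python
from collections import Counter


def summarize(services):
    """Count problems per hostgroup, keyed by (state, "handled" / "unhandled").

    `services` is an iterable of (hostgroup, state, handled) tuples;
    this shape is a hypothetical input format for illustration only.
    """
    summary = {}
    for hostgroup, state, handled in services:
        key = (state, "handled" if handled else "unhandled")
        summary.setdefault(hostgroup, Counter())[key] += 1
    return summary


def format_row(hostgroup, counts):
    """Render one overview row, listing unhandled counts before handled ones."""
    cells = []
    for state in ("CRITICAL", "WARNING"):
        for kind in ("unhandled", "handled"):
            cells.append(f"{counts[(state, kind)]} {kind} {state}")
    return f"{hostgroup}: " + ", ".join(cells)
```

Each row then carries four numbers per hostgroup (unhandled and handled criticals, unhandled and handled warnings), so only the unhandled cells need a conspicuous colour.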
Just as an example, have a look at the Nagios core demo (http://nagioscore.demos.nagios.com/nagios/cgi-bin/status.cgi?hostgroup=all&style=summary, credentials nagiosadmin/nagiosadmin). The colours used in the core are, in my opinion, not the best choice. However, this view should give you a general impression of what kind of display is needed.
I am marking this feature request as urgent.
In many environments where more than one lone sysadmin is working, such a tactical overview is needed. Responsibilities are often separated by hostgroup, but some team members also act as a "backup".
Let's say "admin a" is responsible for "hostgroup a" and "admin b" for "hostgroup b". The two are backups for each other, so each needs a general overview of the other's hostgroup as well.
When "admin a" looks at "hostgroup b", he knows he only needs to react if a problem there persists for a longer period of time. If "admin b" is already working on the issue, he acknowledges the problem (or sets a downtime). This automatically tells "admin a" that no action is required on his side.
#1 Updated by b@fh about 5 years ago
- File hostgroups.jpg added
I just noticed that the Nagios demo site is not static and that it is possible to edit states there, so I attached a small screenshot.
As previously mentioned, the colours used in this Nagios theme are not the best.
To sum it up:
Everything "unhandled" should catch the eye immediately.