Hi
In line with organising our diskspace alerts, I would like to implement a more efficient Service Monitoring routine.
The following are my requirements:
- The NT Service Monitor is set to execute every 1 minute. It checks for any failed services.
- An Alert (Event Log Message) should be generated only the second time a specific service fails (this allows service restarts without notifying support staff unneccessarily).
- After the first Alert has been sent, the next Alert should only be generated in 30 minute intervals
- If a service recovers, the whole cycle should start again.
At first I thought to implement the above using a technique involving Status variables similar to what Rainer suggested to treat disk space Alerts, but here I ran into a problem. I cannot increment the value of a status variable by one to reflect the number of repepitions that a particular service has already failed (in the above example I would test for the status variable values of 1 and 30). Similarly, the minimum wait time option cannot be used as this is filter specific and I need to manage the state for an unknown number of services.
Have I expressed myself clearly? Any help would be greatly appreciated.
Regards
Mark


