mirror of
https://github.com/eworm-de/routeros-scripts
synced 2024-05-14 08:04:19 +00:00
88 lines
3 KiB
Markdown
88 lines
3 KiB
Markdown
Notify about health state
|
||
=========================
|
||
|
||
[⬅️ Go back to main README](../README.md)
|
||
|
||
> ℹ️ **Info**: This script can not be used on its own but requires the base
|
||
> installation. See [main README](../README.md) for details.
|
||
|
||
Description
|
||
-----------
|
||
|
||
This script is run from scheduler periodically, sending notification on
|
||
health related events:
|
||
|
||
* high CPU utilization
|
||
* low available free RAM
|
||
* voltage jumps up or down more than configured threshold
|
||
* voltage drops below hard lower limit
|
||
* power supply failed or recovered
|
||
* temperature is above or below threshold
|
||
|
||
Note that bad initial state will not trigger an event.
|
||
|
||
Monitoring CPU utilization and available free RAM works on all devices.
|
||
Other than that only sensors available in hardware can be checked. See what
|
||
your hardware supports:
|
||
|
||
/system/health/print;
|
||
|
||
### Sample notifications
|
||
|
||
#### CPU utilization
|
||
|
||
![check-health notification cpu utilization high](check-health.d/notification-01-cpu-utilization-high.avif)
|
||
![check-health notification cpu utilization ok](check-health.d/notification-02-cpu-utilization-ok.avif)
|
||
|
||
#### Available free RAM
|
||
|
||
![check-health notification free ram low](check-health.d/notification-03-free-ram-low.avif)
|
||
![check-health notification free ram ok](check-health.d/notification-04-free-ram-ok.avif)
|
||
|
||
#### Voltage
|
||
|
||
![check-health notification voltage](check-health.d/notification-05-voltage.avif)
|
||
|
||
#### Temperature
|
||
|
||
![check-health notification temperature high](check-health.d/notification-06-temperature-high.avif)
|
||
![check-health notification temperature ok](check-health.d/notification-07-temperature-ok.avif)
|
||
|
||
#### PSU state
|
||
|
||
![check-health notification psu fail](check-health.d/notification-08-psu-fail.avif)
|
||
![check-health notification psu ok](check-health.d/notification-09-psu-ok.avif)
|
||
|
||
Requirements and installation
|
||
-----------------------------
|
||
|
||
Just install the script and create a scheduler:
|
||
|
||
$ScriptInstallUpdate check-health;
|
||
/system/scheduler/add interval=53s name=check-health on-event="/system/script/run check-health;" start-time=startup;
|
||
|
||
> ℹ️ **Info**: Running lots of scripts simultaneously can tamper the
|
||
> precision of cpu utilization, escpecially on devices with limited
|
||
> resources. Thus an unusual interval is used here.
|
||
|
||
Configuration
|
||
-------------
|
||
|
||
The configuration goes to `global-config-overlay`, these are the parameters:
|
||
|
||
* `CheckHealthTemperature`: an array specifying temperature thresholds for sensors
|
||
* `CheckHealthVoltageLow`: value (in volt*10) giving a hard lower limit
|
||
* `CheckHealthVoltagePercent`: percentage value to trigger voltage jumps
|
||
|
||
> ℹ️ **Info**: Copy relevant configuration from
|
||
> [`global-config`](../global-config.rsc) (the one without `-overlay`) to
|
||
> your local `global-config-overlay` and modify it to your specific needs.
|
||
|
||
Also notification settings are required for
|
||
[e-mail](mod/notification-email.md),
|
||
[matrix](mod/notification-matrix.md) and/or
|
||
[telegram](mod/notification-telegram.md).
|
||
|
||
---
|
||
[⬅️ Go back to main README](../README.md)
|
||
[⬆️ Go back to top](#top)
|