ADR-003 System alerts

Context

There has been a wish for functionality that alerts administrators when certain system values cross defined thresholds.

Decision

We have decided to use Alertmanager, which is part of the Prometheus stack already used for monitoring the cluster.

Consequences

  • We have installed and tested Alertmanager. It works, and given the many ways of defining alert rules, we consider the requirements fulfilled.
  • We will be able to get threshold alerts at both container and cluster level, which is what we need.
  • Alertmanager fits the general focus on being cloud agnostic. It is part of Prometheus, a CNCF graduated project, and has no external infrastructure dependencies.
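
To illustrate what a threshold rule at container level and at cluster level could look like, here is a minimal sketch of a Prometheus rule file. It is not taken from our setup: the group name, alert names, thresholds, durations and severity labels are all hypothetical. It also assumes that cAdvisor container metrics and node_exporter metrics are being scraped. Prometheus evaluates the rules, and firing alerts are handed to Alertmanager, which takes care of grouping and routing the notifications.

```yaml
# Hypothetical rule file; every name and threshold here is illustrative only.
groups:
  - name: example-system-alerts
    rules:
      # Container level: working-set memory above 90% of the configured limit.
      # The "> 0" filter drops containers that have no memory limit set.
      # Assumes cAdvisor metrics are available.
      - alert: ContainerMemoryNearLimit
        expr: |
          container_memory_working_set_bytes{container!=""}
            / (container_spec_memory_limit_bytes{container!=""} > 0)
          > 0.9
        for: 10m
        labels:
          severity: warning
        annotations:
          summary: "Container {{ $labels.container }} is above 90% of its memory limit"

      # Cluster level: less than 10% free space left on a node filesystem.
      # Assumes node_exporter metrics are available.
      - alert: NodeFilesystemAlmostFull
        expr: |
          node_filesystem_avail_bytes{fstype!~"tmpfs|overlay"}
            / node_filesystem_size_bytes{fstype!~"tmpfs|overlay"}
          < 0.10
        for: 15m
        labels:
          severity: critical
        annotations:
          summary: "Filesystem {{ $labels.mountpoint }} on {{ $labels.instance }} has less than 10% free space"
```

The `for` clause keeps a short spike from triggering a notification: the condition has to hold for the whole period before the alert fires.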