Monitoring¶
The monitoring system comprises of Prometheus and Grafana. Prometheus is configured as data collector and alert manager. Grafana is used for visualisation. Grafana can also be configured as alert manager. The alert manager is responsible for sending notifications via various communication channels like Email, Slack, PagerDuty and several others.
Prepare configuration repository¶
enable Prometheus/Grafana in
environments/kolla/configuration.yml
enable_grafana: "yes"
enable_prometheus: "yes"
Data retention policies for Prometheus can be configured via command line arguments:
prometheus_cmdline_extras: "--storage.tsdb.retention.time=90d --storage.tsdb.retention.size=50GB"
Grafana unified alerting can be enabled as following. See Grafana documentation. With Grafana unified alerting the alerts from Prometheus can be handled within Grafana. Add the file
environments/kolla/files/overlays/grafana/grafana.ini
with the following content to the configuration repository.
[alerting]
enabled = false
[unified_alerting]
enabled = true
[smtp]
enable = true
from_address = alerts@osism.local
Add Prometheus rule files and Grafana dashboards to the configuation repository. Clone the kolla-operations repository and copy all files from grafana and prometheus directory to the configuration repository. This will install a set of metrics definitions and alert rules for OpenStack services.
git clone https://github.com/osism/kolla-operations.git
cp -r kolla-operations/grafana cfg-osism/environments/kolla/files/overlays/
cp -r kolla-operations/prometheus cfg-osism/environments/kolla/files/overlays/
If you want to use Prometheus alert manager, place the configuration in
environments/kolla/files/overlays/prometheus/prometheus-alertmanager.yml
. See Prometheus alert manager documentation.
---
global:
resolve_timeout: 5m
smtp_require_tls: false
smtp_smarthost: localhost:25
smtp_from: alerts@osism.local
route:
receiver: default-receiver
group_wait: 30s
group_interval: 5m
repeat_interval: 4h
receivers:
- name: default-receiver
email_configs:
- to: monitoring@osism.local
send_resolved: true
Run Monitoring deployment¶
Run Prometheus and Grafana deployments:
osism-kolla deploy prometheus
osism-kolla deploy grafana
The Grafana Dashboard will be available on the internal network at
http://api-int.osism.local:3000
.The Prometheus web console will be available on the internal network at
http://api-int.osism.local:9091
.The Prometheus Alert Manager web console will be available on the internal network at
http://api-int.osism.local:9093
.