Metric alerts are triggered by Prometheus: when an alert expression's condition is met, Prometheus sends the alert to Alertmanager. All other alerts are triggered by Rancher, which calls the Alertmanager API itself.
- Go to System project -> Apps and check the status of the monitoring-operator, cluster-alerting, and cluster-monitoring apps. Make sure all three deployed successfully.
- Go to Tools -> Notifier, select the notifier you configured, and click the Test button. Make sure you receive a test message.
- Make sure a receiver is configured for your alert group. If no receiver is configured, the alert rules under that alert group will not be triggered.
- Run curl http://alertmanager-operated.cattle-prometheus.svc.cluster.local:9093/api/v1/alerts from the cluster agent to check whether Rancher server can reach the Alertmanager API.
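
The reachability check above can also be scripted. The following is a minimal Python sketch, not part of Rancher itself. It assumes the Alertmanager v1 API answers with a JSON object that has a `status` field and a `data` list of alerts; the helper names `fetch_alerts` and `summarize_alerts` are hypothetical, and the URL is the one from the curl command. Like the curl command, it has to run somewhere that can resolve the in-cluster service name.

```python
import json
import urllib.request

# URL taken from the curl command in the checklist above.
ALERTMANAGER_ALERTS_URL = (
    "http://alertmanager-operated.cattle-prometheus.svc.cluster.local:9093"
    "/api/v1/alerts"
)


def fetch_alerts(url=ALERTMANAGER_ALERTS_URL, timeout=5):
    """Fetch and decode the Alertmanager alerts endpoint.

    Raises urllib.error.URLError if the service cannot be reached,
    which is exactly the failure this check is meant to surface.
    """
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


def summarize_alerts(payload):
    """Return (ok, label_sets) for a decoded response.

    Assumed shape: {"status": "success", "data": [{"labels": {...}}, ...]}.
    """
    ok = payload.get("status") == "success"
    label_sets = [alert.get("labels", {}) for alert in payload.get("data") or []]
    return ok, label_sets


if __name__ == "__main__":
    ok, label_sets = summarize_alerts(fetch_alerts())
    print("Alertmanager API reachable:", ok)
    for labels in label_sets:
        print("active alert:", labels)
```

If `fetch_alerts` raises a connection or DNS error, the problem is network access to Alertmanager rather than the alert rule itself.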
- Go to System project -> Apps -> cluster-monitoring and click index.html to open the Prometheus page, then paste your expression there to test it.
- In the Prometheus page, click the Alerts tab and make sure your expression is listed there. If it is not, generating the alert rule may have failed; check the Rancher server logs for errors.
- The Prometheus Alerts page shows the status of each alert. If the status is firing, Prometheus calls the Alertmanager API and sends an alert event to Alertmanager, and you should see that event on the Alertmanager page (System project -> Apps -> cluster-alerting -> click index.html). If the event is not there, Prometheus failed to call the Alertmanager API; check the Prometheus logs.
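
The comparison described above (an alert firing in Prometheus should also appear in Alertmanager) can be sketched in Python. This is an illustrative sketch under stated assumptions, not Rancher's own logic: it takes the decoded JSON of each component's `/api/v1/alerts` endpoint as input, assumes Prometheus reports `data.alerts[].state` and Alertmanager reports a `data` list, and treats an alert as delivered when every Prometheus label is present on some Alertmanager alert (Alertmanager may carry extra labels). The function names are hypothetical, and fetching the payloads is left out because the Prometheus service address is not given here.

```python
def firing_alerts(prom_payload):
    """Label sets of the alerts that Prometheus reports as firing."""
    alerts = (prom_payload.get("data") or {}).get("alerts") or []
    return [a.get("labels", {}) for a in alerts if a.get("state") == "firing"]


def missing_in_alertmanager(prom_payload, am_payload):
    """Firing Prometheus alerts that have no matching Alertmanager alert.

    A match means every Prometheus label appears with the same value on
    one Alertmanager alert; extra labels on that alert are ignored.
    """
    am_label_sets = [a.get("labels", {}) for a in am_payload.get("data") or []]
    missing = []
    for labels in firing_alerts(prom_payload):
        delivered = any(labels.items() <= other.items() for other in am_label_sets)
        if not delivered:
            missing.append(labels)
    return missing


if __name__ == "__main__":
    # Hand-written sample payloads; real ones come from the two APIs.
    prom = {
        "status": "success",
        "data": {
            "alerts": [
                {"labels": {"alertname": "HighCPU"}, "state": "firing"},
                {"labels": {"alertname": "DiskFull"}, "state": "firing"},
                {"labels": {"alertname": "SlowAPI"}, "state": "pending"},
            ]
        },
    }
    am = {
        "status": "success",
        "data": [{"labels": {"alertname": "HighCPU", "cluster": "example"}}],
    }
    for labels in missing_in_alertmanager(prom, am):
        print("firing in Prometheus but not in Alertmanager:", labels)
```

Any alert this reports is a candidate for the "Prometheus failed to call the Alertmanager API" case, so the Prometheus logs are the next place to look.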
- Non-metric alerts are triggered by Rancher itself. Enable debug logging on the Rancher server to check whether it is failing to call the Alertmanager API.