Skip to content

Instantly share code, notes, and snippets.

@kulikov
Created May 19, 2017 11:47
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
Star You must be signed in to star a gist
Save kulikov/fc806428e3a384a1d01e9905857f0675 to your computer and use it in GitHub Desktop.
monitoring:
alerts:
RPS:
prometheus: sum(rate(ws_requests_latency_seconds_count{app="services/front-office-service"}[5m]))
warn if more: 200
crit if more: 500
Requests latency:
prometheus: histogram_quantile(0.95, sum(rate(ws_requests_latency_seconds_bucket[1m]{app="services/front-office-service"})) by (le, app))
warn if more: 100
crit if more: { qa: 200, live: 1000 }
Errors rate:
elastic: container_tag:"{{ info.name }}" AND ("error" "fail" "failure")
warn if more: 50
crit if more: 200
Purchases:
elastic: container_tag:"{{ info.name }}" AND message:"Order purchase successful"
from: 1day
warn if less: 10
crit if less: (1, 50)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment