Monitoring

Performance metrics, dashboards, and log aggregation

Avg Response: 23ms (p50)

P99 Latency: 142ms (across all services)

Error Rate: 0.08% (last 24h)

Uptime: 99.97% (30 day SLA)
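
As a rough illustration of what the 99.97% uptime figure means over a 30-day window, the sketch below converts an uptime percentage into minutes of downtime. The function name `downtime_budget_minutes` is hypothetical and not part of the dashboard. It assumes the window is measured in whole calendar days.

```python
def downtime_budget_minutes(uptime_percent: float, window_days: int) -> float:
    """Return the minutes of downtime implied by an uptime percentage."""
    total_minutes = window_days * 24 * 60  # minutes in the whole window
    downtime_fraction = (100.0 - uptime_percent) / 100.0
    return total_minutes * downtime_fraction


if __name__ == "__main__":
    # 99.97% over 30 days, the values shown on the Uptime card
    print(round(downtime_budget_minutes(99.97, 30), 2))
```

Under these assumptions, 99.97% over 30 days corresponds to roughly 13 minutes of downtime.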

Dashboards

Grafana-powered metrics visualization

Platform Overview

overview | 12 panels | Last viewed 2 hours ago

Service Health

service | 8 panels | Last viewed 35 min ago

Node Metrics

infrastructure | 6 panels | Last viewed 1 day ago

API Latency Analysis

custom | 4 panels | Last viewed 5 hours ago

Metrics

Prometheus targets

Target             Endpoint                             Status  Last Scrape  Scrape Duration  Samples
api-gateway        http://api-gateway:9090/metrics      up      15s ago      12ms             1,247
user-service       http://user-service:9090/metrics     up      15s ago      8ms              892
order-service      http://order-service:9090/metrics    up      15s ago      11ms             743
payment-service    http://payment-service:9090/metrics  up      15s ago      6ms              421
node-exporter      http://planet-01:9100/metrics        up      30s ago      23ms             3,891
postgres-exporter  http://users-db:9187/metrics         up      30s ago      45ms             567
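
A target listing like this can also be read programmatically. The sketch below is a minimal example that assumes the data comes from the Prometheus HTTP API endpoint `/api/v1/targets`, whose response lists active targets with `labels`, `scrapeUrl`, `health`, and `lastScrapeDuration` (in seconds). The helper names `summarize_targets` and `down_targets` are hypothetical, and the sample payload is hand-written for illustration, not captured from this system.

```python
from typing import Any


def summarize_targets(payload: dict[str, Any]) -> list[dict[str, Any]]:
    """Flatten a /api/v1/targets response into one row per active target."""
    rows = []
    for target in payload.get("data", {}).get("activeTargets", []):
        rows.append({
            "job": target.get("labels", {}).get("job", "unknown"),
            "endpoint": target.get("scrapeUrl", ""),
            "status": target.get("health", "unknown"),
            # The API reports scrape duration in seconds; convert to ms.
            "scrape_ms": round(float(target.get("lastScrapeDuration", 0.0)) * 1000, 1),
        })
    return rows


def down_targets(rows: list[dict[str, Any]]) -> list[str]:
    """Return the job names of targets whose health is not 'up'."""
    return [row["job"] for row in rows if row["status"] != "up"]


# Illustrative payload in the assumed response shape (not real output).
sample_payload = {
    "status": "success",
    "data": {
        "activeTargets": [
            {
                "labels": {"job": "api-gateway"},
                "scrapeUrl": "http://api-gateway:9090/metrics",
                "health": "up",
                "lastScrapeDuration": 0.012,
            },
            {
                "labels": {"job": "postgres-exporter"},
                "scrapeUrl": "http://users-db:9187/metrics",
                "health": "up",
                "lastScrapeDuration": 0.045,
            },
        ]
    },
}

if __name__ == "__main__":
    rows = summarize_targets(sample_payload)
    for row in rows:
        print(row["job"], row["status"], row["scrape_ms"], "ms")
    print("down:", down_targets(rows))
```

In practice the payload would be fetched over HTTP from the Prometheus server. That step is left out here so the example runs without network access.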

Logs

Loki log aggregation

14:23:01.432 INF [api-gateway] → POST /api/v1/orders 201 89ms consumer=web-app
14:23:01.510 INF [user-service] cache hit uid=8a3f2 latency=0.3ms
14:23:02.101 WRN [order-service] slow query: SELECT * FROM orders WHERE... (342ms)
14:23:02.450 DBG [payment-service] stripe webhook received event=payment_intent.succeeded
14:23:02.892 INF [frontend] SSR render /dashboard 45ms cache=HIT
14:23:03.220 ERR [payment-service] stripe webhook sig verification failed req=w9x2k
14:23:03.567 INF [api-gateway] → GET /api/v1/users/me 200 12ms consumer=mobile-app
14:23:03.891 INF [order-service] order ord_29f1k created total=€49.99
14:23:04.102 WRN [api-gateway] rate limit 85/100 rpm for consumer=partner-api
14:23:04.334 INF [user-service] jwt refreshed uid=3b7e1 exp=+1h
14:23:04.567 ERR [notification-svc] SMTP connection timeout after 30s host=smtp.eu.mailgun.org
14:23:05.001 INF [auth-service] token issued realm=production client=web-app sub=sarah@startup.com
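
Log lines in this shape (timestamp, three-letter level, bracketed service name, message) can be split into fields with a regular expression. The sketch below is one possible parser, not the format Loki itself uses internally. The names `LogEntry`, `parse_line`, and `count_by_level` are hypothetical, and the set of levels (INF, WRN, ERR, DBG) is assumed from the lines shown above.

```python
import re
from collections import Counter
from typing import Iterable, NamedTuple, Optional

# Whitespace between fields is optional so that tightly packed lines also match.
LINE_RE = re.compile(
    r"^(?P<time>\d{2}:\d{2}:\d{2}\.\d{3})\s*"
    r"(?P<level>INF|WRN|ERR|DBG)\s*"
    r"\[(?P<service>[^\]]+)\]\s*"
    r"(?P<message>.*)$"
)


class LogEntry(NamedTuple):
    time: str
    level: str
    service: str
    message: str


def parse_line(line: str) -> Optional[LogEntry]:
    """Parse one log line; return None if it does not match the pattern."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    return LogEntry(**match.groupdict())


def count_by_level(lines: Iterable[str]) -> Counter:
    """Count parsed entries per level, skipping lines that do not parse."""
    entries = (parse_line(line) for line in lines)
    return Counter(entry.level for entry in entries if entry is not None)


sample_lines = [
    "14:23:02.101 WRN [order-service] slow query: SELECT * FROM orders WHERE... (342ms)",
    "14:23:03.220 ERR [payment-service] stripe webhook sig verification failed req=w9x2k",
    "14:23:04.567 ERR [notification-svc] SMTP connection timeout after 30s host=smtp.eu.mailgun.org",
]

if __name__ == "__main__":
    for line in sample_lines:
        entry = parse_line(line)
        if entry is not None:
            print(entry.level, entry.service, "->", entry.message)
    print(count_by_level(sample_lines))
```

A parser like this makes it easy to filter for ERR entries or to group warnings by service before taking a closer look.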