Logs, metrics, traces, Prometheus, Grafana, and production alerting.
35+ SRE interview questions covering SLOs, error budgets, incident management, observability, capacity planning, and toil reduction — with production-grade answers.