RunbookRelay
RunbookRelay is a web app (with a lightweight CLI agent) that continuously validates your redundancy and failover by running safe, scheduled “micro-drills” against real infrastructure. Instead of trusting static runbooks and occasional game days, it executes small, reversible checks: DNS cutover simulations, standby promotion dry-runs, load balancer target rotation, backup restore spot-checks, and dependency reachability tests. It records evidence (logs, timings, screenshots/outputs), flags drift (runbook steps no longer match reality), and generates audit-ready reports for leadership and compliance. This is not a magic auto-failover tool; it’s a verification layer that catches the boring, common failures—stale credentials, missing IAM permissions, broken scripts, misconfigured health checks—before an incident. Integrates with common stacks (Kubernetes, AWS, Terraform) and ticketing to open actionable tasks when drills fail.