Alerts
- Controller pod down
A stageset-controller pod has been NotReady for the entire alert window.
- Operations
Metrics, alerts, events, and runbooks for running the controller day to day.
- Reconcile latency high
Reconcile p99 latency for the StageSet controller is above the alert threshold.
- Workqueue saturation
The controller cannot drain its reconcile queue fast enough.