Open
Description:
We need a way to measure and track the reconciliation time in the DevWorkspace Operator. Reconciliation time refers to the duration it takes for the operator to detect changes and bring the cluster state in line with the desired state, starting from the moment a relevant change occurs.
Why This Matters:
- Understanding reconciliation time helps identify performance bottlenecks in the operator loop.
- It is crucial for maintaining a responsive developer experience when workspaces are started, stopped, or modified.
Potential Approaches:
- Use custom Prometheus metrics to record the start and end time of reconciliations.
- Tag and correlate reconciliation events with resource UIDs or names for granularity.
- Expose reconciliation duration via metrics endpoint.
Acceptance Criteria:
- A method is proposed and documented for measuring reconciliation time.
- An initial implementation is added that measures reconciliation duration and either logs it or exposes it.
- Reconciliation time is accessible via metrics (e.g., Prometheus) or logs.
- Alerts or dashboards are configured for reconciliation durations.
- The solution should not significantly impact operator performance.