Open
Description
/kind feature
/help
Describe the solution you'd like
We would like to improve the supportability of CAPG. We would like to make it easier for operations/support users to investigate issues with CAPG.
There will be a number of areas of improvement:
- Add tracing using something like OpenTelemetry. This will enable us to see how many and how long reconcile loops, GCP API calls etc take, which will be very helpful for issue investigation.
- Add a pprof endpoint that can be enabled
Anything else you would like to add:
[Miscellaneous information that will assist in solving the issue.]
Items to do (not exhaustive yet)
- Implement OTEL in CAPG
- Docs
- e2e test to be added
- otel backend for local dev
- otel backend for e2e