ephemeralrunner should support a TTL #4100
Labels: community (Community contribution), enhancement (New feature or request), needs triage (Requires review from the maintainers)
What would you like added?
It would be nice if ephemeral runners supported a TimeToLive (TTL) value that cleanly expires both the pod and the ephemeralrunner resource.
Why is this needed?
I have an ephemeralrunner setup defined via gha-runner-scale-set. I have minRunners set, and when the runners start up they connect to GHE and wait for work. Part of my pod spec includes a custom volume that I need to "keep fresh" by rotating it every 2 hours. (The volume is time-consuming to set up, and our volume provisioner abstracts that time away.) We installed a liveness probe on the pod to expire it after 2 hours, and this works at most 5 times because of this code:
actions-runner-controller/controllers/actions.github.com/ephemeralrunner_controller.go
Lines 190 to 198 in 1dbb88c
After hitting that condition, the ephemeralrunner resource hangs around with no new pod. Eventually (after ~10 hours of idle time, i.e. five 2-hour rotations) none of the ephemeralrunners has a running pod, meaning we lose all our GH runners.