rpk: add `--wait` to `remote-bundle start` #26486

JFlath · 2025-06-17T12:46:20Z

Backports Required

Release Notes

Improvements

Allows waiting for collection to complete when running a remote-bundle

JFlath · 2025-06-17T12:48:12Z

@r-vasquez As ever, I'll have made some mistakes with bit/build. This is branched from remote_bundle_upload, so includes that change too. Can either rebase, or we can just merge that in first.

Either way though, more broadly, does the approach here work for you?

JFlath · 2025-06-17T15:56:22Z

src/go/rpk/pkg/cli/debug/remotebundle/start.go

+					ready, errorred := filterCompletedBrokers(status)
+					if len(ready)+len(errorred) == len(status) {
+						if len(ready) == 0 {
+							fmt.Printf(`


Would using out.Die() be the better option here to exit with 1?

Regardless of approach, I actually think we need to exit 1 in this case, having thought about it some more. If out.Die() is the best option for that, great. If not, let me know :)

Yeah, either out.Die or os.Exit(1). But os.Die is effectively the same so let's use that 👍

vbotbuildovich · 2025-06-17T16:45:13Z

CI test results

test results on build#67463

test_class	test_method	test_arguments	test_kind	job_url	test_status	passed	reason
TopicDeleteCloudStorageTest	topic_delete_installed_snapshots_test		ducktape	https://buildkite.com/redpanda/redpanda/builds/67463#01977e21-bd90-47d5-a1b9-1796af54ff9b	FLAKY	20/21	upstream reliability is '100.0'. current run reliability is '95.23809523809523'. drift is 4.7619 and the allowed drift is set to 50. The test should PASS

r-vasquez · 2025-06-19T00:15:29Z

src/go/rpk/pkg/cli/debug/remotebundle/start.go

+					ready, errorred := filterCompletedBrokers(status)
+					if len(ready)+len(errorred) == len(status) {
+						if len(ready) == 0 {
+							fmt.Printf(`


Yeah, either out.Die or os.Exit(1). But os.Die is effectively the same so let's use that 👍

r-vasquez · 2025-06-19T00:53:46Z

src/go/rpk/pkg/cli/debug/remotebundle/start.go

+Waiting for collection to complete...
+`, jobID)
+
+				for {


I usually dislike 'naked' infinite loops without any escape hatch, what do you think of:

Listening to the cmd.Context() cancellation.

Having a timeout (probably configurable through flags)

having a parameter that breaks the for loop so you don't have to wait for the next poll to finish the loop

I'm thinking in something like:

// context with timeout ctx, cancel := context.WithTimeout(cmd.Context(), timeout) defer cancel() var done bool for !done { select { case <-ctx.Done(): // Context canceled or timed out; return or handle error return ctx.Err() default: select { case <-ctx.Done(): // Context canceled during wait return ctx.Err() case <-time.After(10 * time.Second): // Poll: // status, err := executeBundleStatus(ctx, fs, p) // handle error, right now is not being handled, we can break early if we find something bad. // ... // if len(ready)+len(errorred) == len(status) { // if len(ready) == 0 { // // Print "no bundles created" message // } // done = true // } } } }

I see that we start polling initially after 10s, as (I assume) this is a long-running operation and won't be ready right away, but alternatively, you can poll right away with the suggestion above. Just have the select at the end of the polling logic

Also, what do you think of printing a debug log (or straight to stdout) with the progress:

fmt.Printf("Waiting for collection... %v/%v ready\n", len(ready), len(status))

We can also be fancy and use carriage return to update the message

for !done { // ... polling fmt.Printf("\rWaiting for collection... %v/%v ready", len(ready), len(status)) // ... sleep ... } fmt.Println() // move to new line. fmt.Println("Debug bundle collection successful, you can find the debug bundle in ....")

r-vasquez · 2025-06-19T00:54:43Z

src/go/rpk/pkg/cli/debug/remotebundle/start.go

+  rpk debug remote-bundle status
+`, jobID)
+						}
+						break


I think if we break here, we are not printing any message, we should confirm before ending that everything went right and where the bundle is located.

JFlath added 2 commits June 11, 2025 21:05

rpk: Add --upload-url for remote-bundle

7627923

rpk: add --wait to remote-bundle start

c328ffd

JFlath requested review from r-vasquez, kbatuigas and a team as code owners June 17, 2025 12:46

github-actions bot added area/rpk area/build labels Jun 17, 2025

JFlath commented Jun 17, 2025

View reviewed changes

r-vasquez reviewed Jun 19, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

rpk: add `--wait` to `remote-bundle start` #26486

rpk: add `--wait` to `remote-bundle start` #26486

Uh oh!

JFlath commented Jun 17, 2025

Uh oh!

JFlath commented Jun 17, 2025

Uh oh!

JFlath Jun 17, 2025

Uh oh!

JFlath Jun 18, 2025

Uh oh!

r-vasquez Jun 19, 2025

Uh oh!

vbotbuildovich commented Jun 17, 2025

Uh oh!

r-vasquez Jun 19, 2025

Uh oh!

r-vasquez Jun 19, 2025

Uh oh!

r-vasquez Jun 19, 2025

Uh oh!

r-vasquez Jun 19, 2025

Uh oh!

Uh oh!

rpk: add --wait to remote-bundle start #26486

Are you sure you want to change the base?

rpk: add --wait to remote-bundle start #26486

Uh oh!

Conversation

JFlath commented Jun 17, 2025

Backports Required

Release Notes

Improvements

Uh oh!

JFlath commented Jun 17, 2025

Uh oh!

JFlath Jun 17, 2025

Choose a reason for hiding this comment

Uh oh!

JFlath Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

r-vasquez Jun 19, 2025

Choose a reason for hiding this comment

Uh oh!

vbotbuildovich commented Jun 17, 2025

CI test results

Uh oh!

r-vasquez Jun 19, 2025

Choose a reason for hiding this comment

Uh oh!

r-vasquez Jun 19, 2025

Choose a reason for hiding this comment

Uh oh!

r-vasquez Jun 19, 2025

Choose a reason for hiding this comment

Uh oh!

r-vasquez Jun 19, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

rpk: add `--wait` to `remote-bundle start` #26486

rpk: add `--wait` to `remote-bundle start` #26486