Integration test github action #5076

enyst · 2024-11-16T00:34:44Z

The eval-runner workflow from the .github/workflows directory is too big. Read it all carefully, and note how it's doing two different things: integration test evaluation and SWE-Bench evaluation. Let's split it:

* Find the section named "Run integration test evaluation" and the necessary sections before and after it.
* Create a new workflow named integration-runner.
* Configure it to trigger on a PR labeled 'integration-test'.
* Move the integration test evaluation section to the new file and copy the prerequisites.

IMPORTANT:

the action should run only when labeled or manually triggered.
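
A minimal sketch of what the requested trigger setup could look like, assuming the standard GitHub Actions `pull_request` `labeled` event and `workflow_dispatch` for manual runs. The job name, runner, and step contents below are hypothetical placeholders and are not taken from the actual eval-runner workflow or the merged PR.

```yaml
# Hypothetical sketch of .github/workflows/integration-runner.yml
name: Integration Runner  # assumed display name

on:
  pull_request:
    types: [labeled]      # fires whenever any label is added to a PR
  workflow_dispatch:      # allows manual triggering from the Actions tab

jobs:
  run-integration-tests:  # hypothetical job name
    # Run on manual dispatch, or only when the label just added is 'integration-test'.
    if: github.event_name == 'workflow_dispatch' || github.event.label.name == 'integration-test'
    runs-on: ubuntu-latest
    steps:
      - name: Checkout repository
        uses: actions/checkout@v4
      # The prerequisite steps copied from eval-runner would go here.
      - name: Run integration test evaluation
        run: echo "placeholder for the integration test evaluation command"
```

The job-level `if` is needed because the `labeled` event fires for every label, so filtering on the label name keeps the workflow from running when unrelated labels are added.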

github-actions · 2024-11-16T00:35:27Z

OpenHands started fixing the issue! You can monitor the progress here.

github-actions · 2024-11-16T00:39:41Z

A potential fix has been generated and a draft PR #5077 has been created. Please review the changes.

* Fix issue All-Hands-AI#5076: Integration test github action
* Update integration-runner.yml
* Update integration-runner.yml
* update variables
* use haiku
* use base url
* fix report name
* Fix pr #8: Integration tests (openhands fix issue 5076)
* Revert "Fix pr #8: Integration tests (openhands fix issue 5076)" This reverts commit dcd4681.
* Fix pr #8: Integration tests (openhands fix issue 5076)
* use haiku explicitly, in results too
* remove duplicate
* Update .github/workflows/integration-runner.yml
* Revert "Update .github/workflows/integration-runner.yml" This reverts commit 7e7200e.
* funny space
* Fix pr #8: Integration tests (openhands fix issue 5076)
* artifact fix
* clean up remote runtimes
* clean up runtimes more aggressively - a bit unexpected though
* Fix pr #8: Integration tests (openhands fix issue 5076)
* fix type issue that was preventing checking results
* try with waiting time
* add eval notes
* increase timeouts
* try with CI local builds
* fix eval output
* set debug
* fix tests!
* fix outputs
* keep details in logs, not github comment
* tweak schedule
* lint-y

---------

Co-authored-by: openhands <[email protected]>

Co-authored-by: Engel Nyst <[email protected]>

enyst added the enhancement (New feature or request) and fix-me (Attempt to fix this issue with OpenHands) labels Nov 16, 2024

openhands-agent mentioned this issue Nov 16, 2024

Fix issue #5076: Integration test github action #5077

Merged

enyst pushed a commit to enyst/playground that referenced this issue Nov 23, 2024

Fix issue All-Hands-AI#5076: Integration test github action

eaf3057

enyst added a commit that referenced this issue Nov 27, 2024

Fix issue #5076: Integration test github action (#5077)

f0ca223

Co-authored-by: Engel Nyst <[email protected]>

enyst closed this as completed in #5077 Nov 27, 2024
