Skip to content

Turn off sbt delta lake tests in CI #903

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 2 commits into from
Jan 27, 2025

Conversation

piyush-zlai
Copy link
Collaborator

Summary

Turn off delta lake tests in CI - We are seeing these tests flake out fairly often and they result in holding up the rest of our dev loops. We are exercising testing of the table utils path already (the Hive flow). We can explore enabling these on Spark 3.5 and when we're running tests on Bazel there. Isolation and Spark 3.5 are the only differences with our internal fork's setup that come to my mind and these tests have not been flaky on our end.

Why / Goal

Test Plan

  • Added Unit Tests
  • Covered by existing CI
  • Integration tested

Checklist

  • Documentation update

Reviewers

@piyush-zlai
Copy link
Collaborator Author

@pengyu-hou / @caiocamatta-stripe / @mickjermsurawong-openai - could either of you folks review?

Copy link
Collaborator

@pengyu-hou pengyu-hou left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks. Once we have the bazel PR merged, we should have spark 3.5. Let's revisit this later.

@@ -177,9 +147,6 @@ workflows:
- "Scala 13 -- Tests":
requires:
- "Pull Docker Image"
- "Scala 13 -- Delta Lake Format Tests":
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

On a second thought, is it possible to make it optional? So it won't be a blocker to merge PRs.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

let me check if circle ci supports this - if possible I agree, it would be nice

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@piyush-zlai any follow up here? Working on a similar PR for iceberg #993

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@abbywh - I haven't had the chance to revisit these. Based on my (limited) understanding of Spark the way we're creating sessions per test coupled with kicking off all the tests in parallel does often result in flakes as the sessions seem to conflict. The tests pass standalone in this repo (and we have them enabled for months now in our fork) but they seem to be flaking out when turned on with other tests triggered at the same time I think.

@caiocamatta-stripe caiocamatta-stripe merged commit e3e7527 into airbnb:main Jan 27, 2025
7 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants