Logging fix -- make root cause more clear if label job has misaligned dates #611
## Summary
We were attempting to run label join but the outputNamespace here was [null](https://github.com/zipline-ai/chronon/blob/main/spark/src/main/scala/ai/chronon/spark/batch/LabelJoinV2.scala#L210) since it's not set by compile.

## Checklist
- [ ] Added Unit Tests
- [ ] Covered by existing CI
- [ ] Integration tested
- [ ] Documentation update

## Summary by CodeRabbit
- **New Features**
  - Improved team processing to ensure that join operations update metadata more consistently, resulting in better management of grouping information during team updates.
## Summary

## Checklist
- [ ] Added Unit Tests
- [ ] Covered by existing CI
- [ ] Integration tested
- [ ] Documentation update

## Summary by CodeRabbit
- **New Features**
  - Enhanced partition input handling now accepts multiple segments, providing more flexible and robust processing of partition specifications.
- **Refactor**
  - Improved internal code organization to boost overall maintainability and clarity.
## Summary
Noticed that when I ran compile on Etsy's conf, their conf looked like this:

```
conf=ConfigProperties(
    common={ ... },
)
```

With the current code, this ended up with our Submitter not setting any job confs at all. This is because we were checking whether `modeConfigs` was set, and if it was not, we'd do nothing. This is a bug. Fixing that logic.

## Checklist
- [ ] Added Unit Tests
- [ ] Covered by existing CI
- [ ] Integration tested
- [ ] Documentation update

## Summary by CodeRabbit
- **Refactor**
  - Streamlined the internal process for retrieving mode configurations, reducing its complexity and improving efficiency while maintaining existing functionality.
- **New Features**
  - Introduced new JSON configuration files for the `quickstart.purchases.v1` dataset, enhancing data management and execution settings.
  - Added a new test class for validating the functionality of the `JobSubmitter` class, ensuring robustness in configuration parsing and argument retrieval.
- **Chores**
  - Updated test suite configurations to improve testing structure and accessibility.
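To illustrate the shape of the bug described above, here is a minimal Python sketch, not the actual Submitter code. The function name `resolve_job_confs`, its parameters, and the rule that mode-specific values override `common` are all assumptions made for illustration. The point it shows is that `common` should still be applied when no mode configs are set.

```python
from typing import Dict, Optional


def resolve_job_confs(
    common: Optional[Dict[str, str]],
    mode_configs: Optional[Dict[str, Dict[str, str]]],
    mode: str,
) -> Dict[str, str]:
    """Hypothetical helper: combine `common` confs with overrides for one mode."""
    # Always start from the common confs. The buggy shape returned nothing
    # whenever mode_configs was unset, which silently dropped `common`.
    merged = dict(common or {})
    # Assumption: mode-specific values take precedence over common ones.
    if mode_configs:
        merged.update(mode_configs.get(mode, {}))
    return merged
```

With only `common` set, as in the conf shown above, the sketch returns the common confs unchanged instead of an empty result.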
… dates (#611)

## Summary
Improve logging when labelDs - Window does not overlap with join data.
Add step day functionality to labelJoin.

## Checklist
- [ ] Added Unit Tests
- [x] Covered by existing CI
- [ ] Integration tested
- [ ] Documentation update

## Summary by CodeRabbit
- **New Features**
  - Enhanced flexibility in date handling for label join operations, allowing for an optional start date override.
  - Introduction of new JSON configuration files for the `quickstart.purchases.v1` dataset, facilitating structured data processing and analysis.
- **Bug Fixes**
  - Improved diagnostic messaging to clearly indicate when required data elements are missing during processing.
  - Enhanced error detection to ensure necessary computation windows are present before proceeding.

---------

Co-authored-by: david-zlai <[email protected]>
Co-authored-by: ezvz <[email protected]>
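The two ideas in this summary can be sketched roughly as follows. This is a hedged Python illustration, not the `LabelJoinV2` implementation: the function names, the day-based window, and the inclusive date ranges are all assumptions. The first function checks that `labelDs - window` lands on an available join partition and reports the root cause if it does not. The second splits a date range into chunks of a fixed number of step days.

```python
from datetime import date, timedelta
from typing import List, Set, Tuple


def check_label_window_overlap(
    label_ds: date, window_days: int, join_partitions: Set[date]
) -> date:
    """Hypothetical check: return the join partition needed for label_ds, or fail clearly."""
    needed = label_ds - timedelta(days=window_days)
    if needed not in join_partitions:
        # Spell out the root cause instead of failing later on empty data.
        raise ValueError(
            f"labelDs {label_ds} minus window of {window_days} day(s) is {needed}, "
            "which does not overlap with any available join partition"
        )
    return needed


def step_ranges(start: date, end: date, step_days: int) -> List[Tuple[date, date]]:
    """Hypothetical helper: split [start, end] (inclusive) into chunks of step_days."""
    if step_days < 1:
        raise ValueError("step_days must be at least 1")
    ranges = []
    current = start
    while current <= end:
        # The last chunk is clipped to the end of the range.
        chunk_end = min(current + timedelta(days=step_days - 1), end)
        ranges.append((current, chunk_end))
        current = chunk_end + timedelta(days=1)
    return ranges
```

Under these assumptions, a misaligned label date raises an error that names the missing join partition, and a five-day range with a step of two days yields three chunks.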