-
Notifications
You must be signed in to change notification settings - Fork 0
adding again: Retrieve source uri prefix from hive partitioning options when building the GCS format #230
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
WalkthroughThe pull request modifies the Changes
Possibly related PRs
Suggested Reviewers
Poem
Warning Review ran into problems🔥 ProblemsGitHub Actions: Resource not accessible by integration - https://docs.github.com/rest/actions/workflow-runs#list-workflow-runs-for-a-repository. Please grant the required permissions to the CodeRabbit GitHub App under the organization or repository settings. 📜 Recent review detailsConfiguration used: CodeRabbit UI 📒 Files selected for processing (1)
🚧 Files skipped from review as they are similar to previous changes (1)
⏰ Context from checks skipped due to timeout of 90000ms (1)
🪧 TipsChatThere are 3 ways to chat with CodeRabbit:
Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments. CodeRabbit Commands (Invoked using PR comments)
Other keywords and placeholders
CodeRabbit Configuration File (
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Actionable comments posted: 0
🧹 Nitpick comments (2)
cloud_gcp/src/test/scala/ai/chronon/integrations/cloud_gcp/GcpFormatProviderTest.scala (1)
19-33
: Consider extracting test constants.+ private val TestSourceUri = "gs://bucket-name/path/to/data/*.parquet" + private val TestTableName = "gs://bucket-name/path/to/data"cloud_gcp/src/main/scala/ai/chronon/integrations/cloud_gcp/GcpFormatProvider.scala (1)
63-71
: Enhance error message for URI validation.- require(uris.size == 1, s"External table ${table} can be backed by only one URI.") + require(uris.size == 1, s"External table ${table.getTableId} must have exactly one source URI, found ${uris.size}.")
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro (Legacy)
📒 Files selected for processing (2)
cloud_gcp/src/main/scala/ai/chronon/integrations/cloud_gcp/GcpFormatProvider.scala
(2 hunks)cloud_gcp/src/test/scala/ai/chronon/integrations/cloud_gcp/GcpFormatProviderTest.scala
(1 hunks)
⏰ Context from checks skipped due to timeout of 90000ms (1)
- GitHub Check: scala_compile_fmt_fix
🔇 Additional comments (3)
cloud_gcp/src/test/scala/ai/chronon/integrations/cloud_gcp/GcpFormatProviderTest.scala (2)
1-18
: LGTM! Clean test setup with proper imports and local SparkSession.
35-39
: LGTM! Comprehensive assertions covering all format properties.cloud_gcp/src/main/scala/ai/chronon/integrations/cloud_gcp/GcpFormatProvider.scala (1)
7-13
: LGTM! Proper imports and appropriate visibility for testing.Also applies to: 57-57
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM thank you for adding UT's!!!!
…ns when building the GCS format (#230) ## Summary basically this PR: https://github.com/zipline-ai/chronon/pull/204/files ## Checklist - [x] Added Unit Tests - [ ] Covered by existing CI - [ ] Integration tested - [ ] Documentation update <!-- This is an auto-generated comment: release notes by coderabbit.ai --> ## Summary by CodeRabbit - **New Features** - Enhanced BigQuery API integration with more granular import statements - Added comprehensive test coverage for external table format handling - **Refactor** - Updated method visibility to improve package-level access - Refined external table processing logic - **Tests** - Introduced new test class for `GcpFormatProvider` - Added test case for URI handling with wildcard scenarios <!-- end of auto-generated comment: release notes by coderabbit.ai -->
…ns when building the GCS format (#230) ## Summary basically this PR: https://github.com/zipline-ai/chronon/pull/204/files ## Checklist - [x] Added Unit Tests - [ ] Covered by existing CI - [ ] Integration tested - [ ] Documentation update <!-- This is an auto-generated comment: release notes by coderabbit.ai --> ## Summary by CodeRabbit - **New Features** - Enhanced BigQuery API integration with more granular import statements - Added comprehensive test coverage for external table format handling - **Refactor** - Updated method visibility to improve package-level access - Refined external table processing logic - **Tests** - Introduced new test class for `GcpFormatProvider` - Added test case for URI handling with wildcard scenarios <!-- end of auto-generated comment: release notes by coderabbit.ai -->
…ns when building the GCS format (#230) ## Summary basically this PR: https://github.com/zipline-ai/chronon/pull/204/files ## Checklist - [x] Added Unit Tests - [ ] Covered by existing CI - [ ] Integration tested - [ ] Documentation update <!-- This is an auto-generated comment: release notes by coderabbit.ai --> ## Summary by CodeRabbit - **New Features** - Enhanced BigQuery API integration with more granular import statements - Added comprehensive test coverage for external table format handling - **Refactor** - Updated method visibility to improve package-level access - Refined external table processing logic - **Tests** - Introduced new test class for `GcpFormatProvider` - Added test case for URI handling with wildcard scenarios <!-- end of auto-generated comment: release notes by coderabbit.ai -->
…ns when building the GCS format (#230) ## Summary basically this PR: https://github.com/zipline-ai/chronon/pull/204/files ## Checklist - [x] Added Unit Tests - [ ] Covered by existing CI - [ ] Integration tested - [ ] Documentation update <!-- This is an auto-generated comment: release notes by coderabbit.ai --> ## Summary by CodeRabbit - **New Features** - Enhanced BigQuery API integration with more granular import statements - Added comprehensive test coverage for external table format handling - **Refactor** - Updated method visibility to improve package-level access - Refined external table processing logic - **Tests** - Introduced new test class for `GcpFormatProvider` - Added test case for URI handling with wildcard scenarios <!-- end of auto-generated comment: release notes by coderabbit.ai -->
…ns when building the GCS format (#230) ## Summary basically this PR: https://github.com/zipline-ai/chronon/pull/204/files ## Cheour clientslist - [x] Added Unit Tests - [ ] Covered by existing CI - [ ] Integration tested - [ ] Documentation update <!-- This is an auto-generated comment: release notes by coderabbit.ai --> ## Summary by CodeRabbit - **New Features** - Enhanced BigQuery API integration with more granular import statements - Added comprehensive test coverage for external table format handling - **Refactor** - Updated method visibility to improve paour clientsage-level access - Refined external table processing logic - **Tests** - Introduced new test class for `GcpFormatProvider` - Added test case for URI handling with wildcard scenarios <!-- end of auto-generated comment: release notes by coderabbit.ai -->
Summary
basically this PR: https://github.com/zipline-ai/chronon/pull/204/files
Checklist
Summary by CodeRabbit
New Features
Refactor
Tests
GcpFormatProvider