Update JobSubmitter interface to take in clusterName and return submission status #697
Conversation
Walkthrough
The changes update several components to improve the precision and flexibility of scaling parameters and job submission handling.
Changes
Sequence Diagram(s)
sequenceDiagram
participant User
participant JobSubmitter
participant EmrSubmitter
participant DataprocSubmitter
User->>JobSubmitter: status(jobId)
JobSubmitter->>EmrSubmitter: status(jobId) (if EMR)
EmrSubmitter-->>JobSubmitter: String (job status)
JobSubmitter->>DataprocSubmitter: status(jobId) (if Dataproc)
DataprocSubmitter-->>JobSubmitter: String (job status)
JobSubmitter-->>User: String (job status)
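The flow in the diagram can be sketched roughly as follows. This is a minimal illustration, not the code from the PR: the `Sketch*` names, the `String` type of `jobId`, the `lookup` constructor parameter, and the hard-coded status values are all assumptions made for the example. The only thing taken from the diagram is that `status(jobId)` returns a `String` regardless of backend.

```scala
// Hypothetical sketch of the diagram above. Only the shape
// `status(jobId) => String` comes from the PR; everything else is assumed.
trait SketchJobSubmitter {
  // Returns the backend's job state as a plain string.
  def status(jobId: String): String
}

// Stand-in for an EMR-backed submitter; a real one would call the EMR API.
class SketchEmrSubmitter(lookup: String => String) extends SketchJobSubmitter {
  override def status(jobId: String): String = lookup(jobId)
}

// Stand-in for a Dataproc-backed submitter; a real one would call Dataproc.
class SketchDataprocSubmitter(lookup: String => String) extends SketchJobSubmitter {
  override def status(jobId: String): String = lookup(jobId)
}

object StatusDemo {
  // Pick a submitter by backend name; the caller gets a String either way.
  def submitterFor(backend: String): SketchJobSubmitter = backend match {
    case "emr"      => new SketchEmrSubmitter(_ => "RUNNING")
    case "dataproc" => new SketchDataprocSubmitter(_ => "DONE")
    case other      => throw new IllegalArgumentException(s"unknown backend: $other")
  }
}
```

Because both implementations share the same return type, a caller such as the agent does not need to know which backend produced the status.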
@@ -30,17 +30,17 @@ case class GeneralJob(
class DataprocSubmitter(jobControllerClient: JobControllerClient, conf: SubmitterConf) extends JobSubmitter {
Should there be an equivalent change for Dataproc to accept clusterName?
It gets set in conf, which should allow us to control it similarly:
https://github.com/zipline-ai/chronon/pull/697/files#diff-b4725870e2c1d47332c88a029f3d55b2a0badcf2badc4dd1c8bc7e4b04c605bcR74
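To make the distinction in this exchange concrete, here is a small sketch of the two ways a cluster name can reach a submitter: fixed in a configuration object at construction time (as the reply describes for Dataproc), or passed with each submission (as the PR adds for EMR). All class, field, and method names below are hypothetical, and the fallback to a default cluster is an assumption rather than something the PR states.

```scala
// Hypothetical sketch; none of these names are taken from the PR.
final case class SketchSubmitterConf(clusterName: String)

// Conf-scoped style: the cluster is chosen when the submitter is built.
class ConfScopedSubmitter(conf: SketchSubmitterConf) {
  def targetCluster: String = conf.clusterName
}

// Per-call style: the cluster may be named on each submission.
// Falling back to a default is an assumption made for this example.
class PerCallSubmitter(defaultCluster: String) {
  def targetCluster(clusterName: Option[String] = None): String =
    clusterName.getOrElse(defaultCluster)
}
```

In the conf-scoped style, targeting a different cluster means constructing a submitter with a different conf, which is one reading of "control it similarly".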
…ssion status (#697)

## Summary
This is needed so the agent can track the status of submitted jobs and report it back to the orchestration service.

## Checklist
- [ ] Added Unit Tests
- [x] Covered by existing CI
- [ ] Integration tested
- [ ] Documentation update

## Summary by CodeRabbit
- **New Features**
  - Added support for specifying a custom cluster name when submitting EMR jobs.
- **Improvements**
  - Scaling factors for auto-scaling now support decimal values, allowing more precise scaling adjustments.
  - Job status methods now return status as a string, making it easier to programmatically track job progress and errors.
Summary
This is needed so the agent can track the status of submitted jobs and report it back to the orchestration service.
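As a rough illustration of why a string status helps the agent, the sketch below polls a status source and forwards each observation until a terminal state is seen. It is not the agent's actual code: `StatusPoller`, `pollUntilDone`, the `fetch` and `report` callbacks, and the set of terminal state names are all assumptions, and real EMR and Dataproc state names differ from each other.

```scala
import scala.annotation.tailrec

// Hypothetical sketch of an agent-side polling loop; all names are assumed.
object StatusPoller {
  // Assumed terminal states; real backends use their own state names.
  val terminal: Set[String] = Set("SUCCEEDED", "FAILED", "CANCELLED")

  // Calls `fetch` up to `remaining` times, passing every status to `report`.
  // Returns the terminal status if one was seen, otherwise None.
  @tailrec
  def pollUntilDone(fetch: () => String,
                    report: String => Unit,
                    remaining: Int): Option[String] = {
    if (remaining <= 0) None
    else {
      val current = fetch()
      report(current)
      if (terminal.contains(current)) Some(current)
      else pollUntilDone(fetch, report, remaining - 1)
    }
  }
}
```

A real agent would presumably wait between polls and send `report` to the orchestration service; both are left out here to keep the sketch self-contained.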
Checklist
Summary by CodeRabbit
New Features
Improvements