-
Notifications
You must be signed in to change notification settings - Fork 657
feat(iceberg): introduce small file compaction #22527
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Let's merge this PR quickly, because missing register_jvm_builder
would cause iceberg compactor failure.
…nto li0k/iceberg-small-file-compact
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Rest LGTM
|
||
if table.append_only { | ||
sink_with.insert("type".to_owned(), "append-only".to_owned()); | ||
} else { | ||
sink_with.insert("type".to_owned(), "upsert".to_owned()); | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
For append_only iceberg table, we should remove the sink_with.insert("primary_key".to_owned(), pks.join(","));
I hereby agree to the terms of the RisingWave Labs, Inc. Contributor License Agreement.
What's changed and what's your intention?
This PR introduces Small Data File Compaction and modifies the default behavior of the append-only Iceberg sink. It switches the default from Full Compaction to Small Data File Compaction to reduce write amplification and space amplification.
Checklist
Documentation
Release note