
chunked: allow conversion without zstd compression #2343


Conversation

giuseppe (Member) commented Jun 5, 2025

This commit introduces the capability to convert tar layers to the zstd:chunked format, without performing any zstd compression.

This allows using zstd:chunked without the cost of compression and decompression.

Closes: https://issues.redhat.com/browse/RUN-3056
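
As background, here is a minimal, self-contained sketch of the format-level idea (it is not the code added by this PR): the zstd frame format permits raw, uncompressed blocks, so data can be wrapped byte for byte in a valid zstd stream that any decoder accepts. The sketch only handles payloads of up to 255 bytes (so a 1-byte frame-content-size field is enough) and verifies the result with the klauspost decoder that containers/storage already vendors.

package main

import (
	"bytes"
	"encoding/binary"
	"fmt"
	"log"

	"github.com/klauspost/compress/zstd"
)

// rawZstdFrame wraps data (at most 255 bytes) in a single-segment zstd frame that
// contains one raw (uncompressed) block.
func rawZstdFrame(data []byte) []byte {
	if len(data) > 255 {
		panic("this sketch only handles payloads up to 255 bytes")
	}
	var buf bytes.Buffer
	buf.Write([]byte{0x28, 0xB5, 0x2F, 0xFD}) // zstd magic number (little-endian 0xFD2FB528)
	buf.WriteByte(0x20)                       // frame header descriptor: Single_Segment set, 1-byte content size
	buf.WriteByte(byte(len(data)))            // Frame_Content_Size
	// Block header (3 bytes, little-endian): bit 0 = last block, bits 1-2 = block type
	// (0 = raw), remaining bits = block size.
	var hdr [4]byte
	binary.LittleEndian.PutUint32(hdr[:], uint32(len(data))<<3|1)
	buf.Write(hdr[:3])
	buf.Write(data) // the payload itself, stored verbatim
	return buf.Bytes()
}

func main() {
	frame := rawZstdFrame([]byte("stored, not compressed"))
	dec, err := zstd.NewReader(nil)
	if err != nil {
		log.Fatal(err)
	}
	defer dec.Close()
	out, err := dec.DecodeAll(frame, nil)
	if err != nil {
		log.Fatal(err)
	}
	fmt.Printf("%s\n", out) // prints the original payload
}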

openshift-ci bot (Contributor) commented Jun 5, 2025

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: giuseppe

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci openshift-ci bot added the approved label Jun 5, 2025
@giuseppe giuseppe force-pushed the convert-to-zstd-without-compression branch 4 times, most recently from f593118 to f6a2815 on June 5, 2025 at 21:43
@giuseppe giuseppe changed the title from "[WIP] chunked: allow conversion without zstd compression" to "chunked: allow conversion without zstd compression" on Jun 5, 2025
mtrmac (Collaborator) left a comment


Just an extremely brief skim…

This might well make sense as an intermediate step. For full performance, wouldn't we want to entirely avoid copyAllBlobToFile + convertTarToZstdChunked?

“Just” stream the incoming tar file as it comes; for every regular file, store it somewhere inside the staging destination, checksum it, and either hardlink/reflink it with a pre-existing one, or move it in place if it is entirely new. (Yes, that would be a third implementation of setting file metadata, along with pkg/archive (maybe there are even more??), and we’d really want to consolidate them.)

Alternatively, after #2325 , wouldn’t it be fairly easy to add chunked-like pre-staging of layer contents without holding locks to the regular pull path? And if we did that, could we teach pkg/archive to build the composefs structures ~directly? I think that (“just”, again) requires computing MeasureVerity, ordinary digests for RegularFilePathForValidatedDigest, and the tar header metadata.

Unlike truly-chunked pulls, on the “convert” path (and, also, for containers/image#2792 ), there’s not that much benefit in trying not to read files, so the hole / roll-sum data computation can, I think (?) be entirely avoided; we only want to match entire files, by digest, for reflinks/hardlinks.

I know, “just”…
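
A rough, self-contained sketch of the streaming approach described above; it is not the code in this PR, the storeRoot/"by-digest" layout is hypothetical, and it deliberately skips path sanitization, file metadata (owners, modes, xattrs, timestamps), and registering newly seen files in the digest index.

package layersketch

import (
	"archive/tar"
	"crypto/sha256"
	"encoding/hex"
	"io"
	"os"
	"path/filepath"
)

// stageLayer walks the incoming tar stream once; every regular file is written to a
// temporary location while its digest is computed, then either hardlinked to an
// already-known identical file (a reflink copy would work equally well) or moved
// into place if it is new.
func stageLayer(tr *tar.Reader, storeRoot, stagingDir string) error {
	for {
		hdr, err := tr.Next()
		if err == io.EOF {
			return nil
		}
		if err != nil {
			return err
		}
		if hdr.Typeflag != tar.TypeReg {
			continue // directories, symlinks, etc. are handled elsewhere in a real implementation
		}
		tmp, err := os.CreateTemp(stagingDir, "blob-*")
		if err != nil {
			return err
		}
		digester := sha256.New()
		_, err = io.Copy(io.MultiWriter(tmp, digester), tr)
		tmp.Close()
		if err != nil {
			return err
		}
		digest := hex.EncodeToString(digester.Sum(nil))

		target := filepath.Join(stagingDir, filepath.FromSlash(hdr.Name))
		if err := os.MkdirAll(filepath.Dir(target), 0o755); err != nil {
			return err
		}
		// Reuse an identical file if one is already present in the (hypothetical)
		// by-digest index; otherwise the freshly written copy becomes the file.
		if err := os.Link(filepath.Join(storeRoot, "by-digest", digest), target); err == nil {
			os.Remove(tmp.Name())
			continue
		}
		if err := os.Rename(tmp.Name(), target); err != nil {
			return err
		}
	}
}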

giuseppe (Member, Author) commented Jun 6, 2025

> This might well make sense as an intermediate step. For full performance, wouldn't we want to entirely avoid copyAllBlobToFile + convertTarToZstdChunked?

I've done some benchmarking and the cost of writing the tarball is minimal compared to computing the checksum of each file/chunk.

It is still useful to compute the chunks because we fill the cache with this information and future downloads can use these chunks.

I agree it would look cleaner to work directly on the tar stream, but the implementation cost is much higher because we would need to duplicate a lot of code paths.
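
Purely as an illustration of that point (the types and names below are hypothetical, not the actual cache in pkg/chunked): the conversion step records where each chunk digest lives, and a later pull consults the same map before downloading anything.

package chunkcache

type chunkLocation struct {
	LayerID string
	Offset  int64
	Length  int64
}

type chunkCache struct {
	byDigest map[string]chunkLocation
}

// record is called while converting or creating a layer, once per chunk.
func (c *chunkCache) record(digest string, loc chunkLocation) {
	if c.byDigest == nil {
		c.byDigest = map[string]chunkLocation{}
	}
	c.byDigest[digest] = loc
}

// lookup is called on a later pull: a hit means the chunk can be copied from a
// local layer instead of being fetched from the registry.
func (c *chunkCache) lookup(digest string) (chunkLocation, bool) {
	loc, ok := c.byDigest[digest]
	return loc, ok
}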

@giuseppe giuseppe force-pushed the convert-to-zstd-without-compression branch from f6a2815 to a7455c2 on June 6, 2025 at 07:42
@giuseppe giuseppe marked this pull request as ready for review June 6, 2025 07:45
Comment on lines 946 to 957
if part != nil {
    limit := mf.CompressedSize
    // If we are reading from a source file, use the uncompressed size to limit the reader, because
    // the compressed size refers to the original layer stream.
    if missingPart.OriginFile != nil && partCompression == fileTypeNoCompression {
        limit = mf.UncompressedSize
    }
    c.rawReader = io.LimitReader(part, limit)
}
mtrmac (Collaborator) commented Jun 6, 2025


I’m generally unhappy with the complexity. I feel that there must be some way to express this simpler.

I think that conflating compressedFileType to mean both “format of the file” and “origin+format of this chunk” is confusing, although it seems somewhat convenient for this PR.

We have the if c.rawReader != nil code to discard remaining data even for OriginFile chunks, where we don’t actually need to do that.

This part that duplicates the OriginFile condition could certainly be done without duplicating the condition, e.g. setting an extra readingFromALocalFile boolean in the case missingPart.OriginFile != nil section.


As a possible starting point, is there any need to split the prepareCompressedStreamToFile / appendCompressedStreamToFile code into two parts, and to interleave the openDestinationFile in the middle? I am quite possibly missing something, but I don’t see why that is necessary. And if it is not, combining the two …StreamToFile into a single function with a single switch would be simpler.

I’m afraid I can’t predict the full scope of how the refactoring could be done; I can only see one or two steps ahead.
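
For illustration, a self-contained sketch of the single-switch shape suggested above; the type and constant names are stand-ins rather than the actual identifiers in pkg/chunked, and the caller is assumed to have already limited the reader to the size of the chunk.

package streamsketch

import (
	"compress/gzip"
	"errors"
	"io"

	"github.com/klauspost/compress/zstd"
)

type compressedFileType int

const (
	fileTypeNoCompression compressedFileType = iota
	fileTypeZstdChunked
	fileTypeEstargz
)

var errUnknownCompression = errors.New("unknown layer compression")

// streamToFile copies one chunk into dest, decompressing as needed, instead of
// splitting the work across prepare/append halves.
func streamToFile(t compressedFileType, r io.Reader, dest io.Writer) error {
	switch t {
	case fileTypeNoCompression:
		// Uncompressed chunk: copy the bytes through unchanged.
		_, err := io.Copy(dest, r)
		return err
	case fileTypeZstdChunked:
		zr, err := zstd.NewReader(r)
		if err != nil {
			return err
		}
		defer zr.Close()
		_, err = io.Copy(dest, zr)
		return err
	case fileTypeEstargz:
		// estargz chunks are gzip-compressed.
		gz, err := gzip.NewReader(r)
		if err != nil {
			return err
		}
		defer gz.Close()
		_, err = io.Copy(dest, gz)
		return err
	default:
		return errUnknownCompression
	}
}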

mtrmac (Collaborator) commented Jun 6, 2025

> I've done some benchmarking and the cost of writing the tarball is minimal compared to computing the checksum of each file/chunk.

Thanks, that’s interesting.

> It is still useful to compute the chunks because we fill the cache with this information and future downloads can use these chunks.

Wait … we are writing this not-really-zstd data to durable storage as bigDataKey? … Ultimately I don’t see that it breaks anything, but I didn’t realize that’s one of the consequences.

Flush() is only called before Close(), so it has no effect.

Signed-off-by: Giuseppe Scrivano <[email protected]>
@giuseppe giuseppe force-pushed the convert-to-zstd-without-compression branch from a7455c2 to 92ce27a on June 9, 2025 at 09:42
mtrmac (Collaborator) left a comment


I’d really like storeMissingFiles to be simplified further, but I guess with the c.rawReader = io.LimitReader consolidation, this PR is, on net, not making things worse.

@giuseppe giuseppe force-pushed the convert-to-zstd-without-compression branch 2 times, most recently from f5b395e to b3ada72 on June 9, 2025 at 11:05
giuseppe added 3 commits June 9, 2025 13:17
The reader for the part is now limited before calling
prepareCompressedStreamToFile().

Signed-off-by: Giuseppe Scrivano <[email protected]>
a new function NoCompression() is added to provide a way to create
uncompressed zstd:chunked files.

Signed-off-by: Giuseppe Scrivano <[email protected]>
This commit introduces the capability to convert tar layers
to the zstd:chunked format, without performing any zstd compression.

This allows using zstd:chunked without the cost of compression and
decompression.

Closes: https://issues.redhat.com/browse/RUN-3056

Signed-off-by: Giuseppe Scrivano <[email protected]>
@giuseppe giuseppe force-pushed the convert-to-zstd-without-compression branch from b3ada72 to 87c6994 on June 9, 2025 at 11:17
mtrmac (Collaborator) left a comment


LGTM, with a bit of a heavy heart about the complexity of storeMissingFiles.

rhatdan (Member) commented Jun 9, 2025

/lgtm

@openshift-ci openshift-ci bot added the lgtm label Jun 9, 2025
@openshift-merge-bot openshift-merge-bot bot merged commit e1679c1 into containers:main Jun 9, 2025
20 checks passed