
Listing and extracting entries of a very large archive #564


Closed
struffel opened this issue Apr 16, 2025 · 2 comments

@struffel

I have the following code for listing the contents of an archive, given its URL:

import * as zip from "@zip.js/zip.js";

// List the entries of a remote archive over HTTP
const zipReader = new zip.ZipReader(new zip.HttpReader(fileUrl));
const entries = await zipReader.getEntries();
console.log(entries);

This works well for small and medium-sized files, but for large archives getEntries() fails with a very ambiguous "Failed to fetch" error.
The threshold seems to be at about 2 GiB, which appears to match the blob size limit in Google Chrome.

Could this be the reason for the failure, and is this even the right approach for reading large files on the client side?
I have tried a few of the HttpOptions, but enabling chunking had the side effect of creating a ridiculous number of requests of 500 kB each, which is far too small and carries a severe performance penalty when it comes to actually extracting the files (the intended next step).
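
For reference, the kind of chunked setup I mean looks roughly like this (simplified; useRangeHeader stands in for whichever HttpOptions flag enables chunking, and 500 kB is close to zip.js's default chunk size):

// With useRangeHeader, HttpReader fetches the file in chunkSize-sized Range
// requests (default ~512 kB), hence the very large number of small requests.
const rangeEnabledReader = new zip.HttpReader(fileUrl, { useRangeHeader: true });
const entries = await new zip.ZipReader(rangeEnabledReader).getEntries();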

Sample link for testing:
https://f003.backblazeb2.com/file/ambientCG-Web/download/Ground054_5jRbQ1OF/Ground054_12K-PNG.zip

@gildas-lormeau
Owner

gildas-lormeau commented Apr 17, 2025

This is because the entries (the central directory) are located at the end of the file. If the server supports range requests (see the curl command below), you just need to replace HttpReader with HttpRangeReader in your code to fix the issue. You can increase the chunk size (the "500kb" you mention) by calling zip.configure({ chunkSize: <desired size in bytes> }).

curl -I https://f003.backblazeb2.com/file/ambientCG-Web/download/Ground054_5jRbQ1OF/Ground054_12K-PNG.zip
HTTP/1.1 200 
[...]
Accept-Ranges: bytes
[...]
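
In code, the suggested change amounts to something like this (a minimal sketch; the 1 MiB chunk size is only an example value):

import * as zip from "@zip.js/zip.js";

// Larger chunks mean fewer range requests (1 MiB here; pick a value that suits you)
zip.configure({ chunkSize: 1024 * 1024 });

// HttpRangeReader issues Range requests, so listing entries only downloads the
// central directory at the end of the archive
const zipReader = new zip.ZipReader(new zip.HttpRangeReader(fileUrl));
const entries = await zipReader.getEntries();
console.log(entries);
await zipReader.close();

Extracting an individual entry afterwards should then only download the byte ranges covering that entry.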

@struffel
Author

Thanks for the tip about configuring the chunk size; I will try that out.

Repository owner locked and limited conversation to collaborators Apr 22, 2025
@gildas-lormeau gildas-lormeau converted this issue into discussion #565 Apr 22, 2025
