Split up library builds into individual builder stages to preserve layer cache #343

lightswitch05 · 2025-02-24T05:07:41Z

Why

Hello! I've noticed the docker image is quite hefty, and I was curious if I could improve it a bit. After taking a look, I realized I couldn't help much with reducing the image size 😆 .... but I thought it might be possible to improve layer reuse between builds. In the end, this feature branch is only 7.98mb smaller then whats on master, but I believe layer reuse is now a possibility depending on how the builds and caching are set up.

If no one thinks this PR provides any value, that’s no problem! It does introduce a bit more complexity, so I totally understand. Anyway, on to the changes I made:

What

I've moved each major build phase into its own builder stage using multi-stage builds: ssocr, pip, libcec, PicoTTS, and Telldus. The results of those builder stages are then copied out into the 'main' stage. All temporary files were already being pruned nicely, so again, no real space savings. However, using the COPY --link command from the builder stages enables this cool docker feature:

Use --link to reuse already built layers in subsequent builds with --cache-from even if the previous layers have changed. This is especially important for multi-stage builds where a COPY --from statement would previously get invalidated if any previous commands in the same stage changed, causing the need to rebuild the intermediate stages again. With --link the layer the previous build generated is reused and merged on top of the new layers. This also means you can easily rebase your images when the base images receive updates, without having to execute the whole build again.

So, if you need to bump a version in requirements.txt, using COPY --link will allow those other layer - like ssocr - to remain unchanged. Pretty cool! If this PR works as expected, I hope that the next time I run docker compose pull, it will require fewer layers to be pulled.

Now... there is a bit of a gotcha with all this. This caching logic only works if the builds are correctly set up with caching. For example, docker-compose builds cannot create a multi-stage build cache. Looking around, I see buildx is being used over at home-assistant/builder/, but there was a lot of logic going on, and I couldn't quite follow it all.

So, there's a chance some follow-up changes might be needed before the benefits of this PR can be realized - for example, using cache-to and ensuring mode=max is set to enable the mutli-stage build cache. But one step at a time - if you all think this is an improvement worth making, we can iterate from here.

Testing

For testing, I ran the build and verified that it runs. However, that doesn’t fully confirm that the libraries I modified are still being installed correctly. Some follow-up work is definitely required to verify everything is functioning as expected.

…yer cache

home-assistant

Hi @lightswitch05

It seems you haven't yet signed a CLA. Please do so here.

Once you do that we will be able to review and accept this pull request.

Thanks!

home-assistant · 2025-02-24T05:07:44Z

Please take a look at the requested changes, and use the Ready for review button when you are done, thanks 👍

Learn more about our pull request process.

Stale

lightswitch05 · 2025-02-24T14:28:12Z

Dockerfile

+WORKDIR /tmp/
+COPY patches/libcec-fix-null-return.patch /tmp/
+COPY patches/libcec-python313.patch /tmp/
+RUN --mount=type=cache,target=/etc/apk/cache,sharing=locked,id=libcec-builder-${BUILD_FROM}-${LIBCEC_VERSION} \


Hmm... looks like I might have used to wrong cache directory... based on the docs it seems like /etc/apk/cache is what things write to, but its just a simlink to /var/cache/apk. I can just add a cache mount for each directory and call it a day.

I'm also thinking it might be fine to re-use that cache between each builder instead of having unique cache for each one.

Split up library builds into individual builder stages to preserve la…

ed4fad4

…yer cache

home-assistant bot added the cla-needed label Feb 24, 2025

home-assistant bot previously requested changes Feb 24, 2025

View reviewed changes

home-assistant bot marked this pull request as draft February 24, 2025 05:07

lightswitch05 marked this pull request as ready for review February 24, 2025 05:07

home-assistant bot added cla-recheck cla-signed and removed cla-recheck cla-needed labels Feb 24, 2025

lightswitch05 commented Feb 24, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Split up library builds into individual builder stages to preserve layer cache #343

Split up library builds into individual builder stages to preserve layer cache #343

lightswitch05 commented Feb 24, 2025

home-assistant bot left a comment

home-assistant bot commented Feb 24, 2025

lightswitch05 Feb 24, 2025 •

edited

Loading

Split up library builds into individual builder stages to preserve layer cache #343

Are you sure you want to change the base?

Split up library builds into individual builder stages to preserve layer cache #343

Conversation

lightswitch05 commented Feb 24, 2025

Why

What

Testing

home-assistant bot left a comment

Choose a reason for hiding this comment

home-assistant bot commented Feb 24, 2025

lightswitch05 Feb 24, 2025 • edited Loading

Choose a reason for hiding this comment

lightswitch05 Feb 24, 2025 •

edited

Loading