Add a lock around cgroupd communication. #2620
Conversation
Force-pushed from 5fe25b7 to ae55633
PTAL
There are three empty lines that have tabs at the beginning, so the linter is complaining.
I think the commit title should be "restorer: Add a lock around cgroupd communication" to follow the project's convention.
These are nits. Overall, the approach LGTM.
Force-pushed from ae55633 to a361b71
Thanks, PTAL. I tried running 'make lint', but it wants to run 'ruff', which isn't available on Ubuntu 24.
Threads are put into cgroups through the cgroupd thread, which communicates with other threads using a socketpair. Previously, each thread received a dup'd copy of the socket and did the following:

    sendmsg(socket_dup_fd, my_cgroup_set);

    // wait for ack.
    while (1) {
        recvmsg(socket_dup_fd, &h, MSG_PEEK);
        if (h.pid != my_pid)
            continue;
        recvmsg(socket_dup_fd, &h, 0);
        break;
    }
    close(socket_dup_fd);

When restoring many threads, most of them would be spinning in the loop above, waiting for their PID to appear. In my test case, restoring a process with an 11.5G heap and 491 threads could take anywhere between 10 and 60 seconds to complete.

To avoid the spinning, we drop the loop and MSG_PEEK, and add a lock around the above code. This does not decrease parallelism, as the cgroupd daemon uses a single thread anyway.

With the lock in place, the same restore consistently takes around 10 seconds on my machine (Thinkpad P14s, AMD Ryzen 8840HS).

There is a similar "daemon" thread for user namespaces. That is already protected by a similar userns_sync_lock in __userns_call().

Fixes checkpoint-restore#2614

Signed-off-by: Han-Wen Nienhuys <[email protected]>
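For illustration, a rough sketch of the locked variant described above, kept in the same pseudocode style as the snippet in the commit message. The lock name (cgroupd_sk_lock) and the mutex helpers are placeholders, not necessarily the identifiers used in the actual patch; the point is that the request and its ack become one critical section, so the MSG_PEEK retry loop is no longer needed:

    /* Sketch only: serialize the request/ack exchange with cgroupd.
     * cgroupd_sk_lock is a placeholder name for the new lock. */
    mutex_lock(&cgroupd_sk_lock);

    sendmsg(socket_dup_fd, my_cgroup_set);

    /* The daemon answers one request at a time, so the next message on
     * this socket is necessarily the ack for our request; no MSG_PEEK
     * filtering by PID is required. */
    recvmsg(socket_dup_fd, &h, 0);

    mutex_unlock(&cgroupd_sk_lock);
    close(socket_dup_fd);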
Force-pushed from a361b71 to 3240a17
Have you tried …
Merged. Thanks a lot.
Threads are put into cgroups through the cgroupd thread, which communicates with other threads using a socketpair. Each thread received a dup'd copy of the socket and did the sendmsg()/recvmsg(MSG_PEEK) sequence shown in the snippet above.
When restoring many threads, most threads would be spinning in the above loop waiting for their PID to appear.
In my test case, restoring a process with an 11.5G heap and 491 threads could take anywhere between 10 and 60 seconds to complete.
To avoid the spinning, we drop the loop and MSG_PEEK, and add a lock around the above code. This does not decrease parallelism, as the cgroupd daemon uses a single thread anyway (see the daemon-loop sketch after this description).
With the lock in place, the same restore takes a consistent 9.2 seconds on my machine (Thinkpad P14s, AMD Ryzen 8840HS).
There is a similar "daemon" thread for user namespaces. That is already protected by a similar userns_sync_lock in __userns_call().
Fixes #2614
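To illustrate why the lock costs no parallelism: on the other end of the socketpair, the cgroupd daemon already services requests one at a time. The loop below is a sketch in the same pseudocode style, not CRIU's actual code; the socket name and the apply_cg_set() helper are illustrative placeholders:

    /* Illustrative daemon loop: one request at a time on the socketpair. */
    for (;;) {
        recvmsg(cgroupd_sk, &h, 0);       /* next thread's cg_set request */
        apply_cg_set(h.pid, h.cg_set);    /* hypothetical helper: move the pid into its cgroup */
        sendmsg(cgroupd_sk, &ack);        /* ack back to that thread */
    }

Since requests are handled strictly sequentially here anyway, letting the client threads queue on a lock instead of spinning on MSG_PEEK changes only who waits, not how much work runs in parallel.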