High CPU usage in scheduler #2033


Closed
tanjianfeng opened this issue Mar 1, 2020 · 7 comments
Labels: area: performance (Issue related to performance & benchmarks), auto-closed, stale-issue (This issue has not been updated in 120 days), type: bug (Something isn't working)

Comments

@tanjianfeng
Contributor

With the program below, we find that the scheduler shows ~380% CPU usage, vs ~200% in runc.

How to test:
$ docker run -it --cpu-period=1000 --cpu-quota=8000 ...
$ gcc -o threads thread.c -lpthread
$ ./threads 1024 100000

#include <stdio.h>
#include <pthread.h>
#include <sys/time.h>
#include <time.h>
#include <string.h>
#include <unistd.h>
#include <stdlib.h>
#include <stdint.h>

struct worker_arg {
        int seq;
        int iterations;

        pthread_mutex_t count_lock;
        pthread_cond_t count_nonzero;
        unsigned count;
};


static void
park(struct worker_arg *arg)
{
        pthread_mutex_lock(&arg->count_lock);
        while (arg->count == 0)
                pthread_cond_wait(&arg->count_nonzero, &arg->count_lock);
        arg->count--;
        pthread_mutex_unlock(&arg->count_lock);
}

static void
unpark(struct worker_arg *arg)
{
        pthread_mutex_lock(&arg->count_lock);
        if (arg->count == 0)
                pthread_cond_signal(&arg->count_nonzero);
        arg->count++;
        pthread_mutex_unlock(&arg->count_lock);
}

static void *
worker(void *p)
{
        struct worker_arg *arg = (struct worker_arg *)p;

        while (arg->iterations--) {
                /* do some work */
                //printf("%d %d\n", arg->seq, arg->iterations);
                park(arg);
        }

        return NULL;
}

int
main(int argc, char **argv)
{
        int i, j, N, M;
        void *dummy;
        struct worker_arg *args;
        pthread_t *tids;

        if (argc != 3) {
                fprintf(stderr, "Usage: %s <number of threads> <iterations>\n", argv[0]);
                exit(1);
        }

        N = atoi(argv[1]);
        M = atoi(argv[2]);

        tids = malloc(sizeof(*tids) * N);
        args = malloc(sizeof(*args) * N);

        for (i = 0; i < N; i++) {
                args[i].seq = i;
                args[i].iterations = M;

                args[i].count = 0;
                pthread_mutex_init(&args[i].count_lock, NULL);
                pthread_cond_init(&args[i].count_nonzero, NULL);

                pthread_create(&tids[i], NULL, worker, &args[i]);
        }

        for (i = 0; i < M; i++) {
                for (j = 0; j < N; j++)
                        unpark(&args[j]);
        }

        for (i = 0; i < N; i++)
                pthread_join(tids[i], &dummy);

        free(args);
        free(tids);
        return 0;
}
@prattmic prattmic added area: performance Issue related to performance & benchmarks type: bug Something isn't working labels Mar 2, 2020
@prattmic
Member

prattmic commented Mar 2, 2020

When you say 'CPU in the scheduler', I assume you mean the host Linux kernel scheduler?

Which platform (or platforms) do you see this behavior on?

@tanjianfeng
Contributor Author

When you say 'CPU in the scheduler', I assume you mean the host Linux kernel scheduler?

I mean scheduler in the broader sense. The example above uses the futex syscall to wait/wake threads, so it involves (1) the futex implementation in the sentry; (2) the Go scheduler; (3) maybe also the Linux host CFS.

Which platform (or platforms) do you see this behavior on?

The kvm platform.

@prattmic
Member

For reference, this workload tends to suffer from golang/go#43997.

GuhuangLS added a commit to GuhuangLS/go that referenced this issue May 25, 2021
Background: a channel is designed to be a data-transfer pipe between
goroutines. A scenario in gVisor is: block/wake a G (which stands for a
task) by using a channel, where there is no need for data transfer.
A channel is relatively too heavy for this case.
A channel manages Gs in a list, and has a capacity. When the channel is
full, the sender blocks; when the channel is empty, the receiver
blocks. The receiver is pushed onto the chan list, then schedules the
next G to run. The sender uses goready to wake Gs in the chan list.

A summary on our idea: introduce a new set of APIs for goroutine
wake/block.

Details:

We propose three new APIs:

GetG() - used to get the address of the current goroutine. The address
makes it easier to find the goroutine G from the Go program.

WakeG() - can be used to wake one G, which can be in running/blocked
status.

BlockG() - can be used to block goroutine by itself.

How we use this in gVisor:

In futex()/epoll_wait(), we can modify gVisor to use the new mechanism
for block and wake. Between the sentry and the Go runtime, we maintain
the status of task Gs. Using futex as an example, we add a running
status in the Go runtime: NoWake, Waked, Blocked.

At the sentry, one task/G can use BlockG() to block, like <-chan. Other
tasks/Gs can use WakeG() to wake a task/G that is blocked by BlockG(),
like chan <-. Based on a basic prototype of Go and gVisor, we use the
program in google/gvisor#2033(comment) as the test program.

We can see 22% improvement by test case:
google/gvisor#2033.

Signed-off-by: liushi <[email protected]>
GuhuangLS added a commit to GuhuangLS/go that referenced this issue May 26, 2021, with the same commit message; the corresponding gVisor change is GuhuangLS/gvisor@97e0e6c.
@github-actions

A friendly reminder that this issue had no activity for 120 days.

@github-actions github-actions bot added the stale-issue This issue has not been updated in 120 days. label Sep 15, 2023
@ayushr2
Collaborator

ayushr2 commented Sep 15, 2023

golang/go@ecfce58 must have helped with this. Is this still an issue?

@github-actions github-actions bot removed the stale-issue This issue has not been updated in 120 days. label Sep 16, 2023

A friendly reminder that this issue had no activity for 120 days.

@github-actions github-actions bot added the stale-issue This issue has not been updated in 120 days. label Jan 14, 2024

This issue has been closed due to lack of activity.
