Add thread yield function for spin-loops. #82

devinamatthews · 2016-05-30T15:54:20Z

I noticed that TBLIS outperforms BLIS by about 30 GFLOP/s (400 vs. 370) on 12 cores of Lonestar 5 (both using OMP). To try to fix this, I added a pause instruction to the spin-loop on Intel architectures, but alas there was no improvement. In any case, the optional macro BLIS_YIELD that I added in bli_kernel.h might be useful.
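For readers unfamiliar with the technique, here is a minimal sketch of a spin-loop that issues a CPU yield hint on each iteration. It is not the code from this PR: the names SPIN_YIELD and spin_wait are hypothetical, the actual definition of BLIS_YIELD is not shown in this thread, and the AArch64 branch is an assumption about what an ARM equivalent might look like. GCC or Clang is assumed.

```c
#include <stdatomic.h>

/* Hypothetical yield macro (NOT the PR's BLIS_YIELD definition).
 * x86:     the PAUSE instruction via the _mm_pause() intrinsic.
 * AArch64: the YIELD hint instruction (assumed ARM counterpart).
 * Other:   expands to a no-op, so the loop degrades to a plain spin. */
#if defined(__x86_64__) || defined(__i386__)
  #include <immintrin.h>
  #define SPIN_YIELD() _mm_pause()
#elif defined(__aarch64__)
  #define SPIN_YIELD() __asm__ __volatile__("yield" ::: "memory")
#else
  #define SPIN_YIELD() ((void)0)
#endif

/* Spin until *flag becomes nonzero.
 * Returns how many times the loop yielded before the flag was observed. */
static unsigned long spin_wait(atomic_int *flag)
{
    unsigned long spins = 0;
    while (atomic_load_explicit(flag, memory_order_acquire) == 0) {
        SPIN_YIELD();   /* hint to the core that this is a busy-wait */
        ++spins;
    }
    return spins;
}
```

A waiting thread would call spin_wait(&flag) while another thread publishes with atomic_store_explicit(&flag, 1, memory_order_release). The hint mainly benefits a sibling hardware thread sharing the core, which is consistent with no gain being measured when HyperThreads are not in use.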

@fgvanzee, @tlrmchlsmth we should go over the respective codes at some point and figure out why TBLIS is faster.

jeffhammond · 2016-10-31T20:18:41Z

Is the 8% deviation reproducible within a single job, i.e. the same node over a short time interval?

I'll see if I can reproduce locally.

gaming-hacker · 2017-02-20T05:53:41Z

What happened to this? Why wasn't it merged?

devinamatthews · 2017-02-20T12:41:29Z

@gaming-hacker I haven't measured any performance improvement from pause (which isn't too surprising since it is best not to use HyperThreads with BLIS), so there isn't a real rush to merge. Also, it would probably behoove me to add a similar feature for ARM at the same time.

jeffhammond · 2017-02-20T18:56:38Z

FYI, the OpenBLAS developers have been looking at the same issue but don't have a clear conclusion: OpenMathLib/OpenBLAS#1051

Add thread yield function for spin-loops.

2793f70

devinamatthews closed this Aug 6, 2020

devinamatthews mentioned this pull request Jan 17, 2022

[aarch64] possible issue with atomic barrier and generic implementation (lack of good atomic support on generic kernel ?) #588

Closed

jeffhammond mentioned this pull request Jan 28, 2022

thread barriers need backoff #604

Open

devinamatthews mentioned this pull request Jan 31, 2022

Add a progressive backoff mechanism for barriers. #607

Open
