Skip to content

add building compute-runtime UMD in benchmarks jobs #2577

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Jan 17, 2025
Merged

Conversation

pbalcer
Copy link
Contributor

@pbalcer pbalcer commented Jan 16, 2025

No description provided.

@pbalcer pbalcer requested a review from a team as a code owner January 16, 2025 17:38
Copy link
Contributor

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/12814518956

@github-actions github-actions bot added the ci/cd Continuous integration/devliery label Jan 16, 2025
Copy link
Contributor

Compute Benchmarks level_zero run ():
https://github.com/oneapi-src/unified-runtime/actions/runs/12814518956
Job status: success. Test status: success.

Summary

Total 83 benchmarks in mean.
Geomean 98.284%.
Improved 4 Regressed 24 (threshold 2.00%)

(result is better)

Performance change in benchmark groups

Relative perf in group api (11): 98.747%
Benchmark This PR baseline Relative perf Change -
api_overhead_benchmark_ur SubmitKernel out of order 15.705000 μs 15.896 μs 101.22% 1.22% .
api_overhead_benchmark_ur SubmitKernel in order 16.558000 μs 16.663 μs 100.63% 0.63% .
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024 1.674000 μs 1.675 μs 100.06% 0.06% .
api_overhead_benchmark_l0 SubmitKernel out of order 11.582 μs 11.528000 μs 99.53% -0.47% .
api_overhead_benchmark_sycl SubmitKernel in order 25.082 μs 24.844000 μs 99.05% -0.95% .
api_overhead_benchmark_sycl SubmitKernel out of order 23.944 μs 23.678000 μs 98.89% -1.11% .
api_overhead_benchmark_ur SubmitKernel out of order CPU count 105483.000 instr 101923.000000 instr 96.63% -3.37% -
api_overhead_benchmark_ur SubmitKernel in order CPU count 110835.000 instr 107041.000000 instr 96.58% -3.42% -
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024 2.200 μs 2.118000 μs 96.27% -3.73% -
api_overhead_benchmark_ur SubmitKernel in order with measure completion CPU count 124015.000000 instr -
api_overhead_benchmark_ur SubmitKernel in order with measure completion 21.533000 μs -
Relative perf in group memory (4): 85.407%
Benchmark This PR baseline Relative perf Change -
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024 257.869 μs 253.805000 μs 98.42% -1.58% .
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240 3.056 GB/s 3.151000 GB/s 96.99% -3.01% -
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024 5.962 μs 5.638000 μs 94.57% -5.43% -
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024 225.520 μs 132.929000 μs 58.94% -41.06% ----------
Relative perf in group miscellaneous (1): 100.000%
Benchmark This PR baseline Relative perf Change -
miscellaneous_benchmark_sycl VectorSum 858.609000 bw GB/s 858.609 bw GB/s 100.00% 0.00% .
Relative perf in group multithread (10): 98.165%
Benchmark This PR baseline Relative perf Change -
multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:1 dstUSM:1 16958.141000 μs 17316.620 μs 102.11% 2.11% +
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:1 dstUSM:1 47437.164000 μs 47907.007 μs 100.99% 0.99% .
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:1 dstUSM:1 6891.437000 μs 6935.535 μs 100.64% 0.64% .
multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:1 dstUSM:1 2043.209 μs 2022.915000 μs 99.01% -0.99% .
multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:0 dstUSM:1 8708.383 μs 8555.721000 μs 98.25% -1.75% .
multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:4, allocSize:1024 srcUSM:0 dstUSM:1 without events 111169.587 μs 108338.415000 μs 97.45% -2.55% -
multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:0 dstUSM:1 1191.024 μs 1157.521000 μs 97.19% -2.81% -
multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:1, allocSize:1024 srcUSM:0 dstUSM:1 without events 42636.515 μs 40973.625000 μs 96.10% -3.90% -
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:0 dstUSM:1 26765.936 μs 25543.132000 μs 95.43% -4.57% -
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:0 dstUSM:1 7865.020 μs 7452.758000 μs 94.76% -5.24% -
Relative perf in group Velocity-Bench (9): 99.521%
Benchmark This PR baseline Relative perf Change -
Velocity-Bench Easywave 228.000000 ms 229.000 ms 100.44% 0.44% .
Velocity-Bench QuickSilver 117.640000 MMS/CTT 117.490 MMS/CTT 100.13% 0.13% .
Velocity-Bench Sobel Filter 603.242 ms 602.045000 ms 99.80% -0.20% .
Velocity-Bench Bitcracker 35.254 s 35.129800 s 99.65% -0.35% .
Velocity-Bench CudaSift 203.197 ms 201.142000 ms 98.99% -1.01% .
Velocity-Bench Hashtable 355.764 M keys/sec 362.504819 M keys/sec 98.14% -1.86% .
Velocity-Bench dl-cifar - 23.743900 s
Velocity-Bench dl-mnist - 2.720000 s
Velocity-Bench svm - 0.139900 s
Relative perf in group Runtime (8): 100.459%
Benchmark This PR baseline Relative perf Change -
Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor 271.946000 ms 278.916 ms 102.56% 2.56% +
Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor 272.793000 ms 278.736 ms 102.18% 2.18% +
Runtime_DAGTaskThroughput_BasicParallelFor 1724.390000 ms 1746.233 ms 101.27% 1.27% .
Runtime_DAGTaskThroughput_HierarchicalParallelFor 1707.455000 ms 1725.256 ms 101.04% 1.04% .
Runtime_DAGTaskThroughput_SingleTask 1667.011000 ms 1678.732 ms 100.70% 0.70% .
Runtime_DAGTaskThroughput_NDRangeParallelFor 1689.333000 ms 1695.816 ms 100.38% 0.38% .
Runtime_IndependentDAGTaskThroughput_SingleTask 264.721 ms 259.395000 ms 97.99% -2.01% .
Runtime_IndependentDAGTaskThroughput_BasicParallelFor 281.980 ms 275.382000 ms 97.66% -2.34% -
Relative perf in group MicroBench (14): 97.193%
Benchmark This PR baseline Relative perf Change -
MicroBench_HostDeviceBandwidth_1D_H2D_Strided 4.518000 ms 4.547 ms 100.64% 0.64% .
MicroBench_HostDeviceBandwidth_2D_D2H_Strided 617.206000 ms 617.523 ms 100.05% 0.05% .
MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous 617.773000 ms 617.994 ms 100.04% 0.04% .
MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous 617.770000 ms 617.954 ms 100.03% 0.03% .
MicroBench_HostDeviceBandwidth_3D_D2H_Strided 617.095000 ms 617.254 ms 100.03% 0.03% .
MicroBench_LocalMem_int32_4096 29.902 ms 29.866000 ms 99.88% -0.12% .
MicroBench_LocalMem_fp32_4096 29.877 ms 29.833000 ms 99.85% -0.15% .
MicroBench_HostDeviceBandwidth_1D_D2H_Strided 4.759 ms 4.702000 ms 98.80% -1.20% .
MicroBench_HostDeviceBandwidth_3D_H2D_Strided 4.707 ms 4.574000 ms 97.17% -2.83% -
MicroBench_HostDeviceBandwidth_2D_H2D_Strided 4.951 ms 4.781000 ms 96.57% -3.43% -
MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous 4.733 ms 4.414000 ms 93.26% -6.74% --
MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous 4.703 ms 4.322000 ms 91.90% -8.10% --
MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous 4.612 ms 4.238000 ms 91.89% -8.11% --
MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous 4.720 ms 4.317000 ms 91.46% -8.54% --
Relative perf in group Pattern (10): 99.958%
Benchmark This PR baseline Relative perf Change -
Pattern_SegmentedReduction_Hierarchical_int32 11.584000 ms 11.599 ms 100.13% 0.13% .
Pattern_SegmentedReduction_Hierarchical_int64 11.764000 ms 11.779 ms 100.13% 0.13% .
Pattern_SegmentedReduction_NDRange_int16 2.263000 ms 2.264 ms 100.04% 0.04% .
Pattern_SegmentedReduction_Hierarchical_int16 11.798000 ms 11.801 ms 100.03% 0.03% .
Pattern_SegmentedReduction_Hierarchical_fp32 11.587000 ms 11.589 ms 100.02% 0.02% .
Pattern_SegmentedReduction_NDRange_int32 2.164000 ms 2.164 ms 100.00% 0.00% .
Pattern_SegmentedReduction_NDRange_int64 2.337 ms 2.336000 ms 99.96% -0.04% .
Pattern_SegmentedReduction_NDRange_fp32 2.166 ms 2.163000 ms 99.86% -0.14% .
Pattern_Reduction_Hierarchical_int32 16.440 ms 16.411000 ms 99.82% -0.18% .
Pattern_Reduction_NDRange_int32 16.228 ms 16.163000 ms 99.60% -0.40% .
Relative perf in group ScalarProduct (6): 99.867%
Benchmark This PR baseline Relative perf Change -
ScalarProduct_NDRange_fp32 3.744000 ms 3.759 ms 100.40% 0.40% .
ScalarProduct_Hierarchical_fp32 10.142000 ms 10.170 ms 100.28% 0.28% .
ScalarProduct_Hierarchical_int64 11.487000 ms 11.490 ms 100.03% 0.03% .
ScalarProduct_Hierarchical_int32 10.522000 ms 10.523 ms 100.01% 0.01% .
ScalarProduct_NDRange_int64 5.460 ms 5.456000 ms 99.93% -0.07% .
ScalarProduct_NDRange_int32 3.787 ms 3.733000 ms 98.57% -1.43% .
Relative perf in group USM (7): 99.402%
Benchmark This PR baseline Relative perf Change -
USM_Allocation_latency_fp32_shared 0.060000 ms 0.066 ms 110.00% 10.00% ++
USM_Allocation_latency_fp32_device 0.067000 ms 0.068 ms 101.49% 1.49% .
USM_Allocation_latency_fp32_host 37.754000 ms 37.899 ms 100.38% 0.38% .
USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch 1.853 ms 1.814000 ms 97.90% -2.10% -
USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch 1.720 ms 1.661000 ms 96.57% -3.43% -
USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch 1.097 ms 1.046000 ms 95.35% -4.65% -
USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch 1.259 ms 1.195000 ms 94.92% -5.08% -
Relative perf in group VectorAddition (3): 99.941%
Benchmark This PR baseline Relative perf Change -
VectorAddition_int64 3.108000 ms 3.139 ms 101.00% 1.00% .
VectorAddition_fp32 1.447 ms 1.445000 ms 99.86% -0.14% .
VectorAddition_int32 1.463 ms 1.448000 ms 98.97% -1.03% .
Relative perf in group Polybench (3): 100.191%
Benchmark This PR baseline Relative perf Change -
Polybench_2mm 1.211000 ms 1.216 ms 100.41% 0.41% .
Polybench_Atax 6.869000 ms 6.880 ms 100.16% 0.16% .
Polybench_3mm 1.727000 ms 1.727 ms 100.00% 0.00% .
Relative perf in group Kmeans (1): 100.287%
Benchmark This PR baseline Relative perf Change -
Kmeans_fp32 16.037000 ms 16.083 ms 100.29% 0.29% .
Relative perf in group LinearRegressionCoeff (1): cannot calculate
Benchmark This PR baseline Relative perf Change -
LinearRegressionCoeff_fp32 840.429000 ms -
Relative perf in group MolecularDynamics (1): 93.333%
Benchmark This PR baseline Relative perf Change -
MolecularDynamics 0.030 ms 0.028000 ms 93.33% -6.67% --
Relative perf in group alloc/size:10000/0/4096/iterations:200000/threads:4 (4): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/size:10000/0/4096/iterations:200000/threads:4 glibc 2437.540000 ns -
alloc/size:10000/0/4096/iterations:200000/threads:4 os_provider 2184.030000 ns -
alloc/size:10000/0/4096/iterations:200000/threads:4 proxy_pool<os_provider> 2936.560000 ns -
alloc/size:10000/0/4096/iterations:200000/threads:4 scalable_pool<os_provider> 301.359000 ns -
Relative perf in group alloc/size:10000/0/4096/iterations:200000/threads:1 (4): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/size:10000/0/4096/iterations:200000/threads:1 glibc 694.985000 ns -
alloc/size:10000/0/4096/iterations:200000/threads:1 os_provider 192.264000 ns -
alloc/size:10000/0/4096/iterations:200000/threads:1 proxy_pool<os_provider> 263.332000 ns -
alloc/size:10000/0/4096/iterations:200000/threads:1 scalable_pool<os_provider> 205.812000 ns -
Relative perf in group alloc/size:10000/100000/4096/iterations:200000/threads:4 (4): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/size:10000/100000/4096/iterations:200000/threads:4 glibc 1206.980000 ns -
alloc/size:10000/100000/4096/iterations:200000/threads:4 os_provider 1838.860000 ns -
alloc/size:10000/100000/4096/iterations:200000/threads:4 proxy_pool<os_provider> 3143.580000 ns -
alloc/size:10000/100000/4096/iterations:200000/threads:4 scalable_pool<os_provider> 260.359000 ns -
Relative perf in group alloc/size:10000/100000/4096/iterations:200000/threads:1 (4): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/size:10000/100000/4096/iterations:200000/threads:1 glibc 720.620000 ns -
alloc/size:10000/100000/4096/iterations:200000/threads:1 os_provider 188.160000 ns -
alloc/size:10000/100000/4096/iterations:200000/threads:1 proxy_pool<os_provider> 298.914000 ns -
alloc/size:10000/100000/4096/iterations:200000/threads:1 scalable_pool<os_provider> 200.241000 ns -
Relative perf in group alloc/min (4): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4 glibc 826.120000 ns -
alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1 glibc 177.484000 ns -
alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4 scalable_pool<os_provider> 953.037000 ns -
alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1 scalable_pool<os_provider> 934.560000 ns -
Relative perf in group multiple (24): cannot calculate
Benchmark This PR baseline Relative perf Change -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 glibc 30422.000000 ns -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 glibc 4349.980000 ns -
multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4 glibc 138635.000000 ns -
multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1 glibc 31499.300000 ns -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 proxy_pool<os_provider> 1196650.000000 ns -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 proxy_pool<os_provider> 157278.000000 ns -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 os_provider 1255370.000000 ns -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 os_provider 140676.000000 ns -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 scalable_pool<os_provider> 42120.000000 ns -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 scalable_pool<os_provider> 14824.500000 ns -
multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4 scalable_pool<os_provider> 75328.800000 ns -
multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1 scalable_pool<os_provider> 25524.100000 ns -
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 glibc - 32574.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 glibc - 4128.530000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 glibc - 138399.000000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 glibc - 28197.400000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 proxy_pool<os_provider> - 1161430.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 proxy_pool<os_provider> - 161766.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 os_provider - 1166110.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 os_provider - 141737.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 scalable_pool<os_provider> - 42212.800000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 scalable_pool<os_provider> - 14889.200000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 scalable_pool<os_provider> - 72778.500000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 scalable_pool<os_provider> - 27538.700000 ns
Relative perf in group llama.cpp (6): cannot calculate
Benchmark This PR baseline Relative perf Change -
llama.cpp Prompt Processing Batched 128 - 838.869803 token/s
llama.cpp Text Generation Batched 128 - 63.338561 token/s
llama.cpp Prompt Processing Batched 256 - 872.377637 token/s
llama.cpp Text Generation Batched 256 - 63.361520 token/s
llama.cpp Prompt Processing Batched 512 - 434.541716 token/s
llama.cpp Text Generation Batched 512 - 63.295460 token/s
Relative perf in group alloc/max (20): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 glibc - 2589.180000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 glibc - 710.936000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 glibc - 1188.310000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 glibc - 716.901000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 glibc - 861.597000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 glibc - 175.935000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 os_provider - 2246.790000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 os_provider - 187.819000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 os_provider - 1690.250000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 os_provider - 189.702000 ns
alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4 proxy_pool<os_provider> - 4441.700000 ns
alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1 proxy_pool<os_provider> - 256.696000 ns
alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4 proxy_pool<os_provider> - 3268.220000 ns
alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1 proxy_pool<os_provider> - 306.439000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 scalable_pool<os_provider> - 299.852000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 scalable_pool<os_provider> - 213.534000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 scalable_pool<os_provider> - 263.904000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 scalable_pool<os_provider> - 197.833000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 scalable_pool<os_provider> - 1051.720000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 scalable_pool<os_provider> - 952.492000 ns

Details

Benchmark details - environment, command, output...
api_overhead_benchmark_l0 SubmitKernel out of order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_l0 --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=l0 Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),11.626,11.582,2.62%,10.847,21.734,[CPU],[us]

api_overhead_benchmark_sycl SubmitKernel out of order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=sycl Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),24.098,23.944,3.58%,23.191,246.489,[CPU],[us]

api_overhead_benchmark_sycl SubmitKernel in order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=sycl Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),25.231,25.082,4.54%,24.016,346.409,[CPU],[us]

memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=QueueInOrderMemcpy --csv --noHeaders --iterations=10000 --IsCopyOnly=0 --sourcePlacement=Device --destinationPlacement=Device --size=1024 --count=100

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
QueueInOrderMemcpy(api=sycl IsCopyOnly=0 sourcePlacement=Device destinationPlacement=Device size=1KB count=100),259.355,257.869,1.80%,253.994,539.452,[CPU],[us]

memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=QueueInOrderMemcpy --csv --noHeaders --iterations=10000 --IsCopyOnly=0 --sourcePlacement=Host --destinationPlacement=Device --size=1024 --count=100

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
QueueInOrderMemcpy(api=sycl IsCopyOnly=0 sourcePlacement=Host destinationPlacement=Device size=1KB count=100),239.230,225.520,20.24%,221.774,524.205,[CPU],[us]

memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=QueueMemcpy --csv --noHeaders --iterations=10000 --sourcePlacement=Device --destinationPlacement=Device --size=1024

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
QueueMemcpy(api=sycl sourcePlacement=Device destinationPlacement=Device size=1KB),6.417,5.962,17.04%,5.499,74.012,[CPU],[us]

memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=StreamMemory --csv --noHeaders --iterations=10000 --type=Triad --size=10240 --memoryPlacement=Device --useEvents=0 --contents=Zeros --multiplier=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
StreamMemory(api=sycl type=Triad size=10KB useEvents=0 contents=Zeros memoryPlacement=Device multiplier=1),2.998,3.056,6.24%,0.365,3.264,[CPU],[GB/s]

api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=ExecImmediateCopyQueue --csv --noHeaders --iterations=100000 --ioq=0 --IsCopyOnly=1 --MeasureCompletionTime=0 --src=Device --dst=Device --size=1024

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
ExecImmediateCopyQueue(api=sycl IsCopyOnly=1 MeasureCompletionTime=0 src=Device dst=Device size=1KB ioq=0),2.207,2.200,4.45%,1.993,9.910,[CPU],[us]

api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=ExecImmediateCopyQueue --csv --noHeaders --iterations=100000 --ioq=1 --IsCopyOnly=1 --MeasureCompletionTime=0 --src=Host --dst=Host --size=1024

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
ExecImmediateCopyQueue(api=sycl IsCopyOnly=1 MeasureCompletionTime=0 src=Host dst=Host size=1KB ioq=1),1.685,1.674,5.13%,1.575,8.978,[CPU],[us]

miscellaneous_benchmark_sycl VectorSum

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/miscellaneous_benchmark_sycl --test=VectorSum --csv --noHeaders --iterations=1000 --numberOfElementsX=512 --numberOfElementsY=256 --numberOfElementsZ=256

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
VectorSum(api=sycl numberOfElementsX=512 numberOfElementsY=256 numberOfElementsZ=256),858.033,858.609,0.45%,820.001,866.293,[GPU],bw [GB/s]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=1 --NumOpsPerThread=400 --iterations=10 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=1 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),6922.441,6891.437,1.00%,6864.405,7080.902,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=8 --NumOpsPerThread=100 --iterations=10 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=100 NumThreads=8 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),16960.409,16958.141,3.68%,15689.935,18432.492,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=8 --NumOpsPerThread=400 --iterations=1000 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=8 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),47416.129,47437.164,1.70%,44378.699,49916.452,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=16 --NumOpsPerThread=10 --iterations=10000 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=10 NumThreads=16 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),2096.115,2043.209,26.35%,1545.476,17679.507,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=1 --NumOpsPerThread=400 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=1 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),7879.182,7865.020,1.44%,7724.843,8095.812,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=8 --NumOpsPerThread=100 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=100 NumThreads=8 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),8875.031,8708.383,5.15%,8571.501,10161.543,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=8 --NumOpsPerThread=400 --iterations=1000 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=8 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),26899.390,26765.936,1.84%,25830.661,28370.590,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=16 --NumOpsPerThread=10 --iterations=10000 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=10 NumThreads=16 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),1288.682,1191.024,54.28%,908.979,15263.874,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:1, allocSize:1024 srcUSM:0 dstUSM:1 without events

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=0 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=1 --NumOpsPerThread=4096 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=4096 NumThreads=1 AllocSize=1024 MeasureCompletion=1 UseEvents=0 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),42655.824,42636.515,0.36%,42440.092,42962.465,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:4, allocSize:1024 srcUSM:0 dstUSM:1 without events

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=0 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=4 --NumOpsPerThread=4096 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=4096 NumThreads=4 AllocSize=1024 MeasureCompletion=1 UseEvents=0 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),111142.326,111169.587,0.27%,110679.293,111647.184,[CPU],[us]

api_overhead_benchmark_ur SubmitKernel out of order CPU count

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),105543.699,105483.000,6.02%,105348.000,2110447.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),15.929,15.667,349.74%,15.016,17631.761,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel out of order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),105543.699,105483.000,6.02%,105348.000,2110447.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),15.907,15.705,357.98%,15.091,18022.856,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel in order CPU count

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),110902.265,110835.000,3.98%,110835.000,1500644.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),16.789,16.558,272.69%,15.826,14490.190,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel in order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),110902.265,110835.000,3.98%,110835.000,1500644.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),16.789,16.558,272.69%,15.826,14490.190,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel in order with measure completion CPU count

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=1 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=1),124479.684,124015.000,3.80%,123475.000,1587502.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=1),21.890,21.528,200.65%,20.157,13906.972,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel in order with measure completion

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=1 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=1),124480.174,124015.000,3.81%,123700.000,1590337.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=1),21.896,21.533,207.46%,20.107,14382.390,[CPU],time [us]

Velocity-Bench Hashtable

Environment Variables:

Command:

/home/pmdk/bench_workdir/hashtable/hashtable_sycl --no-verify

Output:

hashtable - total time for whole calculation: 0.377266 s
355.764252 million keys/second

Velocity-Bench Bitcracker

Environment Variables:

Command:

/home/pmdk/bench_workdir/bitcracker/bitcracker -f /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt -d /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt -b 60000

Output:

---------> BitCracker: BitLocker password cracking tool <---------

==================================
Retrieving Info

Reading hash file "/home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt"

              Attack

================================================
Type of attack: User Password
Psw per thread: 1
max_num_pswd_per_read: 60000
Dictionary: /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt
MAC Comparison (-m): Yes

Iter: 1, num passwords read: 60000
Kernel execution:
Effective passwords: 60000
Passwords Range:
npknpByH7N2m3OnLNH1X9DJxLrzIFWk
.....
dL_7uuf3QCz-c6K3xDu0

================================================
Bitcracker attack completed
Total passwords evaluated: 60000
Password not found!

time to subtract from total: 0.00381339 s
bitcracker - total time for whole calculation: 35.2537 s

Velocity-Bench CudaSift

Environment Variables:

Command:

/home/pmdk/bench_workdir/cudaSift/cudaSift

Output:

UNKN:

UNKN: ==================================================
UNKN: User input parameters:
UNKN: Trace: ../../inputData
UNKN: ==================================================
UNKN:

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1223 1255 33.2066% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1094 1264 29.704% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1089 1251 29.5683% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1222 1261 33.1795% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1235 1271 33.5324% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1111 1254 30.1656% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1238 1270 33.6139% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1108 1280 30.0842% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1226 1260 33.2881% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1121 1264 30.4371% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1221 1258 33.1523% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1186 1255 32.202% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1068 1270 28.9981% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1219 1253 33.098% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1233 1268 33.4781% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1206 1258 32.745% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1226 1258 33.2881% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1083 1266 29.4054% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1090 1262 29.5954% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1219 1266 33.098% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1120 1259 30.41% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1224 1259 33.2338% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1204 1257 32.6907% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1229 1265 33.3695% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1219 1277 33.098% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1242 1276 33.7225% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1231 1266 33.4238% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1053 1267 28.5908% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1062 1259 28.8352% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1230 1265 33.3967% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1127 1263 30.6001% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1231 1265 33.4238% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1045 1269 28.3736% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1233 1268 33.4781% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1215 1254 32.9894% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1101 1274 29.8941% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1102 1271 29.9213% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1082 1258 29.3782% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1126 1266 30.5729% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1097 1255 29.7855% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1132 1267 30.7358% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1229 1266 33.3695% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1228 1264 33.3424% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1222 1258 33.1795% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1223 1257 33.2066% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1212 1248 32.908% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1118 1266 30.3557% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1080 1265 29.3239% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1230 1266 33.3967% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1226 1263 33.2881% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Avg workload time = 203.197 ms

Velocity-Bench Easywave

Environment Variables:

Command:

/home/pmdk/bench_workdir/easywave/easyWave_sycl -grid /home/pmdk/bench_workdir/data/easywave/examples/e2Asean.grd -source /home/pmdk/bench_workdir/data/easywave/examples/BengkuluSept2007.flt -time 120

Output:

MAIN: Starting SYCL main program
MAIN: Attempting to clean up previous eWave tsunami files
MAIN: Clean up completed
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.6.0)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
MAIN: Program successfully completed

Velocity-Bench QuickSilver

Environment Variables:

QS_DEVICE=GPU

Command:

/home/pmdk/bench_workdir/QuickSilver/qs -i /home/pmdk/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp

Output:

Copyright (c) 2016
Lawrence Livermore National Security, LLC
All Rights Reserved
Quicksilver Version :
Quicksilver Git Hash :
MPI Version : 3.0
Number of MPI ranks : 1
Number of OpenMP Threads: 1
Number of OpenMP CPUs : 1

Loading params
Finished loading params
Simulation:
dt: 1e-08
fMax: 0.1
inputFile: /home/pmdk/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp
energySpectrum:
boundaryCondition: octant
loadBalance: 1
cycleTimers: 0
debugThreads: 0
lx: 100
ly: 100
lz: 100
nParticles: 10000000
batchSize: 0
nBatches: 10
nSteps: 10
nx: 10
ny: 10
nz: 10
seed: 1029384756
xDom: 0
yDom: 0
zDom: 0
eMax: 20
eMin: 1e-09
nGroups: 230
lowWeightCutoff: 0.001
bTally: 1
fTally: 1
cTally: 1
coralBenchmark: 0
crossSectionsOut:

Geometry:
material: sourceMaterial
shape: brick
xMax: 100
xMin: 0
yMax: 100
yMin: 0
zMax: 100
zMin: 0

Material:
name: sourceMaterial
mass: 1000
nIsotopes: 10
nReactions: 9
sourceRate: 1e+10
totalCrossSection: 0.1
absorptionCrossSection: flat
fissionCrossSection: flat
scatteringCrossSection: flat
absorptionCrossSectionRatio: 0
fissionCrossSectionRatio: 0
scatteringCrossSectionRatio: 1

CrossSection:
name: flat
A: 0
B: 0
C: 0
D: 0
E: 1
nuBar: 2.4
setting GPU
setting parameters
Building partition 0
Building partition 1
Building partition 2
Building partition 3
Building MC_Domain 0
Building MC_Domain 1
Building MC_Domain 2
Building MC_Domain 3
Starting Consistency Check
Finished Consistency Check
Finished initMesh
Started copyMaterialDatabase_device
Finished copyMaterialDatabase_device
Finished copyNuclearData_device
Finished copyDomainDevice
cycle start source rr split absorb scatter fission produce collisn escape census num_seg scalar_flux cycleInit cycleTracking cycleFinalize
0 0 1000000 0 9000000 0 18533189 0 0 18533189 1151780 8848220 55527935 1.854923e+09 4.334930e-01 6.279490e-01 0.000000e+00
1 8848220 1000000 0 151478 0 34281997 0 0 34281997 1664159 8335539 94633679 5.047651e+09 3.654610e-01 7.708770e-01 0.000000e+00
2 8335539 1000000 0 663717 0 34354432 0 0 34354432 1366771 8632485 95010375 7.705930e+09 3.621910e-01 7.803650e-01 0.000000e+00
3 8632485 1000000 0 367978 0 34302727 0 0 34302727 1242216 8758247 94953591 9.992076e+09 3.612940e-01 8.338790e-01 0.000000e+00
4 8758247 1000000 0 242076 0 34141236 0 0 34141236 1168452 8831871 94599337 1.199834e+10 3.323040e-01 7.907160e-01 0.000000e+00
5 8831871 1000000 0 168070 0 33948724 0 0 33948724 1121156 8878785 94148236 1.377636e+10 3.320770e-01 7.641590e-01 0.000000e+00
6 8878785 1000000 0 120572 0 33760567 0 0 33760567 1089103 8910254 93689264 1.535668e+10 3.314860e-01 7.637800e-01 0.000000e+00
7 8910254 1000000 0 89810 0 33552179 0 0 33552179 1065203 8934861 93216931 1.676993e+10 3.313930e-01 7.842600e-01 0.000000e+00
8 8934861 1000000 0 65491 0 33384605 0 0 33384605 1047720 8952632 92768273 1.804559e+10 3.331700e-01 7.826970e-01 0.000000e+00
9 8952632 1000000 0 47165 0 33198494 0 0 33198494 1033968 8965829 92324678 1.920208e+10 3.316040e-01 7.594260e-01 0.000000e+00

Timer Cumulative Cumulative Cumulative Cumulative Cumulative Cumulative
Name number microSecs microSecs microSecs microSecs Efficiency
of calls min avg max stddev Rating
main 1 1.117e+07 1.117e+07 1.117e+07 0.000e+00 100.00
cycleInit 10 3.514e+06 3.514e+06 3.514e+06 0.000e+00 100.00
cycleTracking 10 7.658e+06 7.658e+06 7.658e+06 0.000e+00 100.00
cycleTracking_Kernel 104 4.927e+06 4.927e+06 4.927e+06 0.000e+00 100.00
cycleTracking_MPI 117 2.074e+05 2.074e+05 2.074e+05 0.000e+00 100.00
cycleTracking_Test_Done 0 0.000e+00 0.000e+00 0.000e+00 0.000e+00 0.00
cycleFinalize 20 4.100e+02 4.100e+02 4.100e+02 0.000e+00 100.00
Figure Of Merit 117.64 [Num Mega Segments / Cycle Tracking Time]

Velocity-Bench Sobel Filter

Environment Variables:

OPENCV_IO_MAX_IMAGE_PIXELS=1677721600

Command:

/home/pmdk/bench_workdir/sobel_filter/sobel_filter -i /home/pmdk/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png -n 5

Output:

SYMN: Welcome to the SYCL version of Sobel filter workload.
SYMN: Input image file: /home/pmdk/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png
SYMN: Launching SYCL kernel with # of iterations: 5
time to subtract from total: 7.48217 s
sobelfilter - total time for whole calculation: 0.603242 s

Runtime_IndependentDAGTaskThroughput_SingleTask

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_SingleTask', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.264505', '0.264721', '0.253657', '0.253657 0.264721 0.275136', '0.010741', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_IndependentDAGTaskThroughput_BasicParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_BasicParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.286228', '0.281980', '0.269502', '0.269502 0.281980 0.307203', '0.019206', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.271548', '0.271946', '0.268885', '0.268885 0.271946 0.273814', '0.002488', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.282388', '0.272793', '0.270145', '0.270145 0.272793 0.304225', '0.018958', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_SingleTask

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_SingleTask', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.667697', '1.667011', '1.661389', '1.661389 1.667011 1.674692', '0.006678', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_BasicParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_BasicParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.724477', '1.724390', '1.723009', '1.723009 1.724390 1.726033', '0.001514', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_HierarchicalParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_HierarchicalParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.707265', '1.707455', '1.705249', '1.705249 1.707455 1.709092', '0.001929', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_NDRangeParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_NDRangeParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.690128', '1.689333', '1.688722', '1.688722 1.689333 1.692329', '0.001931', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.031230', '0.004612', '0.004605', '0.004605 0.004612 0.084471', '0.046109', '27.142027', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004775', '0.004720', '0.004666', '0.004666 0.004720 0.004938', '0.000144', '26.787469', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004714', '0.004703', '0.004615', '0.004615 0.004703 0.004826', '0.000106', '27.087117', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004726', '0.004733', '0.004697', '0.004697 0.004733 0.004748', '0.000027', '26.615565', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.617776', '0.617773', '0.617742', '0.617742 0.617773 0.617813', '0.000036', '0.202350', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.617777', '0.617770', '0.617747', '0.617747 0.617770 0.617813', '0.000034', '0.202348', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_1D_H2D_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_H2D_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004489', '0.004518', '0.004392', '0.004392 0.004518 0.004557', '0.000086', '28.463599', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_H2D_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_H2D_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004950', '0.004951', '0.004939', '0.004939 0.004951 0.004959', '0.000010', '25.308931', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_H2D_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_H2D_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004693', '0.004707', '0.004608', '0.004608 0.004707 0.004763', '0.000078', '27.125259', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_1D_D2H_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_D2H_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004832', '0.004759', '0.004751', '0.004751 0.004759 0.004987', '0.000134', '26.312892', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_D2H_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_D2H_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.617225', '0.617206', '0.617205', '0.617205 0.617206 0.617265', '0.000034', '0.202526', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_D2H_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_D2H_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.617092', '0.617095', '0.617080', '0.617080 0.617095 0.617102', '0.000011', '0.202567', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_LocalMem_int32_4096

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/local_mem --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/LocalMem_multi.csv --size=10240000

Output:

['MicroBench_LocalMem_int32_4096', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.029901', '0.029902', '0.029846', '0.029846 0.029902 0.029954', '0.000054', '10453.691554', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '312.000000']

MicroBench_LocalMem_fp32_4096

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/local_mem --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/LocalMem_multi.csv --size=10240000

Output:

['MicroBench_LocalMem_fp32_4096', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.029877', '0.029877', '0.029862', '0.029862 0.029877 0.029891', '0.000014', '10447.901889', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '312.000000']

Pattern_Reduction_NDRange_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_Reduction_multi.csv --size=10240000

Output:

['Pattern_Reduction_NDRange_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.016292', '0.016228', '0.016140', '0.016140 0.016228 0.016510', '0.000193', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_Reduction_Hierarchical_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_Reduction_multi.csv --size=10240000

Output:

['Pattern_Reduction_Hierarchical_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.016453', '0.016440', '0.016352', '0.016352 0.016440 0.016566', '0.000108', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_NDRange_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_NDRange_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.003778', '0.003787', '0.003729', '0.003729 0.003787 0.003819', '0.000046', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_NDRange_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_NDRange_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.005458', '0.005460', '0.005432', '0.005432 0.005460 0.005483', '0.000026', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_NDRange_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_NDRange_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.003748', '0.003744', '0.003735', '0.003735 0.003744 0.003764', '0.000015', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_Hierarchical_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_Hierarchical_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.010527', '0.010522', '0.010521', '0.010521 0.010522 0.010539', '0.000010', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_Hierarchical_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_Hierarchical_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011485', '0.011487', '0.011442', '0.011442 0.011487 0.011526', '0.000042', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_Hierarchical_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_Hierarchical_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.010146', '0.010142', '0.010137', '0.010137 0.010142 0.010157', '0.000010', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_int16

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_int16', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002271', '0.002263', '0.002260', '0.002260 0.002263 0.002288', '0.000015', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002165', '0.002164', '0.002161', '0.002161 0.002164 0.002169', '0.000004', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002340', '0.002337', '0.002333', '0.002333 0.002337 0.002351', '0.000009', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002166', '0.002166', '0.002157', '0.002157 0.002166 0.002174', '0.000008', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_int16

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_int16', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011798', '0.011798', '0.011790', '0.011790 0.011798 0.011806', '0.000008', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011584', '0.011584', '0.011571', '0.011571 0.011584 0.011596', '0.000012', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011775', '0.011764', '0.011760', '0.011760 0.011764 0.011800', '0.000022', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011585', '0.011587', '0.011566', '0.011566 0.011587 0.011601', '0.000018', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Allocation_latency_fp32_device

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Allocation_latency_multi.csv --size=1024000000

Output:

['USM_Allocation_latency_fp32_device', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '1024000000', '0.000063', '0.000067', '0.000048', '0.000048 0.000067 0.000074', '0.000013', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Allocation_latency_fp32_host

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Allocation_latency_multi.csv --size=1024000000

Output:

['USM_Allocation_latency_fp32_host', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '1024000000', '0.037806', '0.037754', '0.037652', '0.037652 0.037754 0.038014', '0.000187', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Allocation_latency_fp32_shared

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Allocation_latency_multi.csv --size=1024000000

Output:

['USM_Allocation_latency_fp32_shared', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '1024000000', '0.000060', '0.000060', '0.000057', '0.000057 0.000060 0.000062', '0.000003', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.021709', '0.001720', '0.001682', '0.001682 0.001720 0.061725', '0.034655', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.001095', '0.001097', '0.001076', '0.001076 0.001097 0.001113', '0.000019', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.001886', '0.001853', '0.001844', '0.001844 0.001853 0.001961', '0.000065', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.001251', '0.001259', '0.001235', '0.001235 0.001259 0.001259', '0.000014', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

VectorAddition_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/VectorAddition_multi.csv --size=102400000

Output:

['VectorAddition_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.001472', '0.001463', '0.001452', '0.001452 0.001463 0.001501', '0.000026', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

VectorAddition_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/VectorAddition_multi.csv --size=102400000

Output:

['VectorAddition_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.003101', '0.003108', '0.003041', '0.003041 0.003108 0.003154', '0.000057', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

VectorAddition_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/VectorAddition_multi.csv --size=102400000

Output:

['VectorAddition_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.001465', '0.001447', '0.001444', '0.001444 0.001447 0.001504', '0.000034', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Polybench_2mm

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/2mm --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/2mm.csv --size=512

Output:

['Polybench_2mm', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.001211', '0.001211', '0.001204', '0.001204 0.001211 0.001218', '0.000007', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Polybench_3mm

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/3mm --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/3mm.csv --size=512

Output:

['Polybench_3mm', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.001728', '0.001727', '0.001720', '0.001720 0.001727 0.001738', '0.000009', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Polybench_Atax

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/atax --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Atax.csv --size=8192

Output:

['Polybench_Atax', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.006862', '0.006869', '0.006845', '0.006845 0.006869 0.006873', '0.000015', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Kmeans_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/kmeans --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Kmeans.csv --size=700000000

Output:

['Kmeans_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '700000000', '0.016047', '0.016037', '0.016033', '0.016033 0.016037 0.016071', '0.000021', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

LinearRegressionCoeff_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/lin_reg_coeff --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/LinearRegressionCoeff.csv --size=1638400000

Output:

['LinearRegressionCoeff_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '1638400000', '0.840594', '0.840429', '0.840349', '0.840349 0.840429 0.841005', '0.000358', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

MolecularDynamics

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/mol_dyn --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/MolecularDynamics.csv --size=8196

Output:

['MolecularDynamics', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8196', '0.000036', '0.000030', '0.000026', '0.000026 0.000030 0.000053', '0.000015', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

alloc/size:10000/0/4096/iterations:200000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2437.54,1817.97,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,688.508,688.505,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1184.15,1093.31,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,695.751,695.754,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,753.346,727.617,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.995,173.994,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2184.03,2182.57,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,192.264,192.22,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1870.97,1870.63,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,188.16,188.155,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3328.5,3279.48,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,262.784,262.778,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3410.01,3357.55,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,294.458,294.451,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,301.359,298.08,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,214.39,214.39,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,260.359,259.494,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,202.78,202.775,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,953.037,949.463,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,934.56,934.549,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,30027,28169.9,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4349.98,4349.83,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137289,86382,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31499.3,31499.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27019e+06,1.26961e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,157278,157277,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27141e+06,1.27046e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,140676,140669,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41618.2,40749.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14590.7,14590.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,73967.8,73947,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25524.1,25523.4,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2764.81,1900.04,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,694.985,694.985,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1216.32,1123.54,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,720.62,720.627,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,826.12,754.384,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.484,177.482,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2253.23,2252.48,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.334,189.328,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1838.86,1838.59,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.571,187.566,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2900.31,2848.73,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,263.332,263.281,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3044.82,2997.71,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,299.397,299.393,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,295.171,289.665,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,203.121,203.12,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,256.383,252.49,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,185.624,185.622,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,966.92,950.596,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,945.259,945.211,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,35843,33591.5,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4738.75,4738.6,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,140896,89003.2,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,32762.6,32760.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19665e+06,1.19505e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,160390,160389,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.25537e+06,1.25475e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,141738,141737,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42120,41471.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15134.5,15134.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76302.7,76262.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,24911.5,24911,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2373.57,1815.89,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,708.215,708.211,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1206.98,1157.2,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,735.105,735.103,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,831.424,755.063,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,183.023,183.015,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,1983.6,1981.93,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,193.023,193.019,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1815.69,1814.22,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,188.915,188.909,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2936.56,2895.08,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,272.917,272.911,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3143.58,3093.29,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,298.914,298.909,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,306.648,294.85,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,205.812,205.806,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,269.061,268.492,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.241,200.235,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,932.898,923.331,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,927.791,927.735,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,30422,28148.7,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4295.1,4294.93,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,138635,87278.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30805.3,30805,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.11973e+06,1.11908e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,154695,154694,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.14994e+06,1.14906e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138763,138762,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42844,41124,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14824.5,14824.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75328.8,74044.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27715.7,27715.1,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2764.81,1900.04,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,694.985,694.985,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1216.32,1123.54,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,720.62,720.627,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,826.12,754.384,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.484,177.482,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2253.23,2252.48,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.334,189.328,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1838.86,1838.59,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.571,187.566,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2900.31,2848.73,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,263.332,263.281,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3044.82,2997.71,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,299.397,299.393,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,295.171,289.665,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,203.121,203.12,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,256.383,252.49,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,185.624,185.622,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,966.92,950.596,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,945.259,945.211,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,35843,33591.5,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4738.75,4738.6,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,140896,89003.2,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,32762.6,32760.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19665e+06,1.19505e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,160390,160389,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.25537e+06,1.25475e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,141738,141737,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42120,41471.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15134.5,15134.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76302.7,76262.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,24911.5,24911,ns,,,,,

alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2764.81,1900.04,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,694.985,694.985,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1216.32,1123.54,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,720.62,720.627,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,826.12,754.384,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.484,177.482,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2253.23,2252.48,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.334,189.328,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1838.86,1838.59,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.571,187.566,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2900.31,2848.73,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,263.332,263.281,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3044.82,2997.71,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,299.397,299.393,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,295.171,289.665,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,203.121,203.12,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,256.383,252.49,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,185.624,185.622,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,966.92,950.596,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,945.259,945.211,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,35843,33591.5,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4738.75,4738.6,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,140896,89003.2,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,32762.6,32760.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19665e+06,1.19505e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,160390,160389,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.25537e+06,1.25475e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,141738,141737,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42120,41471.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15134.5,15134.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76302.7,76262.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,24911.5,24911,ns,,,,,

alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2764.81,1900.04,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,694.985,694.985,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1216.32,1123.54,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,720.62,720.627,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,826.12,754.384,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.484,177.482,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2253.23,2252.48,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.334,189.328,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1838.86,1838.59,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.571,187.566,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2900.31,2848.73,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,263.332,263.281,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3044.82,2997.71,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,299.397,299.393,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,295.171,289.665,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,203.121,203.12,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,256.383,252.49,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,185.624,185.622,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,966.92,950.596,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,945.259,945.211,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,35843,33591.5,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4738.75,4738.6,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,140896,89003.2,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,32762.6,32760.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19665e+06,1.19505e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,160390,160389,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.25537e+06,1.25475e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,141738,141737,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42120,41471.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15134.5,15134.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76302.7,76262.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,24911.5,24911,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:4 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2437.54,1817.97,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,688.508,688.505,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1184.15,1093.31,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,695.751,695.754,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,753.346,727.617,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.995,173.994,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2184.03,2182.57,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,192.264,192.22,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1870.97,1870.63,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,188.16,188.155,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3328.5,3279.48,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,262.784,262.778,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3410.01,3357.55,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,294.458,294.451,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,301.359,298.08,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,214.39,214.39,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,260.359,259.494,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,202.78,202.775,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,953.037,949.463,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,934.56,934.549,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,30027,28169.9,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4349.98,4349.83,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137289,86382,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31499.3,31499.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27019e+06,1.26961e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,157278,157277,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27141e+06,1.27046e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,140676,140669,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41618.2,40749.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14590.7,14590.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,73967.8,73947,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25524.1,25523.4,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:1 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2437.54,1817.97,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,688.508,688.505,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1184.15,1093.31,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,695.751,695.754,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,753.346,727.617,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.995,173.994,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2184.03,2182.57,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,192.264,192.22,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1870.97,1870.63,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,188.16,188.155,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3328.5,3279.48,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,262.784,262.778,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3410.01,3357.55,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,294.458,294.451,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,301.359,298.08,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,214.39,214.39,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,260.359,259.494,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,202.78,202.775,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,953.037,949.463,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,934.56,934.549,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,30027,28169.9,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4349.98,4349.83,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137289,86382,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31499.3,31499.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27019e+06,1.26961e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,157278,157277,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27141e+06,1.27046e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,140676,140669,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41618.2,40749.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14590.7,14590.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,73967.8,73947,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25524.1,25523.4,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:4 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2764.81,1900.04,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,694.985,694.985,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1216.32,1123.54,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,720.62,720.627,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,826.12,754.384,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.484,177.482,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2253.23,2252.48,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.334,189.328,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1838.86,1838.59,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.571,187.566,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2900.31,2848.73,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,263.332,263.281,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3044.82,2997.71,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,299.397,299.393,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,295.171,289.665,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,203.121,203.12,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,256.383,252.49,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,185.624,185.622,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,966.92,950.596,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,945.259,945.211,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,35843,33591.5,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4738.75,4738.6,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,140896,89003.2,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,32762.6,32760.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19665e+06,1.19505e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,160390,160389,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.25537e+06,1.25475e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,141738,141737,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42120,41471.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15134.5,15134.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76302.7,76262.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,24911.5,24911,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:1 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2437.54,1817.97,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,688.508,688.505,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1184.15,1093.31,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,695.751,695.754,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,753.346,727.617,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.995,173.994,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2184.03,2182.57,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,192.264,192.22,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1870.97,1870.63,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,188.16,188.155,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3328.5,3279.48,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,262.784,262.778,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3410.01,3357.55,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,294.458,294.451,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,301.359,298.08,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,214.39,214.39,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,260.359,259.494,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,202.78,202.775,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,953.037,949.463,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,934.56,934.549,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,30027,28169.9,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4349.98,4349.83,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137289,86382,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31499.3,31499.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27019e+06,1.26961e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,157278,157277,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27141e+06,1.27046e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,140676,140669,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41618.2,40749.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14590.7,14590.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,73967.8,73947,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25524.1,25523.4,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:4 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2373.57,1815.89,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,708.215,708.211,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1206.98,1157.2,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,735.105,735.103,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,831.424,755.063,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,183.023,183.015,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,1983.6,1981.93,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,193.023,193.019,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1815.69,1814.22,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,188.915,188.909,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2936.56,2895.08,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,272.917,272.911,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3143.58,3093.29,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,298.914,298.909,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,306.648,294.85,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,205.812,205.806,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,269.061,268.492,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.241,200.235,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,932.898,923.331,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,927.791,927.735,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,30422,28148.7,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4295.1,4294.93,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,138635,87278.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30805.3,30805,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.11973e+06,1.11908e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,154695,154694,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.14994e+06,1.14906e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138763,138762,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42844,41124,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14824.5,14824.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75328.8,74044.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27715.7,27715.1,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:1 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2764.81,1900.04,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,694.985,694.985,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1216.32,1123.54,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,720.62,720.627,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,826.12,754.384,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.484,177.482,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2253.23,2252.48,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.334,189.328,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1838.86,1838.59,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.571,187.566,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2900.31,2848.73,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,263.332,263.281,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3044.82,2997.71,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,299.397,299.393,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,295.171,289.665,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,203.121,203.12,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,256.383,252.49,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,185.624,185.622,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,966.92,950.596,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,945.259,945.211,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,35843,33591.5,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4738.75,4738.6,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,140896,89003.2,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,32762.6,32760.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19665e+06,1.19505e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,160390,160389,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.25537e+06,1.25475e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,141738,141737,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42120,41471.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15134.5,15134.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76302.7,76262.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,24911.5,24911,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:4 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2373.57,1815.89,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,708.215,708.211,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1206.98,1157.2,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,735.105,735.103,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,831.424,755.063,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,183.023,183.015,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,1983.6,1981.93,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,193.023,193.019,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1815.69,1814.22,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,188.915,188.909,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2936.56,2895.08,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,272.917,272.911,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3143.58,3093.29,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,298.914,298.909,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,306.648,294.85,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,205.812,205.806,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,269.061,268.492,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.241,200.235,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,932.898,923.331,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,927.791,927.735,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,30422,28148.7,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4295.1,4294.93,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,138635,87278.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30805.3,30805,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.11973e+06,1.11908e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,154695,154694,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.14994e+06,1.14906e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138763,138762,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42844,41124,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14824.5,14824.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75328.8,74044.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27715.7,27715.1,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:1 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2373.57,1815.89,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,708.215,708.211,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1206.98,1157.2,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,735.105,735.103,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,831.424,755.063,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,183.023,183.015,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,1983.6,1981.93,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,193.023,193.019,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1815.69,1814.22,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,188.915,188.909,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2936.56,2895.08,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,272.917,272.911,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3143.58,3093.29,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,298.914,298.909,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,306.648,294.85,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,205.812,205.806,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,269.061,268.492,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.241,200.235,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,932.898,923.331,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,927.791,927.735,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,30422,28148.7,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4295.1,4294.93,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,138635,87278.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30805.3,30805,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.11973e+06,1.11908e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,154695,154694,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.14994e+06,1.14906e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138763,138762,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42844,41124,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14824.5,14824.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75328.8,74044.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27715.7,27715.1,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2437.54,1817.97,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,688.508,688.505,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1184.15,1093.31,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,695.751,695.754,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,753.346,727.617,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.995,173.994,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2184.03,2182.57,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,192.264,192.22,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1870.97,1870.63,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,188.16,188.155,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3328.5,3279.48,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,262.784,262.778,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3410.01,3357.55,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,294.458,294.451,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,301.359,298.08,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,214.39,214.39,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,260.359,259.494,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,202.78,202.775,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,953.037,949.463,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,934.56,934.549,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,30027,28169.9,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4349.98,4349.83,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137289,86382,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31499.3,31499.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27019e+06,1.26961e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,157278,157277,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27141e+06,1.27046e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,140676,140669,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41618.2,40749.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14590.7,14590.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,73967.8,73947,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25524.1,25523.4,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2373.57,1815.89,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,708.215,708.211,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1206.98,1157.2,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,735.105,735.103,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,831.424,755.063,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,183.023,183.015,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,1983.6,1981.93,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,193.023,193.019,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1815.69,1814.22,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,188.915,188.909,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2936.56,2895.08,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,272.917,272.911,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3143.58,3093.29,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,298.914,298.909,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,306.648,294.85,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,205.812,205.806,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,269.061,268.492,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.241,200.235,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,932.898,923.331,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,927.791,927.735,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,30422,28148.7,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4295.1,4294.93,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,138635,87278.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30805.3,30805,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.11973e+06,1.11908e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,154695,154694,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.14994e+06,1.14906e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138763,138762,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42844,41124,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14824.5,14824.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75328.8,74044.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27715.7,27715.1,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2437.54,1817.97,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,688.508,688.505,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1184.15,1093.31,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,695.751,695.754,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,753.346,727.617,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.995,173.994,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2184.03,2182.57,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,192.264,192.22,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1870.97,1870.63,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,188.16,188.155,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3328.5,3279.48,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,262.784,262.778,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3410.01,3357.55,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,294.458,294.451,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,301.359,298.08,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,214.39,214.39,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,260.359,259.494,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,202.78,202.775,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,953.037,949.463,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,934.56,934.549,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,30027,28169.9,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4349.98,4349.83,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137289,86382,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31499.3,31499.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27019e+06,1.26961e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,157278,157277,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27141e+06,1.27046e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,140676,140669,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41618.2,40749.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14590.7,14590.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,73967.8,73947,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25524.1,25523.4,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2373.57,1815.89,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,708.215,708.211,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1206.98,1157.2,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,735.105,735.103,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,831.424,755.063,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,183.023,183.015,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,1983.6,1981.93,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,193.023,193.019,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1815.69,1814.22,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,188.915,188.909,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2936.56,2895.08,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,272.917,272.911,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3143.58,3093.29,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,298.914,298.909,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,306.648,294.85,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,205.812,205.806,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,269.061,268.492,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.241,200.235,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,932.898,923.331,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,927.791,927.735,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,30422,28148.7,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4295.1,4294.93,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,138635,87278.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30805.3,30805,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.11973e+06,1.11908e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,154695,154694,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.14994e+06,1.14906e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138763,138762,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42844,41124,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14824.5,14824.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75328.8,74044.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27715.7,27715.1,ns,,,,,

alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2437.54,1817.97,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,688.508,688.505,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1184.15,1093.31,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,695.751,695.754,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,753.346,727.617,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.995,173.994,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2184.03,2182.57,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,192.264,192.22,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1870.97,1870.63,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,188.16,188.155,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3328.5,3279.48,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,262.784,262.778,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3410.01,3357.55,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,294.458,294.451,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,301.359,298.08,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,214.39,214.39,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,260.359,259.494,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,202.78,202.775,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,953.037,949.463,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,934.56,934.549,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,30027,28169.9,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4349.98,4349.83,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137289,86382,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31499.3,31499.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27019e+06,1.26961e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,157278,157277,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27141e+06,1.27046e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,140676,140669,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41618.2,40749.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14590.7,14590.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,73967.8,73947,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25524.1,25523.4,ns,,,,,

alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2437.54,1817.97,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,688.508,688.505,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1184.15,1093.31,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,695.751,695.754,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,753.346,727.617,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.995,173.994,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2184.03,2182.57,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,192.264,192.22,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1870.97,1870.63,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,188.16,188.155,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3328.5,3279.48,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,262.784,262.778,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3410.01,3357.55,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,294.458,294.451,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,301.359,298.08,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,214.39,214.39,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,260.359,259.494,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,202.78,202.775,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,953.037,949.463,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,934.56,934.549,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,30027,28169.9,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4349.98,4349.83,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137289,86382,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31499.3,31499.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27019e+06,1.26961e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,157278,157277,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27141e+06,1.27046e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,140676,140669,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41618.2,40749.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14590.7,14590.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,73967.8,73947,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25524.1,25523.4,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2373.57,1815.89,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,708.215,708.211,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1206.98,1157.2,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,735.105,735.103,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,831.424,755.063,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,183.023,183.015,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,1983.6,1981.93,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,193.023,193.019,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1815.69,1814.22,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,188.915,188.909,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2936.56,2895.08,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,272.917,272.911,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3143.58,3093.29,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,298.914,298.909,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,306.648,294.85,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,205.812,205.806,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,269.061,268.492,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.241,200.235,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,932.898,923.331,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,927.791,927.735,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,30422,28148.7,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4295.1,4294.93,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,138635,87278.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30805.3,30805,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.11973e+06,1.11908e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,154695,154694,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.14994e+06,1.14906e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138763,138762,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42844,41124,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14824.5,14824.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75328.8,74044.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27715.7,27715.1,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2437.54,1817.97,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,688.508,688.505,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1184.15,1093.31,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,695.751,695.754,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,753.346,727.617,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.995,173.994,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2184.03,2182.57,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,192.264,192.22,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1870.97,1870.63,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,188.16,188.155,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3328.5,3279.48,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,262.784,262.778,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3410.01,3357.55,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,294.458,294.451,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,301.359,298.08,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,214.39,214.39,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,260.359,259.494,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,202.78,202.775,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,953.037,949.463,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,934.56,934.549,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,30027,28169.9,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4349.98,4349.83,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137289,86382,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31499.3,31499.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27019e+06,1.26961e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,157278,157277,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27141e+06,1.27046e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,140676,140669,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41618.2,40749.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14590.7,14590.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,73967.8,73947,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25524.1,25523.4,ns,,,,,

multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2373.57,1815.89,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,708.215,708.211,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1206.98,1157.2,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,735.105,735.103,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,831.424,755.063,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,183.023,183.015,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,1983.6,1981.93,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,193.023,193.019,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1815.69,1814.22,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,188.915,188.909,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2936.56,2895.08,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,272.917,272.911,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3143.58,3093.29,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,298.914,298.909,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,306.648,294.85,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,205.812,205.806,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,269.061,268.492,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.241,200.235,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,932.898,923.331,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,927.791,927.735,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,30422,28148.7,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4295.1,4294.93,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,138635,87278.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30805.3,30805,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.11973e+06,1.11908e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,154695,154694,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.14994e+06,1.14906e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138763,138762,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42844,41124,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14824.5,14824.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75328.8,74044.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27715.7,27715.1,ns,,,,,

multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2437.54,1817.97,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,688.508,688.505,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1184.15,1093.31,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,695.751,695.754,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,753.346,727.617,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.995,173.994,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2184.03,2182.57,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,192.264,192.22,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1870.97,1870.63,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,188.16,188.155,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3328.5,3279.48,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,262.784,262.778,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3410.01,3357.55,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,294.458,294.451,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,301.359,298.08,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,214.39,214.39,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,260.359,259.494,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,202.78,202.775,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,953.037,949.463,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,934.56,934.549,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,30027,28169.9,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4349.98,4349.83,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137289,86382,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31499.3,31499.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27019e+06,1.26961e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,157278,157277,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27141e+06,1.27046e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,140676,140669,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41618.2,40749.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14590.7,14590.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,73967.8,73947,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25524.1,25523.4,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2764.81,1900.04,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,694.985,694.985,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1216.32,1123.54,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,720.62,720.627,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,826.12,754.384,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.484,177.482,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2253.23,2252.48,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.334,189.328,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1838.86,1838.59,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.571,187.566,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2900.31,2848.73,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,263.332,263.281,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3044.82,2997.71,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,299.397,299.393,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,295.171,289.665,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,203.121,203.12,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,256.383,252.49,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,185.624,185.622,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,966.92,950.596,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,945.259,945.211,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,35843,33591.5,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4738.75,4738.6,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,140896,89003.2,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,32762.6,32760.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19665e+06,1.19505e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,160390,160389,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.25537e+06,1.25475e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,141738,141737,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42120,41471.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15134.5,15134.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76302.7,76262.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,24911.5,24911,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2437.54,1817.97,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,688.508,688.505,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1184.15,1093.31,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,695.751,695.754,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,753.346,727.617,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.995,173.994,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2184.03,2182.57,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,192.264,192.22,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1870.97,1870.63,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,188.16,188.155,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3328.5,3279.48,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,262.784,262.778,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3410.01,3357.55,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,294.458,294.451,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,301.359,298.08,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,214.39,214.39,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,260.359,259.494,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,202.78,202.775,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,953.037,949.463,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,934.56,934.549,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,30027,28169.9,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4349.98,4349.83,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137289,86382,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31499.3,31499.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27019e+06,1.26961e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,157278,157277,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27141e+06,1.27046e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,140676,140669,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41618.2,40749.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14590.7,14590.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,73967.8,73947,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25524.1,25523.4,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2764.81,1900.04,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,694.985,694.985,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1216.32,1123.54,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,720.62,720.627,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,826.12,754.384,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.484,177.482,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2253.23,2252.48,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.334,189.328,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1838.86,1838.59,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.571,187.566,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2900.31,2848.73,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,263.332,263.281,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3044.82,2997.71,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,299.397,299.393,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,295.171,289.665,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,203.121,203.12,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,256.383,252.49,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,185.624,185.622,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,966.92,950.596,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,945.259,945.211,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,35843,33591.5,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4738.75,4738.6,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,140896,89003.2,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,32762.6,32760.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19665e+06,1.19505e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,160390,160389,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.25537e+06,1.25475e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,141738,141737,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42120,41471.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15134.5,15134.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76302.7,76262.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,24911.5,24911,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2437.54,1817.97,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,688.508,688.505,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1184.15,1093.31,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,695.751,695.754,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,753.346,727.617,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.995,173.994,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2184.03,2182.57,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,192.264,192.22,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1870.97,1870.63,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,188.16,188.155,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3328.5,3279.48,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,262.784,262.778,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3410.01,3357.55,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,294.458,294.451,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,301.359,298.08,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,214.39,214.39,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,260.359,259.494,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,202.78,202.775,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,953.037,949.463,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,934.56,934.549,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,30027,28169.9,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4349.98,4349.83,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137289,86382,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31499.3,31499.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27019e+06,1.26961e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,157278,157277,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27141e+06,1.27046e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,140676,140669,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41618.2,40749.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14590.7,14590.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,73967.8,73947,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25524.1,25523.4,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2764.81,1900.04,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,694.985,694.985,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1216.32,1123.54,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,720.62,720.627,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,826.12,754.384,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.484,177.482,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2253.23,2252.48,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.334,189.328,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1838.86,1838.59,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.571,187.566,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2900.31,2848.73,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,263.332,263.281,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3044.82,2997.71,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,299.397,299.393,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,295.171,289.665,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,203.121,203.12,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,256.383,252.49,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,185.624,185.622,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,966.92,950.596,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,945.259,945.211,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,35843,33591.5,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4738.75,4738.6,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,140896,89003.2,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,32762.6,32760.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19665e+06,1.19505e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,160390,160389,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.25537e+06,1.25475e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,141738,141737,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42120,41471.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15134.5,15134.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76302.7,76262.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,24911.5,24911,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2373.57,1815.89,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,708.215,708.211,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1206.98,1157.2,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,735.105,735.103,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,831.424,755.063,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,183.023,183.015,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,1983.6,1981.93,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,193.023,193.019,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1815.69,1814.22,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,188.915,188.909,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2936.56,2895.08,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,272.917,272.911,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3143.58,3093.29,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,298.914,298.909,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,306.648,294.85,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,205.812,205.806,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,269.061,268.492,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.241,200.235,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,932.898,923.331,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,927.791,927.735,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,30422,28148.7,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4295.1,4294.93,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,138635,87278.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30805.3,30805,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.11973e+06,1.11908e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,154695,154694,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.14994e+06,1.14906e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138763,138762,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42844,41124,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14824.5,14824.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75328.8,74044.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27715.7,27715.1,ns,,,,,

multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2373.57,1815.89,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,708.215,708.211,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1206.98,1157.2,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,735.105,735.103,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,831.424,755.063,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,183.023,183.015,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,1983.6,1981.93,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,193.023,193.019,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1815.69,1814.22,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,188.915,188.909,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2936.56,2895.08,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,272.917,272.911,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3143.58,3093.29,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,298.914,298.909,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,306.648,294.85,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,205.812,205.806,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,269.061,268.492,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.241,200.235,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,932.898,923.331,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,927.791,927.735,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,30422,28148.7,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4295.1,4294.93,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,138635,87278.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30805.3,30805,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.11973e+06,1.11908e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,154695,154694,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.14994e+06,1.14906e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138763,138762,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42844,41124,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14824.5,14824.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75328.8,74044.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27715.7,27715.1,ns,,,,,

multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2437.54,1817.97,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,688.508,688.505,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1184.15,1093.31,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,695.751,695.754,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,753.346,727.617,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.995,173.994,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2184.03,2182.57,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,192.264,192.22,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1870.97,1870.63,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,188.16,188.155,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3328.5,3279.48,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,262.784,262.778,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3410.01,3357.55,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,294.458,294.451,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,301.359,298.08,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,214.39,214.39,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,260.359,259.494,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,202.78,202.775,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,953.037,949.463,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,934.56,934.549,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,30027,28169.9,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4349.98,4349.83,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137289,86382,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31499.3,31499.1,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27019e+06,1.26961e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,157278,157277,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.27141e+06,1.27046e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,140676,140669,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41618.2,40749.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14590.7,14590.4,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,73967.8,73947,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,25524.1,25523.4,ns,,,,,

Copy link
Contributor

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/12825782769

Copy link
Contributor

Compute Benchmarks level_zero run ():
https://github.com/oneapi-src/unified-runtime/actions/runs/12825782769
Job status: success. Test status: success.

Summary

Total 83 benchmarks in mean.
Geomean 97.918%.
Improved 7 Regressed 29 (threshold 2.00%)

(result is better)

Performance change in benchmark groups

Relative perf in group api (12): 98.992%
Benchmark This PR baseline Relative perf Change -
api_overhead_benchmark_ur SubmitKernel in order 15.982000 μs 16.663 μs 104.26% 4.26% +
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024 1.655000 μs 1.675 μs 101.21% 1.21% .
api_overhead_benchmark_ur SubmitKernel out of order 15.750000 μs 15.896 μs 100.93% 0.93% .
api_overhead_benchmark_l0 SubmitKernel out of order 11.672 μs 11.528000 μs 98.77% -1.23% .
api_overhead_benchmark_sycl SubmitKernel in order 25.375 μs 24.844000 μs 97.91% -2.09% -
api_overhead_benchmark_ur SubmitKernel out of order CPU count 104663.000 instr 101923.000000 instr 97.38% -2.62% -
api_overhead_benchmark_ur SubmitKernel in order CPU count 110006.000 instr 107041.000000 instr 97.30% -2.70% -
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024 2.187 μs 2.118000 μs 96.84% -3.16% -
api_overhead_benchmark_sycl SubmitKernel out of order 24.514 μs 23.678000 μs 96.59% -3.41% -
api_overhead_benchmark_l0 SubmitKernel in order 11.162000 μs -
api_overhead_benchmark_ur SubmitKernel in order with measure completion CPU count 123353.000000 instr -
api_overhead_benchmark_ur SubmitKernel in order with measure completion 21.568000 μs -
Relative perf in group memory (4): 86.496%
Benchmark This PR baseline Relative perf Change -
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024 255.419 μs 253.805000 μs 99.37% -0.63% .
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240 3.039 GB/s 3.151000 GB/s 96.45% -3.55% -
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024 5.853 μs 5.638000 μs 96.33% -3.67% -
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024 219.235 μs 132.929000 μs 60.63% -39.37% ----------
Relative perf in group miscellaneous (1): 106.585%
Benchmark This PR baseline Relative perf Change -
miscellaneous_benchmark_sycl VectorSum 805.564000 bw GB/s 858.609 bw GB/s 106.58% 6.58% ++
Relative perf in group multithread (10): 97.484%
Benchmark This PR baseline Relative perf Change -
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:1 dstUSM:1 46713.383000 μs 47907.007 μs 102.56% 2.56% +
multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:1 dstUSM:1 17003.876000 μs 17316.620 μs 101.84% 1.84% .
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:1 dstUSM:1 6972.475 μs 6935.535000 μs 99.47% -0.53% .
multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:1 dstUSM:1 2059.267 μs 2022.915000 μs 98.23% -1.77% .
multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:1, allocSize:1024 srcUSM:0 dstUSM:1 without events 42021.477 μs 40973.625000 μs 97.51% -2.49% -
multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:0 dstUSM:1 1203.256 μs 1157.521000 μs 96.20% -3.80% -
multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:0 dstUSM:1 8924.850 μs 8555.721000 μs 95.86% -4.14% -
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:0 dstUSM:1 7847.707 μs 7452.758000 μs 94.97% -5.03% -
multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:4, allocSize:1024 srcUSM:0 dstUSM:1 without events 114323.514 μs 108338.415000 μs 94.76% -5.24% -
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:0 dstUSM:1 27217.618 μs 25543.132000 μs 93.85% -6.15% --
Relative perf in group Velocity-Bench (9): 99.365%
Benchmark This PR baseline Relative perf Change -
Velocity-Bench QuickSilver 117.900000 MMS/CTT 117.490 MMS/CTT 100.35% 0.35% .
Velocity-Bench Easywave 229.000000 ms 229.000 ms 100.00% 0.00% .
Velocity-Bench Bitcracker 35.177 s 35.129800 s 99.87% -0.13% .
Velocity-Bench CudaSift 203.399 ms 201.142000 ms 98.89% -1.11% .
Velocity-Bench Sobel Filter 610.897 ms 602.045000 ms 98.55% -1.45% .
Velocity-Bench Hashtable 357.248 M keys/sec 362.504819 M keys/sec 98.55% -1.45% .
Velocity-Bench dl-cifar - 23.743900 s
Velocity-Bench dl-mnist - 2.720000 s
Velocity-Bench svm - 0.139900 s
Relative perf in group Runtime (8): 99.366%
Benchmark This PR baseline Relative perf Change -
Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor 276.096000 ms 278.736 ms 100.96% 0.96% .
Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor 276.307000 ms 278.916 ms 100.94% 0.94% .
Runtime_IndependentDAGTaskThroughput_SingleTask 259.484 ms 259.395000 ms 99.97% -0.03% .
Runtime_DAGTaskThroughput_HierarchicalParallelFor 1726.594 ms 1725.256000 ms 99.92% -0.08% .
Runtime_DAGTaskThroughput_SingleTask 1682.600 ms 1678.732000 ms 99.77% -0.23% .
Runtime_DAGTaskThroughput_NDRangeParallelFor 1705.339 ms 1695.816000 ms 99.44% -0.56% .
Runtime_DAGTaskThroughput_BasicParallelFor 1763.156 ms 1746.233000 ms 99.04% -0.96% .
Runtime_IndependentDAGTaskThroughput_BasicParallelFor 289.823 ms 275.382000 ms 95.02% -4.98% -
Relative perf in group MicroBench (14): 94.368%
Benchmark This PR baseline Relative perf Change -
MicroBench_HostDeviceBandwidth_2D_D2H_Strided 617.273000 ms 617.523 ms 100.04% 0.04% .
MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous 617.915000 ms 617.994 ms 100.01% 0.01% .
MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous 617.953000 ms 617.954 ms 100.00% 0.00% .
MicroBench_LocalMem_int32_4096 29.867 ms 29.866000 ms 100.00% -0.00% .
MicroBench_HostDeviceBandwidth_3D_D2H_Strided 617.297 ms 617.254000 ms 99.99% -0.01% .
MicroBench_LocalMem_fp32_4096 29.903 ms 29.833000 ms 99.77% -0.23% .
MicroBench_HostDeviceBandwidth_2D_H2D_Strided 5.067 ms 4.781000 ms 94.36% -5.64% -
MicroBench_HostDeviceBandwidth_1D_H2D_Strided 4.849 ms 4.547000 ms 93.77% -6.23% --
MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous 4.816 ms 4.414000 ms 91.65% -8.35% --
MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous 4.731 ms 4.322000 ms 91.35% -8.65% --
MicroBench_HostDeviceBandwidth_3D_H2D_Strided 5.013 ms 4.574000 ms 91.24% -8.76% --
MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous 4.735 ms 4.317000 ms 91.17% -8.83% --
MicroBench_HostDeviceBandwidth_1D_D2H_Strided 5.209 ms 4.702000 ms 90.27% -9.73% --
MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous 5.298 ms 4.238000 ms 79.99% -20.01% -----
Relative perf in group Pattern (10): 102.072%
Benchmark This PR baseline Relative perf Change -
Pattern_Reduction_Hierarchical_int32 14.720000 ms 16.411 ms 111.49% 11.49% +++
Pattern_Reduction_NDRange_int32 14.547000 ms 16.163 ms 111.11% 11.11% +++
Pattern_SegmentedReduction_Hierarchical_int32 11.590000 ms 11.599 ms 100.08% 0.08% .
Pattern_SegmentedReduction_Hierarchical_fp32 11.594 ms 11.589000 ms 99.96% -0.04% .
Pattern_SegmentedReduction_NDRange_int16 2.265 ms 2.264000 ms 99.96% -0.04% .
Pattern_SegmentedReduction_Hierarchical_int64 11.785 ms 11.779000 ms 99.95% -0.05% .
Pattern_SegmentedReduction_Hierarchical_int16 11.808 ms 11.801000 ms 99.94% -0.06% .
Pattern_SegmentedReduction_NDRange_int64 2.338 ms 2.336000 ms 99.91% -0.09% .
Pattern_SegmentedReduction_NDRange_fp32 2.168 ms 2.163000 ms 99.77% -0.23% .
Pattern_SegmentedReduction_NDRange_int32 2.174 ms 2.164000 ms 99.54% -0.46% .
Relative perf in group ScalarProduct (6): 99.931%
Benchmark This PR baseline Relative perf Change -
ScalarProduct_Hierarchical_int64 11.458000 ms 11.490 ms 100.28% 0.28% .
ScalarProduct_Hierarchical_int32 10.516000 ms 10.523 ms 100.07% 0.07% .
ScalarProduct_Hierarchical_fp32 10.170000 ms 10.170 ms 100.00% 0.00% .
ScalarProduct_NDRange_fp32 3.761 ms 3.759000 ms 99.95% -0.05% .
ScalarProduct_NDRange_int32 3.743 ms 3.733000 ms 99.73% -0.27% .
ScalarProduct_NDRange_int64 5.480 ms 5.456000 ms 99.56% -0.44% .
Relative perf in group USM (7): 97.014%
Benchmark This PR baseline Relative perf Change -
USM_Allocation_latency_fp32_host 37.748000 ms 37.899 ms 100.40% 0.40% .
USM_Allocation_latency_fp32_shared 0.066000 ms 0.066 ms 100.00% 0.00% .
USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch 1.701 ms 1.661000 ms 97.65% -2.35% -
USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch 1.873 ms 1.814000 ms 96.85% -3.15% -
USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch 1.093 ms 1.046000 ms 95.70% -4.30% -
USM_Allocation_latency_fp32_device 0.072 ms 0.068000 ms 94.44% -5.56% -
USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch 1.268 ms 1.195000 ms 94.24% -5.76% -
Relative perf in group VectorAddition (3): 101.177%
Benchmark This PR baseline Relative perf Change -
VectorAddition_int64 3.056000 ms 3.139 ms 102.72% 2.72% +
VectorAddition_int32 1.440000 ms 1.448 ms 100.56% 0.56% .
VectorAddition_fp32 1.441000 ms 1.445 ms 100.28% 0.28% .
Relative perf in group Polybench (3): 100.864%
Benchmark This PR baseline Relative perf Change -
Polybench_Atax 6.713000 ms 6.880 ms 102.49% 2.49% +
Polybench_2mm 1.211000 ms 1.216 ms 100.41% 0.41% .
Polybench_3mm 1.732 ms 1.727000 ms 99.71% -0.29% .
Relative perf in group Kmeans (1): 100.274%
Benchmark This PR baseline Relative perf Change -
Kmeans_fp32 16.039000 ms 16.083 ms 100.27% 0.27% .
Relative perf in group MolecularDynamics (1): 96.552%
Benchmark This PR baseline Relative perf Change -
MolecularDynamics 0.029 ms 0.028000 ms 96.55% -3.45% -
Relative perf in group alloc/size:10000/0/4096/iterations:200000/threads:4 (4): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/size:10000/0/4096/iterations:200000/threads:4 glibc 2738.350000 ns -
alloc/size:10000/0/4096/iterations:200000/threads:4 os_provider 2018.430000 ns -
alloc/size:10000/0/4096/iterations:200000/threads:4 proxy_pool<os_provider> 3019.300000 ns -
alloc/size:10000/0/4096/iterations:200000/threads:4 scalable_pool<os_provider> 297.523000 ns -
Relative perf in group alloc/size:10000/0/4096/iterations:200000/threads:1 (4): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/size:10000/0/4096/iterations:200000/threads:1 glibc 723.083000 ns -
alloc/size:10000/0/4096/iterations:200000/threads:1 os_provider 191.762000 ns -
alloc/size:10000/0/4096/iterations:200000/threads:1 proxy_pool<os_provider> 270.528000 ns -
alloc/size:10000/0/4096/iterations:200000/threads:1 scalable_pool<os_provider> 216.749000 ns -
Relative perf in group alloc/size:10000/100000/4096/iterations:200000/threads:4 (4): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/size:10000/100000/4096/iterations:200000/threads:4 glibc 1257.310000 ns -
alloc/size:10000/100000/4096/iterations:200000/threads:4 os_provider 1834.630000 ns -
alloc/size:10000/100000/4096/iterations:200000/threads:4 proxy_pool<os_provider> 3343.820000 ns -
alloc/size:10000/100000/4096/iterations:200000/threads:4 scalable_pool<os_provider> 254.273000 ns -
Relative perf in group alloc/size:10000/100000/4096/iterations:200000/threads:1 (4): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/size:10000/100000/4096/iterations:200000/threads:1 glibc 737.976000 ns -
alloc/size:10000/100000/4096/iterations:200000/threads:1 os_provider 191.662000 ns -
alloc/size:10000/100000/4096/iterations:200000/threads:1 proxy_pool<os_provider> 285.897000 ns -
alloc/size:10000/100000/4096/iterations:200000/threads:1 scalable_pool<os_provider> 200.089000 ns -
Relative perf in group alloc/min (4): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4 glibc 818.174000 ns -
alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1 glibc 177.341000 ns -
alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4 scalable_pool<os_provider> 964.486000 ns -
alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1 scalable_pool<os_provider> 1045.300000 ns -
Relative perf in group multiple (24): cannot calculate
Benchmark This PR baseline Relative perf Change -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 glibc 35693.200000 ns -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 glibc 4319.450000 ns -
multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4 glibc 140646.000000 ns -
multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1 glibc 31390.100000 ns -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 proxy_pool<os_provider> 1167170.000000 ns -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 proxy_pool<os_provider> 156232.000000 ns -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 os_provider 1191460.000000 ns -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 os_provider 138974.000000 ns -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 scalable_pool<os_provider> 43158.900000 ns -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 scalable_pool<os_provider> 15529.200000 ns -
multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4 scalable_pool<os_provider> 75604.000000 ns -
multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1 scalable_pool<os_provider> 27973.700000 ns -
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 glibc - 32574.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 glibc - 4128.530000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 glibc - 138399.000000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 glibc - 28197.400000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 proxy_pool<os_provider> - 1161430.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 proxy_pool<os_provider> - 161766.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 os_provider - 1166110.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 os_provider - 141737.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 scalable_pool<os_provider> - 42212.800000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 scalable_pool<os_provider> - 14889.200000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 scalable_pool<os_provider> - 72778.500000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 scalable_pool<os_provider> - 27538.700000 ns
Relative perf in group llama.cpp (6): cannot calculate
Benchmark This PR baseline Relative perf Change -
llama.cpp Prompt Processing Batched 128 - 838.869803 token/s
llama.cpp Text Generation Batched 128 - 63.338561 token/s
llama.cpp Prompt Processing Batched 256 - 872.377637 token/s
llama.cpp Text Generation Batched 256 - 63.361520 token/s
llama.cpp Prompt Processing Batched 512 - 434.541716 token/s
llama.cpp Text Generation Batched 512 - 63.295460 token/s
Relative perf in group alloc/max (20): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 glibc - 2589.180000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 glibc - 710.936000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 glibc - 1188.310000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 glibc - 716.901000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 glibc - 861.597000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 glibc - 175.935000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 os_provider - 2246.790000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 os_provider - 187.819000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 os_provider - 1690.250000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 os_provider - 189.702000 ns
alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4 proxy_pool<os_provider> - 4441.700000 ns
alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1 proxy_pool<os_provider> - 256.696000 ns
alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4 proxy_pool<os_provider> - 3268.220000 ns
alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1 proxy_pool<os_provider> - 306.439000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 scalable_pool<os_provider> - 299.852000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 scalable_pool<os_provider> - 213.534000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 scalable_pool<os_provider> - 263.904000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 scalable_pool<os_provider> - 197.833000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 scalable_pool<os_provider> - 1051.720000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 scalable_pool<os_provider> - 952.492000 ns

Details

Benchmark details - environment, command, output...
api_overhead_benchmark_l0 SubmitKernel out of order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_l0 --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=l0 Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),11.720,11.672,3.13%,10.927,69.478,[CPU],[us]

api_overhead_benchmark_l0 SubmitKernel in order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_l0 --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=l0 Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),11.196,11.162,2.34%,10.429,23.165,[CPU],[us]

api_overhead_benchmark_sycl SubmitKernel out of order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=sycl Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),24.653,24.514,2.29%,23.525,106.134,[CPU],[us]

api_overhead_benchmark_sycl SubmitKernel in order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=sycl Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),25.547,25.375,4.47%,24.523,342.284,[CPU],[us]

memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=QueueInOrderMemcpy --csv --noHeaders --iterations=10000 --IsCopyOnly=0 --sourcePlacement=Device --destinationPlacement=Device --size=1024 --count=100

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
QueueInOrderMemcpy(api=sycl IsCopyOnly=0 sourcePlacement=Device destinationPlacement=Device size=1KB count=100),256.623,255.419,1.82%,249.828,498.960,[CPU],[us]

memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=QueueInOrderMemcpy --csv --noHeaders --iterations=10000 --IsCopyOnly=0 --sourcePlacement=Host --destinationPlacement=Device --size=1024 --count=100

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
QueueInOrderMemcpy(api=sycl IsCopyOnly=0 sourcePlacement=Host destinationPlacement=Device size=1KB count=100),240.527,219.235,24.32%,217.767,519.622,[CPU],[us]

memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=QueueMemcpy --csv --noHeaders --iterations=10000 --sourcePlacement=Device --destinationPlacement=Device --size=1024

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
QueueMemcpy(api=sycl sourcePlacement=Device destinationPlacement=Device size=1KB),6.223,5.853,15.42%,5.348,43.573,[CPU],[us]

memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=StreamMemory --csv --noHeaders --iterations=10000 --type=Triad --size=10240 --memoryPlacement=Device --useEvents=0 --contents=Zeros --multiplier=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
StreamMemory(api=sycl type=Triad size=10KB useEvents=0 contents=Zeros memoryPlacement=Device multiplier=1),2.976,3.039,6.55%,0.264,3.290,[CPU],[GB/s]

api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=ExecImmediateCopyQueue --csv --noHeaders --iterations=100000 --ioq=0 --IsCopyOnly=1 --MeasureCompletionTime=0 --src=Device --dst=Device --size=1024

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
ExecImmediateCopyQueue(api=sycl IsCopyOnly=1 MeasureCompletionTime=0 src=Device dst=Device size=1KB ioq=0),2.194,2.187,10.78%,1.985,66.594,[CPU],[us]

api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=ExecImmediateCopyQueue --csv --noHeaders --iterations=100000 --ioq=1 --IsCopyOnly=1 --MeasureCompletionTime=0 --src=Host --dst=Host --size=1024

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
ExecImmediateCopyQueue(api=sycl IsCopyOnly=1 MeasureCompletionTime=0 src=Host dst=Host size=1KB ioq=1),1.666,1.655,5.23%,1.567,12.295,[CPU],[us]

miscellaneous_benchmark_sycl VectorSum

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/miscellaneous_benchmark_sycl --test=VectorSum --csv --noHeaders --iterations=1000 --numberOfElementsX=512 --numberOfElementsY=256 --numberOfElementsZ=256

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
VectorSum(api=sycl numberOfElementsX=512 numberOfElementsY=256 numberOfElementsZ=256),804.860,805.564,0.45%,770.540,812.325,[GPU],bw [GB/s]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=1 --NumOpsPerThread=400 --iterations=10 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=1 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),6987.446,6972.475,0.71%,6935.264,7089.674,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=8 --NumOpsPerThread=100 --iterations=10 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=100 NumThreads=8 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),17004.499,17003.876,3.91%,15661.347,18047.091,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=8 --NumOpsPerThread=400 --iterations=1000 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=8 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),46727.795,46713.383,1.60%,44125.673,49078.653,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=16 --NumOpsPerThread=10 --iterations=10000 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=10 NumThreads=16 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),2140.351,2059.267,37.34%,1532.449,17980.812,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=1 --NumOpsPerThread=400 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=1 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),7879.450,7847.707,1.40%,7780.354,8156.193,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=8 --NumOpsPerThread=100 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=100 NumThreads=8 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),9089.047,8924.850,3.93%,8746.972,9998.254,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=8 --NumOpsPerThread=400 --iterations=1000 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=8 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),27330.346,27217.618,1.79%,26042.888,28773.474,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=16 --NumOpsPerThread=10 --iterations=10000 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=10 NumThreads=16 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),1312.365,1203.256,61.30%,905.040,15445.032,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:1, allocSize:1024 srcUSM:0 dstUSM:1 without events

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=0 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=1 --NumOpsPerThread=4096 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=4096 NumThreads=1 AllocSize=1024 MeasureCompletion=1 UseEvents=0 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),42071.530,42021.477,0.54%,41895.886,42725.230,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:4, allocSize:1024 srcUSM:0 dstUSM:1 without events

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=0 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=4 --NumOpsPerThread=4096 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=4096 NumThreads=4 AllocSize=1024 MeasureCompletion=1 UseEvents=0 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),114281.635,114323.514,0.18%,113773.691,114648.341,[CPU],[us]

api_overhead_benchmark_ur SubmitKernel out of order CPU count

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),104723.682,104663.000,6.05%,104528.000,2106113.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),15.870,15.632,331.61%,14.857,16656.876,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel out of order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),104723.682,104663.000,6.05%,104528.000,2106113.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),15.997,15.750,333.23%,15.059,16872.141,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel in order CPU count

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),110073.284,110006.000,4.01%,110006.000,1499816.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),16.719,16.514,258.57%,15.674,13685.210,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel in order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),110073.284,110006.000,4.01%,110006.000,1499816.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),16.213,15.982,315.68%,15.137,16197.936,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel in order with measure completion CPU count

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=1 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=1),123662.516,123353.000,3.83%,122852.000,1588481.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=1),21.927,21.591,245.44%,20.038,17036.529,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel in order with measure completion

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=1 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=1),123653.762,123166.000,3.83%,122539.000,1588167.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=1),21.917,21.568,239.12%,20.047,16588.956,[CPU],time [us]

Velocity-Bench Hashtable

Environment Variables:

Command:

/home/pmdk/bench_workdir/hashtable/hashtable_sycl --no-verify

Output:

hashtable - total time for whole calculation: 0.3757 s
357.247554 million keys/second

Velocity-Bench Bitcracker

Environment Variables:

Command:

/home/pmdk/bench_workdir/bitcracker/bitcracker -f /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt -d /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt -b 60000

Output:

---------> BitCracker: BitLocker password cracking tool <---------

==================================
Retrieving Info

Reading hash file "/home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt"

              Attack

================================================
Type of attack: User Password
Psw per thread: 1
max_num_pswd_per_read: 60000
Dictionary: /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt
MAC Comparison (-m): Yes

Iter: 1, num passwords read: 60000
Kernel execution:
Effective passwords: 60000
Passwords Range:
npknpByH7N2m3OnLNH1X9DJxLrzIFWk
.....
dL_7uuf3QCz-c6K3xDu0

================================================
Bitcracker attack completed
Total passwords evaluated: 60000
Password not found!

time to subtract from total: 0.00402114 s
bitcracker - total time for whole calculation: 35.177 s

Velocity-Bench CudaSift

Environment Variables:

Command:

/home/pmdk/bench_workdir/cudaSift/cudaSift

Output:

UNKN:

UNKN: ==================================================
UNKN: User input parameters:
UNKN: Trace: ../../inputData
UNKN: ==================================================
UNKN:

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1221 1254 33.1523% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1239 1274 33.6411% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1140 1275 30.953% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1095 1273 29.7312% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1236 1271 33.5596% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1237 1271 33.5868% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1130 1273 30.6815% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1225 1256 33.2609% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1217 1253 33.0437% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1125 1265 30.5458% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1203 1256 32.6636% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1237 1270 33.5868% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1238 1273 33.6139% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1093 1267 29.6769% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1234 1266 33.5053% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1085 1258 29.4597% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1227 1262 33.3152% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1214 1251 32.9623% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1094 1267 29.704% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1081 1253 29.3511% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1086 1269 29.4868% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1122 1270 30.4643% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1119 1273 30.3828% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1109 1254 30.1113% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1111 1273 30.1656% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1077 1268 29.2425% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1088 1253 29.5411% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1239 1272 33.6411% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1107 1259 30.057% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1238 1273 33.6139% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1213 1268 32.9351% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1152 1262 31.2788% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1208 1261 32.7993% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1234 1269 33.5053% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1066 1262 28.9438% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1144 1272 31.0616% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1116 1272 30.3014% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1232 1266 33.451% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1232 1276 33.451% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1219 1254 33.098% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1126 1269 30.5729% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1083 1255 29.4054% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1079 1264 29.2968% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1230 1261 33.3967% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1165 1267 31.6318% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1099 1268 29.8398% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1132 1273 30.7358% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1225 1261 33.2609% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1212 1244 32.908% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1094 1267 29.704% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Avg workload time = 203.399 ms

Velocity-Bench Easywave

Environment Variables:

Command:

/home/pmdk/bench_workdir/easywave/easyWave_sycl -grid /home/pmdk/bench_workdir/data/easywave/examples/e2Asean.grd -source /home/pmdk/bench_workdir/data/easywave/examples/BengkuluSept2007.flt -time 120

Output:

MAIN: Starting SYCL main program
MAIN: Attempting to clean up previous eWave tsunami files
MAIN: Clean up completed
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.6.32224)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
MAIN: Program successfully completed

Velocity-Bench QuickSilver

Environment Variables:

QS_DEVICE=GPU

Command:

/home/pmdk/bench_workdir/QuickSilver/qs -i /home/pmdk/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp

Output:

Copyright (c) 2016
Lawrence Livermore National Security, LLC
All Rights Reserved
Quicksilver Version :
Quicksilver Git Hash :
MPI Version : 3.0
Number of MPI ranks : 1
Number of OpenMP Threads: 1
Number of OpenMP CPUs : 1

Loading params
Finished loading params
Simulation:
dt: 1e-08
fMax: 0.1
inputFile: /home/pmdk/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp
energySpectrum:
boundaryCondition: octant
loadBalance: 1
cycleTimers: 0
debugThreads: 0
lx: 100
ly: 100
lz: 100
nParticles: 10000000
batchSize: 0
nBatches: 10
nSteps: 10
nx: 10
ny: 10
nz: 10
seed: 1029384756
xDom: 0
yDom: 0
zDom: 0
eMax: 20
eMin: 1e-09
nGroups: 230
lowWeightCutoff: 0.001
bTally: 1
fTally: 1
cTally: 1
coralBenchmark: 0
crossSectionsOut:

Geometry:
material: sourceMaterial
shape: brick
xMax: 100
xMin: 0
yMax: 100
yMin: 0
zMax: 100
zMin: 0

Material:
name: sourceMaterial
mass: 1000
nIsotopes: 10
nReactions: 9
sourceRate: 1e+10
totalCrossSection: 0.1
absorptionCrossSection: flat
fissionCrossSection: flat
scatteringCrossSection: flat
absorptionCrossSectionRatio: 0
fissionCrossSectionRatio: 0
scatteringCrossSectionRatio: 1

CrossSection:
name: flat
A: 0
B: 0
C: 0
D: 0
E: 1
nuBar: 2.4
setting GPU
setting parameters
Building partition 0
Building partition 1
Building partition 2
Building partition 3
Building MC_Domain 0
Building MC_Domain 1
Building MC_Domain 2
Building MC_Domain 3
Starting Consistency Check
Finished Consistency Check
Finished initMesh
Started copyMaterialDatabase_device
Finished copyMaterialDatabase_device
Finished copyNuclearData_device
Finished copyDomainDevice
cycle start source rr split absorb scatter fission produce collisn escape census num_seg scalar_flux cycleInit cycleTracking cycleFinalize
0 0 1000000 0 9000000 0 18533189 0 0 18533189 1151780 8848220 55527935 1.854923e+09 4.390690e-01 6.297750e-01 0.000000e+00
1 8848220 1000000 0 151478 0 34281997 0 0 34281997 1664159 8335539 94633679 5.047651e+09 3.738790e-01 7.622130e-01 0.000000e+00
2 8335539 1000000 0 663717 0 34354432 0 0 34354432 1366771 8632485 95010375 7.705930e+09 3.439430e-01 7.783380e-01 0.000000e+00
3 8632485 1000000 0 367978 0 34302727 0 0 34302727 1242216 8758247 94953591 9.992076e+09 3.669560e-01 8.332190e-01 0.000000e+00
4 8758247 1000000 0 242076 0 34141236 0 0 34141236 1168452 8831871 94599337 1.199834e+10 3.386240e-01 7.892990e-01 0.000000e+00
5 8831871 1000000 0 168070 0 33948724 0 0 33948724 1121156 8878785 94148236 1.377636e+10 3.421230e-01 7.638260e-01 0.000000e+00
6 8878785 1000000 0 120572 0 33760567 0 0 33760567 1089103 8910254 93689264 1.535668e+10 3.430190e-01 7.623900e-01 0.000000e+00
7 8910254 1000000 0 89810 0 33552179 0 0 33552179 1065203 8934861 93216931 1.676993e+10 3.405080e-01 7.822750e-01 0.000000e+00
8 8934861 1000000 0 65491 0 33384605 0 0 33384605 1047720 8952632 92768273 1.804559e+10 3.421380e-01 7.819560e-01 0.000000e+00
9 8952632 1000000 0 47165 0 33198494 0 0 33198494 1033968 8965829 92324678 1.920208e+10 3.405340e-01 7.577380e-01 0.000000e+00

Timer Cumulative Cumulative Cumulative Cumulative Cumulative Cumulative
Name number microSecs microSecs microSecs microSecs Efficiency
of calls min avg max stddev Rating
main 1 1.121e+07 1.121e+07 1.121e+07 0.000e+00 100.00
cycleInit 10 3.571e+06 3.571e+06 3.571e+06 0.000e+00 100.00
cycleTracking 10 7.641e+06 7.641e+06 7.641e+06 0.000e+00 100.00
cycleTracking_Kernel 104 4.916e+06 4.916e+06 4.916e+06 0.000e+00 100.00
cycleTracking_MPI 117 2.013e+05 2.013e+05 2.013e+05 0.000e+00 100.00
cycleTracking_Test_Done 0 0.000e+00 0.000e+00 0.000e+00 0.000e+00 0.00
cycleFinalize 20 4.010e+02 4.010e+02 4.010e+02 0.000e+00 100.00
Figure Of Merit 117.90 [Num Mega Segments / Cycle Tracking Time]

Velocity-Bench Sobel Filter

Environment Variables:

OPENCV_IO_MAX_IMAGE_PIXELS=1677721600

Command:

/home/pmdk/bench_workdir/sobel_filter/sobel_filter -i /home/pmdk/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png -n 5

Output:

SYMN: Welcome to the SYCL version of Sobel filter workload.
SYMN: Input image file: /home/pmdk/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png
SYMN: Launching SYCL kernel with # of iterations: 5
time to subtract from total: 7.52273 s
sobelfilter - total time for whole calculation: 0.610897 s

Runtime_IndependentDAGTaskThroughput_SingleTask

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_SingleTask', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.261849', '0.259484', '0.258310', '0.258310 0.259484 0.267751', '0.005145', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_IndependentDAGTaskThroughput_BasicParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_BasicParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.291817', '0.289823', '0.274900', '0.274900 0.289823 0.310728', '0.017997', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.276662', '0.276307', '0.275022', '0.275022 0.276307 0.278656', '0.001843', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.283670', '0.276096', '0.274143', '0.274143 0.276096 0.300771', '0.014842', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_SingleTask

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_SingleTask', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.682388', '1.682600', '1.679350', '1.679350 1.682600 1.685214', '0.002938', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_BasicParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_BasicParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.765438', '1.763156', '1.762996', '1.762996 1.763156 1.770163', '0.004092', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_HierarchicalParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_HierarchicalParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.728155', '1.726594', '1.724325', '1.724325 1.726594 1.733545', '0.004804', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_NDRangeParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_NDRangeParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.704936', '1.705339', '1.703257', '1.703257 1.705339 1.706212', '0.001518', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.006020', '0.005298', '0.004686', '0.004686 0.005298 0.008077', '0.001807', '26.672385', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004787', '0.004735', '0.004730', '0.004730 0.004735 0.004895', '0.000094', '26.426709', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004755', '0.004731', '0.004712', '0.004712 0.004731 0.004822', '0.000059', '26.526927', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004821', '0.004816', '0.004806', '0.004806 0.004816 0.004841', '0.000018', '26.011455', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.617926', '0.617915', '0.617898', '0.617898 0.617915 0.617966', '0.000035', '0.202299', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.617946', '0.617953', '0.617903', '0.617903 0.617953 0.617982', '0.000040', '0.202297', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_1D_H2D_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_H2D_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004744', '0.004849', '0.004531', '0.004531 0.004849 0.004852', '0.000185', '27.587644', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_H2D_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_H2D_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.005013', '0.005067', '0.004892', '0.004892 0.005067 0.005080', '0.000105', '25.552872', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_H2D_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_H2D_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.005056', '0.005013', '0.005007', '0.005007 0.005013 0.005148', '0.000080', '24.963234', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_1D_D2H_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_D2H_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.005184', '0.005209', '0.005035', '0.005035 0.005209 0.005310', '0.000139', '24.827573', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_D2H_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_D2H_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.617345', '0.617273', '0.617248', '0.617248 0.617273 0.617516', '0.000148', '0.202512', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_D2H_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_D2H_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.617303', '0.617297', '0.617239', '0.617239 0.617297 0.617374', '0.000067', '0.202515', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_LocalMem_int32_4096

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/local_mem --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/LocalMem_multi.csv --size=10240000

Output:

['MicroBench_LocalMem_int32_4096', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.029879', '0.029867', '0.029859', '0.029859 0.029867 0.029910', '0.000027', '10448.945648', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '312.000000']

MicroBench_LocalMem_fp32_4096

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/local_mem --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/LocalMem_multi.csv --size=10240000

Output:

['MicroBench_LocalMem_fp32_4096', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.029883', '0.029903', '0.029839', '0.029839 0.029903 0.029908', '0.000038', '10455.982375', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '312.000000']

Pattern_Reduction_NDRange_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_Reduction_multi.csv --size=10240000

Output:

['Pattern_Reduction_NDRange_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.015248', '0.014547', '0.014346', '0.014346 0.014547 0.016852', '0.001392', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_Reduction_Hierarchical_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_Reduction_multi.csv --size=10240000

Output:

['Pattern_Reduction_Hierarchical_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.014542', '0.014720', '0.014116', '0.014116 0.014720 0.014789', '0.000371', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_NDRange_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_NDRange_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.003764', '0.003743', '0.003731', '0.003731 0.003743 0.003820', '0.000048', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_NDRange_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_NDRange_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.005469', '0.005480', '0.005442', '0.005442 0.005480 0.005486', '0.000024', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_NDRange_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_NDRange_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.003758', '0.003761', '0.003739', '0.003739 0.003761 0.003774', '0.000018', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_Hierarchical_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_Hierarchical_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.010501', '0.010516', '0.010469', '0.010469 0.010516 0.010520', '0.000028', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_Hierarchical_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_Hierarchical_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011466', '0.011458', '0.011439', '0.011439 0.011458 0.011501', '0.000032', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_Hierarchical_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_Hierarchical_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.010165', '0.010170', '0.010148', '0.010148 0.010170 0.010175', '0.000014', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_int16

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_int16', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002272', '0.002265', '0.002263', '0.002263 0.002265 0.002290', '0.000015', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002174', '0.002174', '0.002165', '0.002165 0.002174 0.002184', '0.000009', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002340', '0.002338', '0.002334', '0.002334 0.002338 0.002347', '0.000007', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002168', '0.002168', '0.002159', '0.002159 0.002168 0.002176', '0.000008', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_int16

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_int16', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011806', '0.011808', '0.011795', '0.011795 0.011808 0.011814', '0.000010', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011595', '0.011590', '0.011569', '0.011569 0.011590 0.011625', '0.000029', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011780', '0.011785', '0.011759', '0.011759 0.011785 0.011796', '0.000019', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011600', '0.011594', '0.011589', '0.011589 0.011594 0.011619', '0.000016', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Allocation_latency_fp32_device

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Allocation_latency_multi.csv --size=1024000000

Output:

['USM_Allocation_latency_fp32_device', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '1024000000', '0.000065', '0.000072', '0.000050', '0.000050 0.000072 0.000074', '0.000013', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Allocation_latency_fp32_host

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Allocation_latency_multi.csv --size=1024000000

Output:

['USM_Allocation_latency_fp32_host', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '1024000000', '0.037745', '0.037748', '0.037611', '0.037611 0.037748 0.037876', '0.000133', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Allocation_latency_fp32_shared

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Allocation_latency_multi.csv --size=1024000000

Output:

['USM_Allocation_latency_fp32_shared', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '1024000000', '0.000067', '0.000066', '0.000057', '0.000057 0.000066 0.000078', '0.000011', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.002322', '0.001701', '0.001686', '0.001686 0.001701 0.003580', '0.001089', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.001102', '0.001093', '0.001083', '0.001083 0.001093 0.001130', '0.000025', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.001880', '0.001873', '0.001847', '0.001847 0.001873 0.001919', '0.000037', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.001268', '0.001268', '0.001256', '0.001256 0.001268 0.001280', '0.000012', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

VectorAddition_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/VectorAddition_multi.csv --size=102400000

Output:

['VectorAddition_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.001454', '0.001440', '0.001424', '0.001424 0.001440 0.001497', '0.000038', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

VectorAddition_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/VectorAddition_multi.csv --size=102400000

Output:

['VectorAddition_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.003054', '0.003056', '0.003048', '0.003048 0.003056 0.003057', '0.000005', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

VectorAddition_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/VectorAddition_multi.csv --size=102400000

Output:

['VectorAddition_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.001452', '0.001441', '0.001440', '0.001440 0.001441 0.001474', '0.000020', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Polybench_2mm

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/2mm --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/2mm.csv --size=512

Output:

['Polybench_2mm', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.001216', '0.001211', '0.001210', '0.001210 0.001211 0.001227', '0.000010', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Polybench_3mm

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/3mm --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/3mm.csv --size=512

Output:

['Polybench_3mm', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.001731', '0.001732', '0.001726', '0.001726 0.001732 0.001734', '0.000004', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Polybench_Atax

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/atax --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Atax.csv --size=8192

Output:

['Polybench_Atax', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.006743', '0.006713', '0.006689', '0.006689 0.006713 0.006828', '0.000074', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Kmeans_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/kmeans --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Kmeans.csv --size=700000000

Output:

['Kmeans_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '700000000', '0.016042', '0.016039', '0.016036', '0.016036 0.016039 0.016052', '0.000009', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

MolecularDynamics

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/mol_dyn --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/MolecularDynamics.csv --size=8196

Output:

['MolecularDynamics', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8196', '0.000038', '0.000029', '0.000025', '0.000025 0.000029 0.000060', '0.000019', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

alloc/size:10000/0/4096/iterations:200000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2738.35,1987.62,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,709.656,709.656,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1253.42,1141.77,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,730.741,730.738,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,818.174,752.19,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,176.774,176.772,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,1986.3,1986.02,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.788,189.782,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1680.9,1680.35,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.109,187.105,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2972.02,2924.94,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,273.849,273.841,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3364.23,3319.89,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,285.897,285.891,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,297.523,279.614,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,212.659,212.658,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,247.035,241.504,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.089,200.086,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,950.948,946.125,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1048.35,1048.28,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,35693.2,33423.6,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4126.72,4126.58,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,140646,88688.4,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,28239.5,28239.3,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18595e+06,1.18532e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,156812,156807,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.20782e+06,1.20575e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,140651,140628,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43358.4,43073.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15529.2,15528.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76211,75767.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,26000.3,25999.7,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2758.34,1952.55,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,723.083,723.081,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1413.23,1298.08,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,739.232,739.203,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.114,760.032,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,185.585,185.576,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2082.36,2079.53,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,191.762,191.757,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1834.63,1834.44,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.979,191.974,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3273.8,3229.41,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.166,270.121,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3167.57,3117.57,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,288.628,288.622,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,297.74,296.891,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,223.294,223.292,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,255.595,249.013,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,203.018,203.011,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,964.486,947.698,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1033.58,1033.57,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,31306.9,29077.9,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4329.71,4329.62,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137492,87173.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31530.9,31530.6,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.14042e+06,1.13997e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,156232,156231,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.07655e+06,1.0763e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138427,138423,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41117.3,40744.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,16310,16309.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,74425.3,73893.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,28146.2,28145.7,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2671.2,1996.49,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,728.916,728.863,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1257.31,1135.76,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,737.976,737.937,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,836.689,760.399,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.341,177.074,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2018.43,2017.61,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,194.389,194.382,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1878.32,1877.71,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.662,191.657,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3019.3,2972.1,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.528,270.475,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3343.82,3301.6,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,282.153,282.147,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,293.94,291.856,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,216.749,216.746,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,254.273,248.566,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.063,192.058,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1079.66,1051.36,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1045.3,1045.25,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,36899,34664.8,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4319.45,4319.35,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,142032,89595.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31390.1,31389.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.16717e+06,1.16672e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,155031,155027,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19146e+06,1.19112e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138974,138970,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43158.9,42398.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14773.8,14773.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75604,75115.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27973.7,27973.1,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2671.2,1996.49,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,728.916,728.863,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1257.31,1135.76,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,737.976,737.937,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,836.689,760.399,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.341,177.074,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2018.43,2017.61,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,194.389,194.382,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1878.32,1877.71,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.662,191.657,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3019.3,2972.1,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.528,270.475,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3343.82,3301.6,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,282.153,282.147,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,293.94,291.856,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,216.749,216.746,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,254.273,248.566,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.063,192.058,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1079.66,1051.36,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1045.3,1045.25,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,36899,34664.8,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4319.45,4319.35,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,142032,89595.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31390.1,31389.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.16717e+06,1.16672e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,155031,155027,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19146e+06,1.19112e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138974,138970,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43158.9,42398.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14773.8,14773.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75604,75115.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27973.7,27973.1,ns,,,,,

alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2738.35,1987.62,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,709.656,709.656,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1253.42,1141.77,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,730.741,730.738,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,818.174,752.19,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,176.774,176.772,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,1986.3,1986.02,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.788,189.782,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1680.9,1680.35,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.109,187.105,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2972.02,2924.94,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,273.849,273.841,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3364.23,3319.89,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,285.897,285.891,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,297.523,279.614,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,212.659,212.658,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,247.035,241.504,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.089,200.086,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,950.948,946.125,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1048.35,1048.28,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,35693.2,33423.6,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4126.72,4126.58,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,140646,88688.4,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,28239.5,28239.3,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18595e+06,1.18532e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,156812,156807,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.20782e+06,1.20575e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,140651,140628,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43358.4,43073.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15529.2,15528.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76211,75767.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,26000.3,25999.7,ns,,,,,

alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2671.2,1996.49,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,728.916,728.863,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1257.31,1135.76,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,737.976,737.937,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,836.689,760.399,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.341,177.074,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2018.43,2017.61,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,194.389,194.382,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1878.32,1877.71,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.662,191.657,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3019.3,2972.1,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.528,270.475,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3343.82,3301.6,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,282.153,282.147,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,293.94,291.856,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,216.749,216.746,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,254.273,248.566,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.063,192.058,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1079.66,1051.36,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1045.3,1045.25,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,36899,34664.8,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4319.45,4319.35,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,142032,89595.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31390.1,31389.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.16717e+06,1.16672e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,155031,155027,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19146e+06,1.19112e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138974,138970,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43158.9,42398.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14773.8,14773.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75604,75115.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27973.7,27973.1,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:4 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2671.2,1996.49,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,728.916,728.863,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1257.31,1135.76,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,737.976,737.937,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,836.689,760.399,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.341,177.074,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2018.43,2017.61,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,194.389,194.382,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1878.32,1877.71,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.662,191.657,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3019.3,2972.1,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.528,270.475,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3343.82,3301.6,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,282.153,282.147,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,293.94,291.856,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,216.749,216.746,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,254.273,248.566,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.063,192.058,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1079.66,1051.36,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1045.3,1045.25,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,36899,34664.8,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4319.45,4319.35,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,142032,89595.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31390.1,31389.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.16717e+06,1.16672e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,155031,155027,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19146e+06,1.19112e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138974,138970,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43158.9,42398.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14773.8,14773.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75604,75115.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27973.7,27973.1,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:1 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2758.34,1952.55,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,723.083,723.081,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1413.23,1298.08,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,739.232,739.203,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.114,760.032,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,185.585,185.576,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2082.36,2079.53,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,191.762,191.757,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1834.63,1834.44,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.979,191.974,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3273.8,3229.41,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.166,270.121,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3167.57,3117.57,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,288.628,288.622,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,297.74,296.891,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,223.294,223.292,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,255.595,249.013,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,203.018,203.011,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,964.486,947.698,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1033.58,1033.57,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,31306.9,29077.9,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4329.71,4329.62,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137492,87173.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31530.9,31530.6,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.14042e+06,1.13997e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,156232,156231,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.07655e+06,1.0763e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138427,138423,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41117.3,40744.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,16310,16309.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,74425.3,73893.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,28146.2,28145.7,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:4 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2758.34,1952.55,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,723.083,723.081,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1413.23,1298.08,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,739.232,739.203,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.114,760.032,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,185.585,185.576,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2082.36,2079.53,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,191.762,191.757,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1834.63,1834.44,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.979,191.974,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3273.8,3229.41,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.166,270.121,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3167.57,3117.57,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,288.628,288.622,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,297.74,296.891,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,223.294,223.292,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,255.595,249.013,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,203.018,203.011,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,964.486,947.698,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1033.58,1033.57,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,31306.9,29077.9,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4329.71,4329.62,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137492,87173.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31530.9,31530.6,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.14042e+06,1.13997e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,156232,156231,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.07655e+06,1.0763e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138427,138423,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41117.3,40744.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,16310,16309.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,74425.3,73893.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,28146.2,28145.7,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:1 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2671.2,1996.49,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,728.916,728.863,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1257.31,1135.76,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,737.976,737.937,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,836.689,760.399,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.341,177.074,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2018.43,2017.61,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,194.389,194.382,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1878.32,1877.71,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.662,191.657,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3019.3,2972.1,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.528,270.475,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3343.82,3301.6,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,282.153,282.147,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,293.94,291.856,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,216.749,216.746,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,254.273,248.566,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.063,192.058,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1079.66,1051.36,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1045.3,1045.25,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,36899,34664.8,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4319.45,4319.35,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,142032,89595.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31390.1,31389.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.16717e+06,1.16672e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,155031,155027,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19146e+06,1.19112e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138974,138970,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43158.9,42398.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14773.8,14773.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75604,75115.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27973.7,27973.1,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:4 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2671.2,1996.49,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,728.916,728.863,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1257.31,1135.76,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,737.976,737.937,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,836.689,760.399,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.341,177.074,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2018.43,2017.61,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,194.389,194.382,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1878.32,1877.71,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.662,191.657,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3019.3,2972.1,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.528,270.475,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3343.82,3301.6,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,282.153,282.147,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,293.94,291.856,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,216.749,216.746,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,254.273,248.566,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.063,192.058,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1079.66,1051.36,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1045.3,1045.25,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,36899,34664.8,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4319.45,4319.35,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,142032,89595.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31390.1,31389.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.16717e+06,1.16672e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,155031,155027,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19146e+06,1.19112e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138974,138970,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43158.9,42398.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14773.8,14773.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75604,75115.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27973.7,27973.1,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:1 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2671.2,1996.49,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,728.916,728.863,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1257.31,1135.76,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,737.976,737.937,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,836.689,760.399,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.341,177.074,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2018.43,2017.61,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,194.389,194.382,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1878.32,1877.71,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.662,191.657,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3019.3,2972.1,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.528,270.475,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3343.82,3301.6,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,282.153,282.147,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,293.94,291.856,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,216.749,216.746,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,254.273,248.566,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.063,192.058,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1079.66,1051.36,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1045.3,1045.25,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,36899,34664.8,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4319.45,4319.35,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,142032,89595.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31390.1,31389.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.16717e+06,1.16672e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,155031,155027,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19146e+06,1.19112e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138974,138970,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43158.9,42398.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14773.8,14773.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75604,75115.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27973.7,27973.1,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:4 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2671.2,1996.49,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,728.916,728.863,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1257.31,1135.76,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,737.976,737.937,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,836.689,760.399,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.341,177.074,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2018.43,2017.61,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,194.389,194.382,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1878.32,1877.71,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.662,191.657,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3019.3,2972.1,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.528,270.475,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3343.82,3301.6,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,282.153,282.147,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,293.94,291.856,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,216.749,216.746,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,254.273,248.566,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.063,192.058,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1079.66,1051.36,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1045.3,1045.25,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,36899,34664.8,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4319.45,4319.35,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,142032,89595.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31390.1,31389.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.16717e+06,1.16672e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,155031,155027,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19146e+06,1.19112e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138974,138970,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43158.9,42398.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14773.8,14773.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75604,75115.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27973.7,27973.1,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:1 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2738.35,1987.62,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,709.656,709.656,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1253.42,1141.77,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,730.741,730.738,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,818.174,752.19,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,176.774,176.772,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,1986.3,1986.02,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.788,189.782,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1680.9,1680.35,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.109,187.105,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2972.02,2924.94,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,273.849,273.841,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3364.23,3319.89,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,285.897,285.891,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,297.523,279.614,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,212.659,212.658,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,247.035,241.504,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.089,200.086,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,950.948,946.125,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1048.35,1048.28,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,35693.2,33423.6,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4126.72,4126.58,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,140646,88688.4,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,28239.5,28239.3,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18595e+06,1.18532e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,156812,156807,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.20782e+06,1.20575e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,140651,140628,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43358.4,43073.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15529.2,15528.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76211,75767.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,26000.3,25999.7,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2738.35,1987.62,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,709.656,709.656,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1253.42,1141.77,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,730.741,730.738,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,818.174,752.19,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,176.774,176.772,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,1986.3,1986.02,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.788,189.782,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1680.9,1680.35,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.109,187.105,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2972.02,2924.94,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,273.849,273.841,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3364.23,3319.89,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,285.897,285.891,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,297.523,279.614,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,212.659,212.658,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,247.035,241.504,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.089,200.086,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,950.948,946.125,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1048.35,1048.28,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,35693.2,33423.6,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4126.72,4126.58,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,140646,88688.4,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,28239.5,28239.3,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18595e+06,1.18532e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,156812,156807,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.20782e+06,1.20575e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,140651,140628,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43358.4,43073.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15529.2,15528.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76211,75767.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,26000.3,25999.7,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2671.2,1996.49,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,728.916,728.863,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1257.31,1135.76,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,737.976,737.937,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,836.689,760.399,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.341,177.074,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2018.43,2017.61,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,194.389,194.382,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1878.32,1877.71,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.662,191.657,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3019.3,2972.1,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.528,270.475,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3343.82,3301.6,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,282.153,282.147,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,293.94,291.856,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,216.749,216.746,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,254.273,248.566,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.063,192.058,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1079.66,1051.36,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1045.3,1045.25,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,36899,34664.8,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4319.45,4319.35,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,142032,89595.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31390.1,31389.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.16717e+06,1.16672e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,155031,155027,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19146e+06,1.19112e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138974,138970,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43158.9,42398.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14773.8,14773.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75604,75115.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27973.7,27973.1,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2671.2,1996.49,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,728.916,728.863,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1257.31,1135.76,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,737.976,737.937,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,836.689,760.399,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.341,177.074,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2018.43,2017.61,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,194.389,194.382,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1878.32,1877.71,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.662,191.657,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3019.3,2972.1,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.528,270.475,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3343.82,3301.6,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,282.153,282.147,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,293.94,291.856,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,216.749,216.746,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,254.273,248.566,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.063,192.058,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1079.66,1051.36,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1045.3,1045.25,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,36899,34664.8,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4319.45,4319.35,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,142032,89595.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31390.1,31389.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.16717e+06,1.16672e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,155031,155027,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19146e+06,1.19112e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138974,138970,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43158.9,42398.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14773.8,14773.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75604,75115.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27973.7,27973.1,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2738.35,1987.62,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,709.656,709.656,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1253.42,1141.77,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,730.741,730.738,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,818.174,752.19,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,176.774,176.772,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,1986.3,1986.02,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.788,189.782,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1680.9,1680.35,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.109,187.105,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2972.02,2924.94,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,273.849,273.841,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3364.23,3319.89,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,285.897,285.891,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,297.523,279.614,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,212.659,212.658,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,247.035,241.504,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.089,200.086,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,950.948,946.125,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1048.35,1048.28,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,35693.2,33423.6,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4126.72,4126.58,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,140646,88688.4,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,28239.5,28239.3,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18595e+06,1.18532e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,156812,156807,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.20782e+06,1.20575e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,140651,140628,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43358.4,43073.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15529.2,15528.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76211,75767.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,26000.3,25999.7,ns,,,,,

alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2758.34,1952.55,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,723.083,723.081,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1413.23,1298.08,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,739.232,739.203,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.114,760.032,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,185.585,185.576,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2082.36,2079.53,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,191.762,191.757,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1834.63,1834.44,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.979,191.974,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3273.8,3229.41,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.166,270.121,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3167.57,3117.57,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,288.628,288.622,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,297.74,296.891,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,223.294,223.292,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,255.595,249.013,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,203.018,203.011,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,964.486,947.698,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1033.58,1033.57,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,31306.9,29077.9,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4329.71,4329.62,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137492,87173.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31530.9,31530.6,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.14042e+06,1.13997e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,156232,156231,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.07655e+06,1.0763e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138427,138423,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41117.3,40744.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,16310,16309.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,74425.3,73893.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,28146.2,28145.7,ns,,,,,

alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2671.2,1996.49,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,728.916,728.863,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1257.31,1135.76,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,737.976,737.937,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,836.689,760.399,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.341,177.074,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2018.43,2017.61,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,194.389,194.382,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1878.32,1877.71,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.662,191.657,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3019.3,2972.1,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.528,270.475,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3343.82,3301.6,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,282.153,282.147,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,293.94,291.856,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,216.749,216.746,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,254.273,248.566,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.063,192.058,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1079.66,1051.36,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1045.3,1045.25,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,36899,34664.8,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4319.45,4319.35,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,142032,89595.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31390.1,31389.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.16717e+06,1.16672e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,155031,155027,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19146e+06,1.19112e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138974,138970,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43158.9,42398.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14773.8,14773.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75604,75115.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27973.7,27973.1,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2738.35,1987.62,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,709.656,709.656,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1253.42,1141.77,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,730.741,730.738,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,818.174,752.19,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,176.774,176.772,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,1986.3,1986.02,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.788,189.782,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1680.9,1680.35,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.109,187.105,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2972.02,2924.94,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,273.849,273.841,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3364.23,3319.89,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,285.897,285.891,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,297.523,279.614,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,212.659,212.658,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,247.035,241.504,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.089,200.086,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,950.948,946.125,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1048.35,1048.28,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,35693.2,33423.6,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4126.72,4126.58,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,140646,88688.4,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,28239.5,28239.3,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18595e+06,1.18532e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,156812,156807,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.20782e+06,1.20575e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,140651,140628,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43358.4,43073.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15529.2,15528.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76211,75767.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,26000.3,25999.7,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2671.2,1996.49,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,728.916,728.863,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1257.31,1135.76,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,737.976,737.937,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,836.689,760.399,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.341,177.074,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2018.43,2017.61,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,194.389,194.382,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1878.32,1877.71,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.662,191.657,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3019.3,2972.1,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.528,270.475,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3343.82,3301.6,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,282.153,282.147,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,293.94,291.856,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,216.749,216.746,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,254.273,248.566,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.063,192.058,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1079.66,1051.36,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1045.3,1045.25,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,36899,34664.8,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4319.45,4319.35,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,142032,89595.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31390.1,31389.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.16717e+06,1.16672e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,155031,155027,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19146e+06,1.19112e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138974,138970,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43158.9,42398.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14773.8,14773.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75604,75115.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27973.7,27973.1,ns,,,,,

multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2738.35,1987.62,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,709.656,709.656,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1253.42,1141.77,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,730.741,730.738,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,818.174,752.19,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,176.774,176.772,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,1986.3,1986.02,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.788,189.782,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1680.9,1680.35,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.109,187.105,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2972.02,2924.94,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,273.849,273.841,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3364.23,3319.89,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,285.897,285.891,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,297.523,279.614,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,212.659,212.658,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,247.035,241.504,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.089,200.086,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,950.948,946.125,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1048.35,1048.28,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,35693.2,33423.6,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4126.72,4126.58,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,140646,88688.4,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,28239.5,28239.3,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18595e+06,1.18532e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,156812,156807,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.20782e+06,1.20575e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,140651,140628,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43358.4,43073.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15529.2,15528.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76211,75767.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,26000.3,25999.7,ns,,,,,

multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2671.2,1996.49,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,728.916,728.863,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1257.31,1135.76,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,737.976,737.937,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,836.689,760.399,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.341,177.074,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2018.43,2017.61,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,194.389,194.382,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1878.32,1877.71,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.662,191.657,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3019.3,2972.1,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.528,270.475,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3343.82,3301.6,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,282.153,282.147,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,293.94,291.856,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,216.749,216.746,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,254.273,248.566,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.063,192.058,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1079.66,1051.36,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1045.3,1045.25,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,36899,34664.8,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4319.45,4319.35,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,142032,89595.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31390.1,31389.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.16717e+06,1.16672e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,155031,155027,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19146e+06,1.19112e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138974,138970,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43158.9,42398.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14773.8,14773.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75604,75115.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27973.7,27973.1,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2671.2,1996.49,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,728.916,728.863,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1257.31,1135.76,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,737.976,737.937,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,836.689,760.399,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.341,177.074,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2018.43,2017.61,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,194.389,194.382,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1878.32,1877.71,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.662,191.657,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3019.3,2972.1,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.528,270.475,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3343.82,3301.6,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,282.153,282.147,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,293.94,291.856,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,216.749,216.746,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,254.273,248.566,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.063,192.058,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1079.66,1051.36,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1045.3,1045.25,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,36899,34664.8,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4319.45,4319.35,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,142032,89595.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31390.1,31389.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.16717e+06,1.16672e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,155031,155027,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19146e+06,1.19112e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138974,138970,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43158.9,42398.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14773.8,14773.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75604,75115.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27973.7,27973.1,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2758.34,1952.55,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,723.083,723.081,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1413.23,1298.08,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,739.232,739.203,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,815.114,760.032,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,185.585,185.576,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2082.36,2079.53,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,191.762,191.757,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1834.63,1834.44,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.979,191.974,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3273.8,3229.41,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.166,270.121,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3167.57,3117.57,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,288.628,288.622,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,297.74,296.891,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,223.294,223.292,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,255.595,249.013,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,203.018,203.011,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,964.486,947.698,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1033.58,1033.57,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,31306.9,29077.9,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4329.71,4329.62,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137492,87173.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31530.9,31530.6,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.14042e+06,1.13997e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,156232,156231,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.07655e+06,1.0763e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138427,138423,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41117.3,40744.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,16310,16309.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,74425.3,73893.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,28146.2,28145.7,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2671.2,1996.49,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,728.916,728.863,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1257.31,1135.76,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,737.976,737.937,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,836.689,760.399,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.341,177.074,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2018.43,2017.61,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,194.389,194.382,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1878.32,1877.71,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.662,191.657,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3019.3,2972.1,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.528,270.475,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3343.82,3301.6,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,282.153,282.147,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,293.94,291.856,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,216.749,216.746,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,254.273,248.566,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.063,192.058,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1079.66,1051.36,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1045.3,1045.25,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,36899,34664.8,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4319.45,4319.35,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,142032,89595.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31390.1,31389.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.16717e+06,1.16672e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,155031,155027,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19146e+06,1.19112e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138974,138970,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43158.9,42398.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14773.8,14773.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75604,75115.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27973.7,27973.1,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2671.2,1996.49,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,728.916,728.863,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1257.31,1135.76,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,737.976,737.937,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,836.689,760.399,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.341,177.074,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2018.43,2017.61,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,194.389,194.382,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1878.32,1877.71,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.662,191.657,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3019.3,2972.1,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.528,270.475,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3343.82,3301.6,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,282.153,282.147,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,293.94,291.856,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,216.749,216.746,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,254.273,248.566,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.063,192.058,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1079.66,1051.36,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1045.3,1045.25,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,36899,34664.8,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4319.45,4319.35,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,142032,89595.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31390.1,31389.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.16717e+06,1.16672e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,155031,155027,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19146e+06,1.19112e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138974,138970,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43158.9,42398.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14773.8,14773.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75604,75115.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27973.7,27973.1,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2671.2,1996.49,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,728.916,728.863,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1257.31,1135.76,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,737.976,737.937,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,836.689,760.399,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.341,177.074,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2018.43,2017.61,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,194.389,194.382,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1878.32,1877.71,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.662,191.657,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3019.3,2972.1,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.528,270.475,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3343.82,3301.6,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,282.153,282.147,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,293.94,291.856,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,216.749,216.746,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,254.273,248.566,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.063,192.058,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1079.66,1051.36,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1045.3,1045.25,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,36899,34664.8,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4319.45,4319.35,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,142032,89595.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31390.1,31389.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.16717e+06,1.16672e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,155031,155027,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19146e+06,1.19112e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138974,138970,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43158.9,42398.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14773.8,14773.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75604,75115.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27973.7,27973.1,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2738.35,1987.62,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,709.656,709.656,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1253.42,1141.77,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,730.741,730.738,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,818.174,752.19,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,176.774,176.772,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,1986.3,1986.02,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,189.788,189.782,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1680.9,1680.35,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,187.109,187.105,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2972.02,2924.94,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,273.849,273.841,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3364.23,3319.89,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,285.897,285.891,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,297.523,279.614,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,212.659,212.658,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,247.035,241.504,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.089,200.086,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,950.948,946.125,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1048.35,1048.28,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,35693.2,33423.6,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4126.72,4126.58,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,140646,88688.4,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,28239.5,28239.3,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18595e+06,1.18532e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,156812,156807,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.20782e+06,1.20575e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,140651,140628,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43358.4,43073.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15529.2,15528.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76211,75767.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,26000.3,25999.7,ns,,,,,

multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2671.2,1996.49,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,728.916,728.863,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1257.31,1135.76,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,737.976,737.937,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,836.689,760.399,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.341,177.074,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2018.43,2017.61,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,194.389,194.382,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1878.32,1877.71,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.662,191.657,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3019.3,2972.1,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.528,270.475,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3343.82,3301.6,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,282.153,282.147,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,293.94,291.856,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,216.749,216.746,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,254.273,248.566,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.063,192.058,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1079.66,1051.36,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1045.3,1045.25,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,36899,34664.8,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4319.45,4319.35,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,142032,89595.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31390.1,31389.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.16717e+06,1.16672e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,155031,155027,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19146e+06,1.19112e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138974,138970,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43158.9,42398.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14773.8,14773.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75604,75115.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27973.7,27973.1,ns,,,,,

multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2671.2,1996.49,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,728.916,728.863,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1257.31,1135.76,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,737.976,737.937,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,836.689,760.399,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,177.341,177.074,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2018.43,2017.61,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,194.389,194.382,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1878.32,1877.71,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,191.662,191.657,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3019.3,2972.1,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.528,270.475,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3343.82,3301.6,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,282.153,282.147,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,293.94,291.856,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,216.749,216.746,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,254.273,248.566,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.063,192.058,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1079.66,1051.36,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,1045.3,1045.25,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,36899,34664.8,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4319.45,4319.35,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,142032,89595.3,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31390.1,31389.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.16717e+06,1.16672e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,155031,155027,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.19146e+06,1.19112e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,138974,138970,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,43158.9,42398.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14773.8,14773.5,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,75604,75115.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,27973.7,27973.1,ns,,,,,

Copy link
Contributor

Compute Benchmarks level_zero run (with params: --filter "Velocity|llama"):
https://github.com/oneapi-src/unified-runtime/actions/runs/12826547948

@oneapi-src oneapi-src deleted a comment from github-actions bot Jan 17, 2025
@oneapi-src oneapi-src deleted a comment from github-actions bot Jan 17, 2025
Copy link
Contributor

Compute Benchmarks level_zero run (--filter "Velocity|llama"):
https://github.com/oneapi-src/unified-runtime/actions/runs/12826547948
Job status: failure. Test status: failure.

Copy link
Contributor

Compute Benchmarks level_zero run (with params: --filter "Velocity|llama"):
https://github.com/oneapi-src/unified-runtime/actions/runs/12826701422

Copy link
Contributor

Compute Benchmarks level_zero run (--filter "Velocity|llama"):
https://github.com/oneapi-src/unified-runtime/actions/runs/12826701422
Job status: failure. Test status: failure.

Copy link
Contributor

Compute Benchmarks level_zero run (with params: --filter "Velocity|llama"):
https://github.com/oneapi-src/unified-runtime/actions/runs/12826967462

Copy link
Contributor

Compute Benchmarks level_zero run (--filter "Velocity|llama"):
https://github.com/oneapi-src/unified-runtime/actions/runs/12826967462
Job status: failure. Test status: failure.

Copy link
Contributor

Compute Benchmarks level_zero run (with params: --filter "Velocity|llama"):
https://github.com/oneapi-src/unified-runtime/actions/runs/12827163375

Copy link
Contributor

Compute Benchmarks level_zero run (--filter "Velocity|llama"):
https://github.com/oneapi-src/unified-runtime/actions/runs/12827163375
Job status: success. Test status: success.

Summary

Total 7 benchmarks in mean.
Geomean 99.840%.
Improved 1 Regressed 1 (threshold 2.00%)

(result is better)

Performance change in benchmark groups

Relative perf in group Velocity-Bench (9): 99.840%
Benchmark This PR baseline Relative perf Change -
Velocity-Bench svm 0.137000 s 0.140 s 102.12% 2.12% +++++++++
Velocity-Bench QuickSilver 118.840000 MMS/CTT 117.490 MMS/CTT 101.15% 1.15% .
Velocity-Bench Bitcracker 35.029000 s 35.130 s 100.29% 0.29% .
Velocity-Bench Easywave 230.000 ms 229.000000 ms 99.57% -0.43% .
Velocity-Bench CudaSift 202.856 ms 201.142000 ms 99.16% -0.84% .
Velocity-Bench Hashtable 358.943 M keys/sec 362.504819 M keys/sec 99.02% -0.98% .
Velocity-Bench Sobel Filter 616.496 ms 602.045000 ms 97.66% -2.34% ----------
Velocity-Bench dl-cifar - 23.743900 s
Velocity-Bench dl-mnist - 2.720000 s
Relative perf in group api (9): cannot calculate
Benchmark This PR baseline Relative perf Change -
api_overhead_benchmark_l0 SubmitKernel out of order - 11.528000 μs
api_overhead_benchmark_sycl SubmitKernel out of order - 23.678000 μs
api_overhead_benchmark_sycl SubmitKernel in order - 24.844000 μs
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024 - 2.118000 μs
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024 - 1.675000 μs
api_overhead_benchmark_ur SubmitKernel out of order CPU count - 101923.000000 instr
api_overhead_benchmark_ur SubmitKernel out of order - 15.896000 μs
api_overhead_benchmark_ur SubmitKernel in order CPU count - 107041.000000 instr
api_overhead_benchmark_ur SubmitKernel in order - 16.663000 μs
Relative perf in group memory (4): cannot calculate
Benchmark This PR baseline Relative perf Change -
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024 - 253.805000 μs
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024 - 132.929000 μs
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024 - 5.638000 μs
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240 - 3.151000 GB/s
Relative perf in group miscellaneous (1): cannot calculate
Benchmark This PR baseline Relative perf Change -
miscellaneous_benchmark_sycl VectorSum - 858.609000 bw GB/s
Relative perf in group multithread (10): cannot calculate
Benchmark This PR baseline Relative perf Change -
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:1 dstUSM:1 - 6935.535000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:1 dstUSM:1 - 17316.620000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:1 dstUSM:1 - 47907.007000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:1 dstUSM:1 - 2022.915000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:0 dstUSM:1 - 7452.758000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:0 dstUSM:1 - 8555.721000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:0 dstUSM:1 - 25543.132000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:0 dstUSM:1 - 1157.521000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:1, allocSize:1024 srcUSM:0 dstUSM:1 without events - 40973.625000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:4, allocSize:1024 srcUSM:0 dstUSM:1 without events - 108338.415000 μs
Relative perf in group Runtime (8): cannot calculate
Benchmark This PR baseline Relative perf Change -
Runtime_IndependentDAGTaskThroughput_SingleTask - 259.395000 ms
Runtime_IndependentDAGTaskThroughput_BasicParallelFor - 275.382000 ms
Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor - 278.916000 ms
Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor - 278.736000 ms
Runtime_DAGTaskThroughput_SingleTask - 1678.732000 ms
Runtime_DAGTaskThroughput_BasicParallelFor - 1746.233000 ms
Runtime_DAGTaskThroughput_HierarchicalParallelFor - 1725.256000 ms
Runtime_DAGTaskThroughput_NDRangeParallelFor - 1695.816000 ms
Relative perf in group MicroBench (14): cannot calculate
Benchmark This PR baseline Relative perf Change -
MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous - 4.238000 ms
MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous - 4.317000 ms
MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous - 4.322000 ms
MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous - 4.414000 ms
MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous - 617.994000 ms
MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous - 617.954000 ms
MicroBench_HostDeviceBandwidth_1D_H2D_Strided - 4.547000 ms
MicroBench_HostDeviceBandwidth_2D_H2D_Strided - 4.781000 ms
MicroBench_HostDeviceBandwidth_3D_H2D_Strided - 4.574000 ms
MicroBench_HostDeviceBandwidth_1D_D2H_Strided - 4.702000 ms
MicroBench_HostDeviceBandwidth_2D_D2H_Strided - 617.523000 ms
MicroBench_HostDeviceBandwidth_3D_D2H_Strided - 617.254000 ms
MicroBench_LocalMem_int32_4096 - 29.866000 ms
MicroBench_LocalMem_fp32_4096 - 29.833000 ms
Relative perf in group Pattern (10): cannot calculate
Benchmark This PR baseline Relative perf Change -
Pattern_Reduction_NDRange_int32 - 16.163000 ms
Pattern_Reduction_Hierarchical_int32 - 16.411000 ms
Pattern_SegmentedReduction_NDRange_int16 - 2.264000 ms
Pattern_SegmentedReduction_NDRange_int32 - 2.164000 ms
Pattern_SegmentedReduction_NDRange_int64 - 2.336000 ms
Pattern_SegmentedReduction_NDRange_fp32 - 2.163000 ms
Pattern_SegmentedReduction_Hierarchical_int16 - 11.801000 ms
Pattern_SegmentedReduction_Hierarchical_int32 - 11.599000 ms
Pattern_SegmentedReduction_Hierarchical_int64 - 11.779000 ms
Pattern_SegmentedReduction_Hierarchical_fp32 - 11.589000 ms
Relative perf in group ScalarProduct (6): cannot calculate
Benchmark This PR baseline Relative perf Change -
ScalarProduct_NDRange_int32 - 3.733000 ms
ScalarProduct_NDRange_int64 - 5.456000 ms
ScalarProduct_NDRange_fp32 - 3.759000 ms
ScalarProduct_Hierarchical_int32 - 10.523000 ms
ScalarProduct_Hierarchical_int64 - 11.490000 ms
ScalarProduct_Hierarchical_fp32 - 10.170000 ms
Relative perf in group USM (7): cannot calculate
Benchmark This PR baseline Relative perf Change -
USM_Allocation_latency_fp32_device - 0.068000 ms
USM_Allocation_latency_fp32_host - 37.899000 ms
USM_Allocation_latency_fp32_shared - 0.066000 ms
USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch - 1.661000 ms
USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch - 1.046000 ms
USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch - 1.814000 ms
USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch - 1.195000 ms
Relative perf in group VectorAddition (3): cannot calculate
Benchmark This PR baseline Relative perf Change -
VectorAddition_int32 - 1.448000 ms
VectorAddition_int64 - 3.139000 ms
VectorAddition_fp32 - 1.445000 ms
Relative perf in group Polybench (3): cannot calculate
Benchmark This PR baseline Relative perf Change -
Polybench_2mm - 1.216000 ms
Polybench_3mm - 1.727000 ms
Polybench_Atax - 6.880000 ms
Relative perf in group Kmeans (1): cannot calculate
Benchmark This PR baseline Relative perf Change -
Kmeans_fp32 - 16.083000 ms
Relative perf in group MolecularDynamics (1): cannot calculate
Benchmark This PR baseline Relative perf Change -
MolecularDynamics - 0.028000 ms
Relative perf in group llama.cpp (6): cannot calculate
Benchmark This PR baseline Relative perf Change -
llama.cpp Prompt Processing Batched 128 - 838.869803 token/s
llama.cpp Text Generation Batched 128 - 63.338561 token/s
llama.cpp Prompt Processing Batched 256 - 872.377637 token/s
llama.cpp Text Generation Batched 256 - 63.361520 token/s
llama.cpp Prompt Processing Batched 512 - 434.541716 token/s
llama.cpp Text Generation Batched 512 - 63.295460 token/s
Relative perf in group alloc/max (20): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 glibc - 2589.180000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 glibc - 710.936000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 glibc - 1188.310000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 glibc - 716.901000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 glibc - 861.597000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 glibc - 175.935000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 os_provider - 2246.790000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 os_provider - 187.819000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 os_provider - 1690.250000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 os_provider - 189.702000 ns
alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4 proxy_pool<os_provider> - 4441.700000 ns
alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1 proxy_pool<os_provider> - 256.696000 ns
alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4 proxy_pool<os_provider> - 3268.220000 ns
alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1 proxy_pool<os_provider> - 306.439000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 scalable_pool<os_provider> - 299.852000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 scalable_pool<os_provider> - 213.534000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 scalable_pool<os_provider> - 263.904000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 scalable_pool<os_provider> - 197.833000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 scalable_pool<os_provider> - 1051.720000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 scalable_pool<os_provider> - 952.492000 ns
Relative perf in group multiple (12): cannot calculate
Benchmark This PR baseline Relative perf Change -
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 glibc - 32574.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 glibc - 4128.530000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 glibc - 138399.000000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 glibc - 28197.400000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 proxy_pool<os_provider> - 1161430.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 proxy_pool<os_provider> - 161766.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 os_provider - 1166110.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 os_provider - 141737.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 scalable_pool<os_provider> - 42212.800000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 scalable_pool<os_provider> - 14889.200000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 scalable_pool<os_provider> - 72778.500000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 scalable_pool<os_provider> - 27538.700000 ns

Details

Benchmark details - environment, command, output...
Velocity-Bench Hashtable

Environment Variables:

Command:

/home/pmdk/bench_workdir/hashtable/hashtable_sycl --no-verify

Output:

hashtable - total time for whole calculation: 0.373925 s
358.942858 million keys/second

Velocity-Bench Bitcracker

Environment Variables:

Command:

/home/pmdk/bench_workdir/bitcracker/bitcracker -f /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt -d /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt -b 60000

Output:

---------> BitCracker: BitLocker password cracking tool <---------

==================================
Retrieving Info

Reading hash file "/home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt"

              Attack

================================================
Type of attack: User Password
Psw per thread: 1
max_num_pswd_per_read: 60000
Dictionary: /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt
MAC Comparison (-m): Yes

Iter: 1, num passwords read: 60000
Kernel execution:
Effective passwords: 60000
Passwords Range:
npknpByH7N2m3OnLNH1X9DJxLrzIFWk
.....
dL_7uuf3QCz-c6K3xDu0

================================================
Bitcracker attack completed
Total passwords evaluated: 60000
Password not found!

time to subtract from total: 0.00411444 s
bitcracker - total time for whole calculation: 35.029 s

Velocity-Bench CudaSift

Environment Variables:

Command:

/home/pmdk/bench_workdir/cudaSift/cudaSift

Output:

UNKN:

UNKN: ==================================================
UNKN: User input parameters:
UNKN: Trace: ../../inputData
UNKN: ==================================================
UNKN:

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1099 1263 29.8398% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1220 1256 33.1252% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1248 1284 33.8854% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1227 1262 33.3152% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1115 1267 30.2742% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1232 1267 33.451% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1222 1267 33.1795% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1098 1259 29.8127% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1234 1269 33.5053% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1217 1253 33.0437% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1162 1268 31.5504% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1235 1268 33.5324% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1096 1266 29.7583% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1217 1249 33.0437% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1106 1252 30.0299% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1237 1273 33.5868% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1234 1271 33.5053% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1220 1252 33.1252% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1151 1260 31.2517% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1226 1259 33.2881% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1081 1270 29.3511% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1080 1253 29.3239% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1120 1260 30.41% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1224 1258 33.2338% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1218 1259 33.0709% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1229 1261 33.3695% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1233 1268 33.4781% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1092 1269 29.6497% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1217 1257 33.0437% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1101 1270 29.8941% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1224 1268 33.2338% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1226 1261 33.2881% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1106 1256 30.0299% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1238 1276 33.6139% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1204 1271 32.6907% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1237 1273 33.5868% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1067 1272 28.9709% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1217 1252 33.0437% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1066 1264 28.9438% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1094 1271 29.704% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1164 1259 31.6047% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1227 1260 33.3152% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1237 1272 33.5868% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1232 1265 33.451% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1154 1270 31.3332% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1095 1260 29.7312% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1106 1256 30.0299% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1054 1270 28.618% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1097 1268 29.7855% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1073 1267 29.1339% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Avg workload time = 202.856 ms

Velocity-Bench Easywave

Environment Variables:

Command:

/home/pmdk/bench_workdir/easywave/easyWave_sycl -grid /home/pmdk/bench_workdir/data/easywave/examples/e2Asean.grd -source /home/pmdk/bench_workdir/data/easywave/examples/BengkuluSept2007.flt -time 120

Output:

MAIN: Starting SYCL main program
MAIN: Attempting to clean up previous eWave tsunami files
MAIN: Clean up completed
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.6.31294)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
MAIN: Program successfully completed

Velocity-Bench QuickSilver

Environment Variables:

QS_DEVICE=GPU

Command:

/home/pmdk/bench_workdir/QuickSilver/qs -i /home/pmdk/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp

Output:

Copyright (c) 2016
Lawrence Livermore National Security, LLC
All Rights Reserved
Quicksilver Version :
Quicksilver Git Hash :
MPI Version : 3.0
Number of MPI ranks : 1
Number of OpenMP Threads: 1
Number of OpenMP CPUs : 1

Loading params
Finished loading params
Simulation:
dt: 1e-08
fMax: 0.1
inputFile: /home/pmdk/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp
energySpectrum:
boundaryCondition: octant
loadBalance: 1
cycleTimers: 0
debugThreads: 0
lx: 100
ly: 100
lz: 100
nParticles: 10000000
batchSize: 0
nBatches: 10
nSteps: 10
nx: 10
ny: 10
nz: 10
seed: 1029384756
xDom: 0
yDom: 0
zDom: 0
eMax: 20
eMin: 1e-09
nGroups: 230
lowWeightCutoff: 0.001
bTally: 1
fTally: 1
cTally: 1
coralBenchmark: 0
crossSectionsOut:

Geometry:
material: sourceMaterial
shape: brick
xMax: 100
xMin: 0
yMax: 100
yMin: 0
zMax: 100
zMin: 0

Material:
name: sourceMaterial
mass: 1000
nIsotopes: 10
nReactions: 9
sourceRate: 1e+10
totalCrossSection: 0.1
absorptionCrossSection: flat
fissionCrossSection: flat
scatteringCrossSection: flat
absorptionCrossSectionRatio: 0
fissionCrossSectionRatio: 0
scatteringCrossSectionRatio: 1

CrossSection:
name: flat
A: 0
B: 0
C: 0
D: 0
E: 1
nuBar: 2.4
setting GPU
setting parameters
Building partition 0
Building partition 1
Building partition 2
Building partition 3
Building MC_Domain 0
Building MC_Domain 1
Building MC_Domain 2
Building MC_Domain 3
Starting Consistency Check
Finished Consistency Check
Finished initMesh
Started copyMaterialDatabase_device
Finished copyMaterialDatabase_device
Finished copyNuclearData_device
Finished copyDomainDevice
cycle start source rr split absorb scatter fission produce collisn escape census num_seg scalar_flux cycleInit cycleTracking cycleFinalize
0 0 1000000 0 9000000 0 18533189 0 0 18533189 1151780 8848220 55527935 1.854923e+09 3.732040e-01 6.035540e-01 0.000000e+00
1 8848220 1000000 0 151478 0 34281997 0 0 34281997 1664159 8335539 94633679 5.047651e+09 3.386820e-01 7.433280e-01 0.000000e+00
2 8335539 1000000 0 663717 0 34354432 0 0 34354432 1366771 8632485 95010375 7.705930e+09 3.354010e-01 7.606540e-01 0.000000e+00
3 8632485 1000000 0 367978 0 34302727 0 0 34302727 1242216 8758247 94953591 9.992076e+09 3.671940e-01 8.153080e-01 0.000000e+00
4 8758247 1000000 0 242076 0 34141236 0 0 34141236 1168452 8831871 94599337 1.199834e+10 3.340240e-01 7.877320e-01 0.000000e+00
5 8831871 1000000 0 168070 0 33948724 0 0 33948724 1121156 8878785 94148236 1.377636e+10 3.326670e-01 7.637150e-01 0.000000e+00
6 8878785 1000000 0 120572 0 33760567 0 0 33760567 1089103 8910254 93689264 1.535668e+10 3.337340e-01 7.621170e-01 0.000000e+00
7 8910254 1000000 0 89810 0 33552179 0 0 33552179 1065203 8934861 93216931 1.676993e+10 3.335410e-01 7.835010e-01 0.000000e+00
8 8934861 1000000 0 65491 0 33384605 0 0 33384605 1047720 8952632 92768273 1.804559e+10 3.354200e-01 7.854990e-01 0.000000e+00
9 8952632 1000000 0 47165 0 33198494 0 0 33198494 1033968 8965829 92324678 1.920208e+10 3.340620e-01 7.749550e-01 0.000000e+00

Timer Cumulative Cumulative Cumulative Cumulative Cumulative Cumulative
Name number microSecs microSecs microSecs microSecs Efficiency
of calls min avg max stddev Rating
main 1 1.100e+07 1.100e+07 1.100e+07 0.000e+00 100.00
cycleInit 10 3.418e+06 3.418e+06 3.418e+06 0.000e+00 100.00
cycleTracking 10 7.580e+06 7.580e+06 7.580e+06 0.000e+00 100.00
cycleTracking_Kernel 104 4.917e+06 4.917e+06 4.917e+06 0.000e+00 100.00
cycleTracking_MPI 117 1.934e+05 1.934e+05 1.934e+05 0.000e+00 100.00
cycleTracking_Test_Done 0 0.000e+00 0.000e+00 0.000e+00 0.000e+00 0.00
cycleFinalize 20 4.000e+02 4.000e+02 4.000e+02 0.000e+00 100.00
Figure Of Merit 118.84 [Num Mega Segments / Cycle Tracking Time]

Velocity-Bench Sobel Filter

Environment Variables:

OPENCV_IO_MAX_IMAGE_PIXELS=1677721600

Command:

/home/pmdk/bench_workdir/sobel_filter/sobel_filter -i /home/pmdk/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png -n 5

Output:

SYMN: Welcome to the SYCL version of Sobel filter workload.
SYMN: Input image file: /home/pmdk/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png
SYMN: Launching SYCL kernel with # of iterations: 5
time to subtract from total: 7.54385 s
sobelfilter - total time for whole calculation: 0.616496 s

Velocity-Bench svm

Environment Variables:

Command:

/home/pmdk/bench_workdir/svm/svm_sycl /home/pmdk/bench_workdir/velocity-bench-repo/svm/SYCL/a9a /home/pmdk/bench_workdir/velocity-bench-repo/svm/SYCL/a.m

Output:

Number of args 3
Using cuSVM (Carpenter)...

Buffering input text file (6989624 B).
Load Done
Starting Training
_C 1.000000
Workgroup Size: 1024
nbrCtas 80
elemsPerCta 1248
threadsPerCta 128
Total run time: 0.065121 seconds
Iter:100
M:97683
N:123
Train done. Calulate Vector counts
Training done

Loading elapsed time : 0.0646 s
Processing elapsed time : 0.0703 s
Storing elapsed time : 0.0021 s
Total elapsed time : 0.1370 s
Result's are correct: 0.0551

Copy link
Contributor

Compute Benchmarks level_zero run (with params: --filter "Velocity|llama"):
https://github.com/oneapi-src/unified-runtime/actions/runs/12827777489

Copy link
Contributor

Compute Benchmarks level_zero run (--filter "Velocity|llama"):
https://github.com/oneapi-src/unified-runtime/actions/runs/12827777489
Job status: success. Test status: success.

Copy link
Contributor

Compute Benchmarks level_zero run (with params: --filter "Velocity|llama"):
https://github.com/oneapi-src/unified-runtime/actions/runs/12827897391

Copy link
Contributor

Compute Benchmarks level_zero run (--filter "Velocity|llama"):
https://github.com/oneapi-src/unified-runtime/actions/runs/12827897391
Job status: success. Test status: success.

Summary

No diffs to calculate performance change

(result is better)

Performance change in benchmark groups

Relative perf in group api (9): cannot calculate
Benchmark This PR baseline Relative perf Change -
api_overhead_benchmark_l0 SubmitKernel out of order - 11.528000 μs
api_overhead_benchmark_sycl SubmitKernel out of order - 23.678000 μs
api_overhead_benchmark_sycl SubmitKernel in order - 24.844000 μs
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024 - 2.118000 μs
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024 - 1.675000 μs
api_overhead_benchmark_ur SubmitKernel out of order CPU count - 101923.000000 instr
api_overhead_benchmark_ur SubmitKernel out of order - 15.896000 μs
api_overhead_benchmark_ur SubmitKernel in order CPU count - 107041.000000 instr
api_overhead_benchmark_ur SubmitKernel in order - 16.663000 μs
Relative perf in group memory (4): cannot calculate
Benchmark This PR baseline Relative perf Change -
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024 - 253.805000 μs
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024 - 132.929000 μs
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024 - 5.638000 μs
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240 - 3.151000 GB/s
Relative perf in group miscellaneous (1): cannot calculate
Benchmark This PR baseline Relative perf Change -
miscellaneous_benchmark_sycl VectorSum - 858.609000 bw GB/s
Relative perf in group multithread (10): cannot calculate
Benchmark This PR baseline Relative perf Change -
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:1 dstUSM:1 - 6935.535000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:1 dstUSM:1 - 17316.620000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:1 dstUSM:1 - 47907.007000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:1 dstUSM:1 - 2022.915000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:0 dstUSM:1 - 7452.758000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:0 dstUSM:1 - 8555.721000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:0 dstUSM:1 - 25543.132000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:0 dstUSM:1 - 1157.521000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:1, allocSize:1024 srcUSM:0 dstUSM:1 without events - 40973.625000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:4, allocSize:1024 srcUSM:0 dstUSM:1 without events - 108338.415000 μs
Relative perf in group Velocity-Bench (9): cannot calculate
Benchmark This PR baseline Relative perf Change -
Velocity-Bench Hashtable - 362.504819 M keys/sec
Velocity-Bench Bitcracker - 35.129800 s
Velocity-Bench CudaSift - 201.142000 ms
Velocity-Bench Easywave - 229.000000 ms
Velocity-Bench QuickSilver - 117.490000 MMS/CTT
Velocity-Bench Sobel Filter - 602.045000 ms
Velocity-Bench dl-cifar - 23.743900 s
Velocity-Bench dl-mnist - 2.720000 s
Velocity-Bench svm - 0.139900 s
Relative perf in group Runtime (8): cannot calculate
Benchmark This PR baseline Relative perf Change -
Runtime_IndependentDAGTaskThroughput_SingleTask - 259.395000 ms
Runtime_IndependentDAGTaskThroughput_BasicParallelFor - 275.382000 ms
Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor - 278.916000 ms
Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor - 278.736000 ms
Runtime_DAGTaskThroughput_SingleTask - 1678.732000 ms
Runtime_DAGTaskThroughput_BasicParallelFor - 1746.233000 ms
Runtime_DAGTaskThroughput_HierarchicalParallelFor - 1725.256000 ms
Runtime_DAGTaskThroughput_NDRangeParallelFor - 1695.816000 ms
Relative perf in group MicroBench (14): cannot calculate
Benchmark This PR baseline Relative perf Change -
MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous - 4.238000 ms
MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous - 4.317000 ms
MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous - 4.322000 ms
MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous - 4.414000 ms
MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous - 617.994000 ms
MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous - 617.954000 ms
MicroBench_HostDeviceBandwidth_1D_H2D_Strided - 4.547000 ms
MicroBench_HostDeviceBandwidth_2D_H2D_Strided - 4.781000 ms
MicroBench_HostDeviceBandwidth_3D_H2D_Strided - 4.574000 ms
MicroBench_HostDeviceBandwidth_1D_D2H_Strided - 4.702000 ms
MicroBench_HostDeviceBandwidth_2D_D2H_Strided - 617.523000 ms
MicroBench_HostDeviceBandwidth_3D_D2H_Strided - 617.254000 ms
MicroBench_LocalMem_int32_4096 - 29.866000 ms
MicroBench_LocalMem_fp32_4096 - 29.833000 ms
Relative perf in group Pattern (10): cannot calculate
Benchmark This PR baseline Relative perf Change -
Pattern_Reduction_NDRange_int32 - 16.163000 ms
Pattern_Reduction_Hierarchical_int32 - 16.411000 ms
Pattern_SegmentedReduction_NDRange_int16 - 2.264000 ms
Pattern_SegmentedReduction_NDRange_int32 - 2.164000 ms
Pattern_SegmentedReduction_NDRange_int64 - 2.336000 ms
Pattern_SegmentedReduction_NDRange_fp32 - 2.163000 ms
Pattern_SegmentedReduction_Hierarchical_int16 - 11.801000 ms
Pattern_SegmentedReduction_Hierarchical_int32 - 11.599000 ms
Pattern_SegmentedReduction_Hierarchical_int64 - 11.779000 ms
Pattern_SegmentedReduction_Hierarchical_fp32 - 11.589000 ms
Relative perf in group ScalarProduct (6): cannot calculate
Benchmark This PR baseline Relative perf Change -
ScalarProduct_NDRange_int32 - 3.733000 ms
ScalarProduct_NDRange_int64 - 5.456000 ms
ScalarProduct_NDRange_fp32 - 3.759000 ms
ScalarProduct_Hierarchical_int32 - 10.523000 ms
ScalarProduct_Hierarchical_int64 - 11.490000 ms
ScalarProduct_Hierarchical_fp32 - 10.170000 ms
Relative perf in group USM (7): cannot calculate
Benchmark This PR baseline Relative perf Change -
USM_Allocation_latency_fp32_device - 0.068000 ms
USM_Allocation_latency_fp32_host - 37.899000 ms
USM_Allocation_latency_fp32_shared - 0.066000 ms
USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch - 1.661000 ms
USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch - 1.046000 ms
USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch - 1.814000 ms
USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch - 1.195000 ms
Relative perf in group VectorAddition (3): cannot calculate
Benchmark This PR baseline Relative perf Change -
VectorAddition_int32 - 1.448000 ms
VectorAddition_int64 - 3.139000 ms
VectorAddition_fp32 - 1.445000 ms
Relative perf in group Polybench (3): cannot calculate
Benchmark This PR baseline Relative perf Change -
Polybench_2mm - 1.216000 ms
Polybench_3mm - 1.727000 ms
Polybench_Atax - 6.880000 ms
Relative perf in group Kmeans (1): cannot calculate
Benchmark This PR baseline Relative perf Change -
Kmeans_fp32 - 16.083000 ms
Relative perf in group MolecularDynamics (1): cannot calculate
Benchmark This PR baseline Relative perf Change -
MolecularDynamics - 0.028000 ms
Relative perf in group llama.cpp (6): cannot calculate
Benchmark This PR baseline Relative perf Change -
llama.cpp Prompt Processing Batched 128 - 838.869803 token/s
llama.cpp Text Generation Batched 128 - 63.338561 token/s
llama.cpp Prompt Processing Batched 256 - 872.377637 token/s
llama.cpp Text Generation Batched 256 - 63.361520 token/s
llama.cpp Prompt Processing Batched 512 - 434.541716 token/s
llama.cpp Text Generation Batched 512 - 63.295460 token/s
Relative perf in group alloc/max (20): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 glibc - 2589.180000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 glibc - 710.936000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 glibc - 1188.310000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 glibc - 716.901000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 glibc - 861.597000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 glibc - 175.935000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 os_provider - 2246.790000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 os_provider - 187.819000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 os_provider - 1690.250000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 os_provider - 189.702000 ns
alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4 proxy_pool<os_provider> - 4441.700000 ns
alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1 proxy_pool<os_provider> - 256.696000 ns
alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4 proxy_pool<os_provider> - 3268.220000 ns
alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1 proxy_pool<os_provider> - 306.439000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 scalable_pool<os_provider> - 299.852000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 scalable_pool<os_provider> - 213.534000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 scalable_pool<os_provider> - 263.904000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 scalable_pool<os_provider> - 197.833000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 scalable_pool<os_provider> - 1051.720000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 scalable_pool<os_provider> - 952.492000 ns
Relative perf in group multiple (12): cannot calculate
Benchmark This PR baseline Relative perf Change -
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 glibc - 32574.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 glibc - 4128.530000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 glibc - 138399.000000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 glibc - 28197.400000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 proxy_pool<os_provider> - 1161430.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 proxy_pool<os_provider> - 161766.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 os_provider - 1166110.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 os_provider - 141737.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 scalable_pool<os_provider> - 42212.800000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 scalable_pool<os_provider> - 14889.200000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 scalable_pool<os_provider> - 72778.500000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 scalable_pool<os_provider> - 27538.700000 ns

Details

Benchmark details - environment, command, output...

Copy link
Contributor

Compute Benchmarks level_zero run (with params: --filter "Velocity|llama"):
https://github.com/oneapi-src/unified-runtime/actions/runs/12830785458

Copy link
Contributor

Compute Benchmarks level_zero run (with params: --filter "Velocity|llama"):
https://github.com/oneapi-src/unified-runtime/actions/runs/12831096136

Copy link
Contributor

Compute Benchmarks level_zero run (--filter "Velocity|llama"):
https://github.com/oneapi-src/unified-runtime/actions/runs/12831096136
Job status: success. Test status: success.

Summary

Total 15 benchmarks in mean.
Geomean 98.711%.
Improved 1 Regressed 2 (threshold 2.00%)

(result is better)

Performance change in benchmark groups

Relative perf in group Velocity-Bench (9): 98.502%
Benchmark This PR baseline Relative perf Change -
Velocity-Bench dl-mnist 2.380000 s 2.720 s 114.29% 14.29% +++++++
Velocity-Bench Bitcracker 35.024000 s 35.130 s 100.30% 0.30% .
Velocity-Bench QuickSilver 117.820000 MMS/CTT 117.490 MMS/CTT 100.28% 0.28% .
Velocity-Bench dl-cifar 23.768 s 23.743900 s 99.90% -0.10% .
Velocity-Bench Sobel Filter 603.153 ms 602.045000 ms 99.82% -0.18% .
Velocity-Bench svm 0.140 s 0.139900 s 99.79% -0.21% .
Velocity-Bench Hashtable 357.004 M keys/sec 362.504819 M keys/sec 98.48% -1.52% .
Velocity-Bench CudaSift 204.243 ms 201.142000 ms 98.48% -1.52% .
Velocity-Bench Easywave 291.000 ms 229.000000 ms 78.69% -21.31% ----------
Relative perf in group llama.cpp (6): 99.024%
Benchmark This PR baseline Relative perf Change -
llama.cpp Prompt Processing Batched 256 876.139255 token/s 872.378 token/s 100.43% 0.43% .
llama.cpp Prompt Processing Batched 512 433.964 token/s 434.541716 token/s 99.87% -0.13% .
llama.cpp Text Generation Batched 512 62.489 token/s 63.295460 token/s 98.73% -1.27% .
llama.cpp Text Generation Batched 128 62.505 token/s 63.338561 token/s 98.68% -1.32% .
llama.cpp Text Generation Batched 256 62.508 token/s 63.361520 token/s 98.65% -1.35% .
llama.cpp Prompt Processing Batched 128 820.469 token/s 838.869803 token/s 97.81% -2.19% -
Relative perf in group api (9): cannot calculate
Benchmark This PR baseline Relative perf Change -
api_overhead_benchmark_l0 SubmitKernel out of order - 11.528000 μs
api_overhead_benchmark_sycl SubmitKernel out of order - 23.678000 μs
api_overhead_benchmark_sycl SubmitKernel in order - 24.844000 μs
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024 - 2.118000 μs
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024 - 1.675000 μs
api_overhead_benchmark_ur SubmitKernel out of order CPU count - 101923.000000 instr
api_overhead_benchmark_ur SubmitKernel out of order - 15.896000 μs
api_overhead_benchmark_ur SubmitKernel in order CPU count - 107041.000000 instr
api_overhead_benchmark_ur SubmitKernel in order - 16.663000 μs
Relative perf in group memory (4): cannot calculate
Benchmark This PR baseline Relative perf Change -
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024 - 253.805000 μs
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024 - 132.929000 μs
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024 - 5.638000 μs
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240 - 3.151000 GB/s
Relative perf in group miscellaneous (1): cannot calculate
Benchmark This PR baseline Relative perf Change -
miscellaneous_benchmark_sycl VectorSum - 858.609000 bw GB/s
Relative perf in group multithread (10): cannot calculate
Benchmark This PR baseline Relative perf Change -
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:1 dstUSM:1 - 6935.535000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:1 dstUSM:1 - 17316.620000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:1 dstUSM:1 - 47907.007000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:1 dstUSM:1 - 2022.915000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:0 dstUSM:1 - 7452.758000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:0 dstUSM:1 - 8555.721000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:0 dstUSM:1 - 25543.132000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:0 dstUSM:1 - 1157.521000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:1, allocSize:1024 srcUSM:0 dstUSM:1 without events - 40973.625000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:4, allocSize:1024 srcUSM:0 dstUSM:1 without events - 108338.415000 μs
Relative perf in group Runtime (8): cannot calculate
Benchmark This PR baseline Relative perf Change -
Runtime_IndependentDAGTaskThroughput_SingleTask - 259.395000 ms
Runtime_IndependentDAGTaskThroughput_BasicParallelFor - 275.382000 ms
Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor - 278.916000 ms
Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor - 278.736000 ms
Runtime_DAGTaskThroughput_SingleTask - 1678.732000 ms
Runtime_DAGTaskThroughput_BasicParallelFor - 1746.233000 ms
Runtime_DAGTaskThroughput_HierarchicalParallelFor - 1725.256000 ms
Runtime_DAGTaskThroughput_NDRangeParallelFor - 1695.816000 ms
Relative perf in group MicroBench (14): cannot calculate
Benchmark This PR baseline Relative perf Change -
MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous - 4.238000 ms
MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous - 4.317000 ms
MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous - 4.322000 ms
MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous - 4.414000 ms
MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous - 617.994000 ms
MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous - 617.954000 ms
MicroBench_HostDeviceBandwidth_1D_H2D_Strided - 4.547000 ms
MicroBench_HostDeviceBandwidth_2D_H2D_Strided - 4.781000 ms
MicroBench_HostDeviceBandwidth_3D_H2D_Strided - 4.574000 ms
MicroBench_HostDeviceBandwidth_1D_D2H_Strided - 4.702000 ms
MicroBench_HostDeviceBandwidth_2D_D2H_Strided - 617.523000 ms
MicroBench_HostDeviceBandwidth_3D_D2H_Strided - 617.254000 ms
MicroBench_LocalMem_int32_4096 - 29.866000 ms
MicroBench_LocalMem_fp32_4096 - 29.833000 ms
Relative perf in group Pattern (10): cannot calculate
Benchmark This PR baseline Relative perf Change -
Pattern_Reduction_NDRange_int32 - 16.163000 ms
Pattern_Reduction_Hierarchical_int32 - 16.411000 ms
Pattern_SegmentedReduction_NDRange_int16 - 2.264000 ms
Pattern_SegmentedReduction_NDRange_int32 - 2.164000 ms
Pattern_SegmentedReduction_NDRange_int64 - 2.336000 ms
Pattern_SegmentedReduction_NDRange_fp32 - 2.163000 ms
Pattern_SegmentedReduction_Hierarchical_int16 - 11.801000 ms
Pattern_SegmentedReduction_Hierarchical_int32 - 11.599000 ms
Pattern_SegmentedReduction_Hierarchical_int64 - 11.779000 ms
Pattern_SegmentedReduction_Hierarchical_fp32 - 11.589000 ms
Relative perf in group ScalarProduct (6): cannot calculate
Benchmark This PR baseline Relative perf Change -
ScalarProduct_NDRange_int32 - 3.733000 ms
ScalarProduct_NDRange_int64 - 5.456000 ms
ScalarProduct_NDRange_fp32 - 3.759000 ms
ScalarProduct_Hierarchical_int32 - 10.523000 ms
ScalarProduct_Hierarchical_int64 - 11.490000 ms
ScalarProduct_Hierarchical_fp32 - 10.170000 ms
Relative perf in group USM (7): cannot calculate
Benchmark This PR baseline Relative perf Change -
USM_Allocation_latency_fp32_device - 0.068000 ms
USM_Allocation_latency_fp32_host - 37.899000 ms
USM_Allocation_latency_fp32_shared - 0.066000 ms
USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch - 1.661000 ms
USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch - 1.046000 ms
USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch - 1.814000 ms
USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch - 1.195000 ms
Relative perf in group VectorAddition (3): cannot calculate
Benchmark This PR baseline Relative perf Change -
VectorAddition_int32 - 1.448000 ms
VectorAddition_int64 - 3.139000 ms
VectorAddition_fp32 - 1.445000 ms
Relative perf in group Polybench (3): cannot calculate
Benchmark This PR baseline Relative perf Change -
Polybench_2mm - 1.216000 ms
Polybench_3mm - 1.727000 ms
Polybench_Atax - 6.880000 ms
Relative perf in group Kmeans (1): cannot calculate
Benchmark This PR baseline Relative perf Change -
Kmeans_fp32 - 16.083000 ms
Relative perf in group MolecularDynamics (1): cannot calculate
Benchmark This PR baseline Relative perf Change -
MolecularDynamics - 0.028000 ms
Relative perf in group alloc/max (20): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 glibc - 2589.180000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 glibc - 710.936000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 glibc - 1188.310000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 glibc - 716.901000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 glibc - 861.597000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 glibc - 175.935000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 os_provider - 2246.790000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 os_provider - 187.819000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 os_provider - 1690.250000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 os_provider - 189.702000 ns
alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4 proxy_pool<os_provider> - 4441.700000 ns
alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1 proxy_pool<os_provider> - 256.696000 ns
alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4 proxy_pool<os_provider> - 3268.220000 ns
alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1 proxy_pool<os_provider> - 306.439000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 scalable_pool<os_provider> - 299.852000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 scalable_pool<os_provider> - 213.534000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 scalable_pool<os_provider> - 263.904000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 scalable_pool<os_provider> - 197.833000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 scalable_pool<os_provider> - 1051.720000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 scalable_pool<os_provider> - 952.492000 ns
Relative perf in group multiple (12): cannot calculate
Benchmark This PR baseline Relative perf Change -
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 glibc - 32574.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 glibc - 4128.530000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 glibc - 138399.000000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 glibc - 28197.400000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 proxy_pool<os_provider> - 1161430.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 proxy_pool<os_provider> - 161766.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 os_provider - 1166110.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 os_provider - 141737.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 scalable_pool<os_provider> - 42212.800000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 scalable_pool<os_provider> - 14889.200000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 scalable_pool<os_provider> - 72778.500000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 scalable_pool<os_provider> - 27538.700000 ns

Details

Benchmark details - environment, command, output...
Velocity-Bench Hashtable

Environment Variables:

Command:

/home/pmdk/bench_workdir/hashtable/hashtable_sycl --no-verify

Output:

hashtable - total time for whole calculation: 0.375956 s
357.003659 million keys/second

Velocity-Bench Bitcracker

Environment Variables:

Command:

/home/pmdk/bench_workdir/bitcracker/bitcracker -f /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt -d /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt -b 60000

Output:

---------> BitCracker: BitLocker password cracking tool <---------

==================================
Retrieving Info

Reading hash file "/home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt"

              Attack

================================================
Type of attack: User Password
Psw per thread: 1
max_num_pswd_per_read: 60000
Dictionary: /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt
MAC Comparison (-m): Yes

Iter: 1, num passwords read: 60000
Kernel execution:
Effective passwords: 60000
Passwords Range:
npknpByH7N2m3OnLNH1X9DJxLrzIFWk
.....
dL_7uuf3QCz-c6K3xDu0

================================================
Bitcracker attack completed
Total passwords evaluated: 60000
Password not found!

time to subtract from total: 0.00378219 s
bitcracker - total time for whole calculation: 35.024 s

Velocity-Bench CudaSift

Environment Variables:

Command:

/home/pmdk/bench_workdir/cudaSift/cudaSift

Output:

UNKN:

UNKN: ==================================================
UNKN: User input parameters:
UNKN: Trace: ../../inputData
UNKN: ==================================================
UNKN:

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1110 1264 30.1385% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1207 1256 32.7722% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1232 1269 33.451% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1058 1271 28.7266% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1122 1264 30.4643% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1091 1259 29.6226% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1077 1272 29.2425% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1109 1257 30.1113% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1141 1250 30.9802% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1229 1269 33.3695% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1232 1264 33.451% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1239 1276 33.6411% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1148 1258 31.1702% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1092 1265 29.6497% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1105 1256 30.0027% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1237 1270 33.5868% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1083 1253 29.4054% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1237 1279 33.5868% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1151 1262 31.2517% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1101 1277 29.8941% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1159 1265 31.4689% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1224 1267 33.2338% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1228 1260 33.3424% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1107 1254 30.057% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1219 1253 33.098% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1153 1261 31.306% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1233 1267 33.4781% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1076 1261 29.2153% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1225 1258 33.2609% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1033 1261 28.0478% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1127 1285 30.6001% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1180 1266 32.0391% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1103 1265 29.9484% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1086 1255 29.4868% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1107 1267 30.057% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1120 1271 30.41% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1122 1269 30.4643% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1097 1265 29.7855% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1091 1260 29.6226% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1202 1258 32.6364% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1100 1257 29.867% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1054 1264 28.618% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1113 1277 30.2199% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1236 1272 33.5596% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1091 1252 29.6226% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1245 1277 33.804% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1236 1271 33.5596% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1152 1275 31.2788% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1091 1259 29.6226% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1236 1269 33.5596% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Avg workload time = 204.243 ms

Velocity-Bench Easywave

Environment Variables:

Command:

/home/pmdk/bench_workdir/easywave/easyWave_sycl -grid /home/pmdk/bench_workdir/data/easywave/examples/e2Asean.grd -source /home/pmdk/bench_workdir/data/easywave/examples/BengkuluSept2007.flt -time 120

Output:

MAIN: Starting SYCL main program
MAIN: Attempting to clean up previous eWave tsunami files
MAIN: Clean up completed
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.6.0)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
MAIN: Program successfully completed

Velocity-Bench QuickSilver

Environment Variables:

QS_DEVICE=GPU

Command:

/home/pmdk/bench_workdir/QuickSilver/qs -i /home/pmdk/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp

Output:

Copyright (c) 2016
Lawrence Livermore National Security, LLC
All Rights Reserved
Quicksilver Version :
Quicksilver Git Hash :
MPI Version : 3.0
Number of MPI ranks : 1
Number of OpenMP Threads: 1
Number of OpenMP CPUs : 1

Loading params
Finished loading params
Simulation:
dt: 1e-08
fMax: 0.1
inputFile: /home/pmdk/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp
energySpectrum:
boundaryCondition: octant
loadBalance: 1
cycleTimers: 0
debugThreads: 0
lx: 100
ly: 100
lz: 100
nParticles: 10000000
batchSize: 0
nBatches: 10
nSteps: 10
nx: 10
ny: 10
nz: 10
seed: 1029384756
xDom: 0
yDom: 0
zDom: 0
eMax: 20
eMin: 1e-09
nGroups: 230
lowWeightCutoff: 0.001
bTally: 1
fTally: 1
cTally: 1
coralBenchmark: 0
crossSectionsOut:

Geometry:
material: sourceMaterial
shape: brick
xMax: 100
xMin: 0
yMax: 100
yMin: 0
zMax: 100
zMin: 0

Material:
name: sourceMaterial
mass: 1000
nIsotopes: 10
nReactions: 9
sourceRate: 1e+10
totalCrossSection: 0.1
absorptionCrossSection: flat
fissionCrossSection: flat
scatteringCrossSection: flat
absorptionCrossSectionRatio: 0
fissionCrossSectionRatio: 0
scatteringCrossSectionRatio: 1

CrossSection:
name: flat
A: 0
B: 0
C: 0
D: 0
E: 1
nuBar: 2.4
setting GPU
setting parameters
Building partition 0
Building partition 1
Building partition 2
Building partition 3
Building MC_Domain 0
Building MC_Domain 1
Building MC_Domain 2
Building MC_Domain 3
Starting Consistency Check
Finished Consistency Check
Finished initMesh
Started copyMaterialDatabase_device
Finished copyMaterialDatabase_device
Finished copyNuclearData_device
Finished copyDomainDevice
cycle start source rr split absorb scatter fission produce collisn escape census num_seg scalar_flux cycleInit cycleTracking cycleFinalize
0 0 1000000 0 9000000 0 18533189 0 0 18533189 1151780 8848220 55527935 1.854923e+09 3.683340e-01 6.247260e-01 0.000000e+00
1 8848220 1000000 0 151478 0 34281997 0 0 34281997 1664159 8335539 94633679 5.047651e+09 3.335790e-01 7.658220e-01 0.000000e+00
2 8335539 1000000 0 663717 0 34354432 0 0 34354432 1366771 8632485 95010375 7.705930e+09 3.327160e-01 7.809610e-01 0.000000e+00
3 8632485 1000000 0 367978 0 34302727 0 0 34302727 1242216 8758247 94953591 9.992076e+09 3.647570e-01 8.337120e-01 0.000000e+00
4 8758247 1000000 0 242076 0 34141236 0 0 34141236 1168452 8831871 94599337 1.199834e+10 3.289860e-01 7.894370e-01 0.000000e+00
5 8831871 1000000 0 168070 0 33948724 0 0 33948724 1121156 8878785 94148236 1.377636e+10 3.308450e-01 7.648480e-01 0.000000e+00
6 8878785 1000000 0 120572 0 33760567 0 0 33760567 1089103 8910254 93689264 1.535668e+10 3.285020e-01 7.632040e-01 0.000000e+00
7 8910254 1000000 0 89810 0 33552179 0 0 33552179 1065203 8934861 93216931 1.676993e+10 3.289090e-01 7.833600e-01 0.000000e+00
8 8934861 1000000 0 65491 0 33384605 0 0 33384605 1047720 8952632 92768273 1.804559e+10 3.299260e-01 7.820970e-01 0.000000e+00
9 8952632 1000000 0 47165 0 33198494 0 0 33198494 1033968 8965829 92324678 1.920208e+10 3.297780e-01 7.583050e-01 0.000000e+00

Timer Cumulative Cumulative Cumulative Cumulative Cumulative Cumulative
Name number microSecs microSecs microSecs microSecs Efficiency
of calls min avg max stddev Rating
main 1 1.102e+07 1.102e+07 1.102e+07 0.000e+00 100.00
cycleInit 10 3.376e+06 3.376e+06 3.376e+06 0.000e+00 100.00
cycleTracking 10 7.646e+06 7.646e+06 7.646e+06 0.000e+00 100.00
cycleTracking_Kernel 104 4.923e+06 4.923e+06 4.923e+06 0.000e+00 100.00
cycleTracking_MPI 117 1.983e+05 1.983e+05 1.983e+05 0.000e+00 100.00
cycleTracking_Test_Done 0 0.000e+00 0.000e+00 0.000e+00 0.000e+00 0.00
cycleFinalize 20 3.940e+02 3.940e+02 3.940e+02 0.000e+00 100.00
Figure Of Merit 117.82 [Num Mega Segments / Cycle Tracking Time]

Velocity-Bench Sobel Filter

Environment Variables:

OPENCV_IO_MAX_IMAGE_PIXELS=1677721600

Command:

/home/pmdk/bench_workdir/sobel_filter/sobel_filter -i /home/pmdk/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png -n 5

Output:

SYMN: Welcome to the SYCL version of Sobel filter workload.
SYMN: Input image file: /home/pmdk/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png
SYMN: Launching SYCL kernel with # of iterations: 5
time to subtract from total: 7.47787 s
sobelfilter - total time for whole calculation: 0.603153 s

Velocity-Bench dl-cifar

Environment Variables:

Command:

/home/pmdk/bench_workdir/dl-cifar/dl-cifar_sycl

Output:

	Welcome to DL-CIFAR workload: SYCL version.

=======================================================================
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.6.0)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.6.0)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero

WL PARAMS:

WL PARAMS: ==================================================
WL PARAMS: User input parameters:
WL PARAMS: Trace: notrace
WL PARAMS: DL NW size type: WORKLOAD_DEFAULT_SIZE
WL PARAMS: ==================================================
WL PARAMS:

dataFileReadTimer->getTotalOpTime(): 8.3e-05 s
dl-cifar - total time for whole calculation: 23.7676 s

Velocity-Bench dl-mnist

Environment Variables:

NEOReadDebugKeys=1
DisableScratchPages=0

Command:

/home/pmdk/bench_workdir/dl-mnist/dl-mnist-sycl -conv_algo ONEDNN_AUTO

Output:

	Welcome to DL-MNIST workload: SYCL version.

=======================================================================
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.6.0)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.6.0)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero

WL PARAMS:

WL PARAMS: ==================================================
WL PARAMS: User input parameters:
WL PARAMS: Trace: notrace
WL PARAMS: Tensor management policy: per_layer
WL PARAMS: Convolution algorithm: ONEDNN_AUTO
WL PARAMS: Dataset reader format: NCHW
WL PARAMS: Dry run: YES
WL PARAMS: OneDNN Conv PD memory format: ONEDNN_CONVPD_ANY
WL PARAMS: No of iterations for inference: 400
WL PARAMS: ==================================================
WL PARAMS:

dl-mnist - total time for whole calculation: 2.38 s

Velocity-Bench svm

Environment Variables:

Command:

/home/pmdk/bench_workdir/svm/svm_sycl /home/pmdk/bench_workdir/velocity-bench-repo/svm/SYCL/a9a /home/pmdk/bench_workdir/velocity-bench-repo/svm/SYCL/a.m

Output:

Number of args 3
Using cuSVM (Carpenter)...

Buffering input text file (6989624 B).
Load Done
Starting Training
_C 1.000000
Workgroup Size: 1024
nbrCtas 80
elemsPerCta 1248
threadsPerCta 128
Total run time: 0.069558 seconds
Iter:100
M:97683
N:123
Train done. Calulate Vector counts
Training done

Loading elapsed time : 0.0635 s
Processing elapsed time : 0.0746 s
Storing elapsed time : 0.0022 s
Total elapsed time : 0.1402 s
Result's are correct: 0.0551

llama.cpp Prompt Processing Batched 128

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T14:50:59Z","624035249","1138101","820.468655","1.495311"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T14:51:05Z","2046198163","3388056","62.555173","0.103427"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T14:51:16Z","596034794","2223027","859.019787","3.195430"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T14:51:19Z","2041815363","2781417","62.689405","0.085256"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T14:51:30Z","1177059850","2109545","434.983248","0.780093"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T14:51:37Z","2043667779","1684319","62.632523","0.051575"

llama.cpp Text Generation Batched 128

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T14:50:12Z","622833487","3982906","822.076392","5.242947"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T14:50:17Z","2047834651","3673052","62.505207","0.111848"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T14:50:27Z","590749963","2085110","866.703546","3.050356"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T14:50:30Z","2047735501","2282252","62.508135","0.069584"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T14:50:41Z","1193106369","3793554","429.135367","1.363894"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T14:50:48Z","2049336342","1604062","62.459275","0.048811"

llama.cpp Prompt Processing Batched 256

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T14:49:25Z","610344115","2082904","838.878846","2.860757"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T14:49:29Z","2049660997","6106221","62.449794","0.185531"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T14:49:40Z","584391498","2634990","876.139255","3.952773"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T14:49:44Z","2048635145","1791229","62.480661","0.054543"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T14:49:54Z","1187188089","5260466","431.277947","1.911165"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T14:50:01Z","2047552387","2222372","62.513722","0.067758"

llama.cpp Text Generation Batched 256

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T14:50:12Z","622833487","3982906","822.076392","5.242947"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T14:50:17Z","2047834651","3673052","62.505207","0.111848"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T14:50:27Z","590749963","2085110","866.703546","3.050356"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T14:50:30Z","2047735501","2282252","62.508135","0.069584"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T14:50:41Z","1193106369","3793554","429.135367","1.363894"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T14:50:48Z","2049336342","1604062","62.459275","0.048811"

llama.cpp Prompt Processing Batched 512

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T14:48:33Z","1030558656","935784333","707.082371","289.459706"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T14:48:43Z","2051201447","2965608","62.402556","0.090070"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T14:48:53Z","584120053","4029469","876.565256","6.001231"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T14:48:57Z","2048069094","2539018","62.497968","0.077348"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T14:49:07Z","1179837902","4857048","433.963813","1.790029"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T14:49:14Z","2050458207","897792","62.425081","0.027329"

llama.cpp Text Generation Batched 512

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T14:47:46Z","629702900","27468397","814.276980","34.257272"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T14:47:51Z","2044416365","2766885","62.609647","0.084592"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T14:48:01Z","576340317","1953063","888.372175","3.009091"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T14:48:05Z","2046070008","1956101","62.559000","0.059740"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T14:48:15Z","1171789578","5961410","436.947551","2.219789"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T14:48:22Z","2048353129","1381979","62.489248","0.042126"

Copy link
Contributor

Compute Benchmarks level_zero run (with params: ):
https://github.com/oneapi-src/unified-runtime/actions/runs/12831425140

Copy link
Contributor

Compute Benchmarks level_zero run ():
https://github.com/oneapi-src/unified-runtime/actions/runs/12831425140
Job status: success. Test status: success.

Summary

Total 92 benchmarks in mean.
Geomean 97.944%.
Improved 3 Regressed 30 (threshold 2.00%)

(result is better)

Performance change in benchmark groups

Relative perf in group api (12): 97.031%
Benchmark This PR baseline Relative perf Change -
api_overhead_benchmark_sycl SubmitKernel in order 24.437000 μs 24.844 μs 101.67% 1.67% .
api_overhead_benchmark_sycl SubmitKernel out of order 23.425000 μs 23.678 μs 101.08% 1.08% .
api_overhead_benchmark_ur SubmitKernel in order 16.573000 μs 16.663 μs 100.54% 0.54% .
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024 2.144 μs 2.118000 μs 98.79% -1.21% .
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024 1.703 μs 1.675000 μs 98.36% -1.64% .
api_overhead_benchmark_l0 SubmitKernel out of order 11.868 μs 11.528000 μs 97.14% -2.86% -
api_overhead_benchmark_ur SubmitKernel out of order CPU count 105463.000 instr 101923.000000 instr 96.64% -3.36% -
api_overhead_benchmark_ur SubmitKernel in order CPU count 110815.000 instr 107041.000000 instr 96.59% -3.41% -
api_overhead_benchmark_ur SubmitKernel out of order 18.979 μs 15.896000 μs 83.76% -16.24% ----
api_overhead_benchmark_l0 SubmitKernel in order 11.709000 μs -
api_overhead_benchmark_ur SubmitKernel in order with measure completion CPU count 123991.000000 instr -
api_overhead_benchmark_ur SubmitKernel in order with measure completion 21.478000 μs -
Relative perf in group memory (4): 86.077%
Benchmark This PR baseline Relative perf Change -
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024 258.349 μs 253.805000 μs 98.24% -1.76% .
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240 3.059 GB/s 3.151000 GB/s 97.08% -2.92% -
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024 5.859 μs 5.638000 μs 96.23% -3.77% -
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024 222.228 μs 132.929000 μs 59.82% -40.18% ----------
Relative perf in group miscellaneous (1): 100.034%
Benchmark This PR baseline Relative perf Change -
miscellaneous_benchmark_sycl VectorSum 858.316000 bw GB/s 858.609 bw GB/s 100.03% 0.03% .
Relative perf in group multithread (10): 97.418%
Benchmark This PR baseline Relative perf Change -
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:1 dstUSM:1 46984.823000 μs 47907.007 μs 101.96% 1.96% .
multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:1 dstUSM:1 17286.617000 μs 17316.620 μs 100.17% 0.17% .
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:1 dstUSM:1 6927.714000 μs 6935.535 μs 100.11% 0.11% .
multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:1 dstUSM:1 2053.062 μs 2022.915000 μs 98.53% -1.47% .
multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:0 dstUSM:1 8846.523 μs 8555.721000 μs 96.71% -3.29% -
multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:1, allocSize:1024 srcUSM:0 dstUSM:1 without events 42548.498 μs 40973.625000 μs 96.30% -3.70% -
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:0 dstUSM:1 7775.891 μs 7452.758000 μs 95.84% -4.16% -
multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:0 dstUSM:1 1211.537 μs 1157.521000 μs 95.54% -4.46% -
multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:4, allocSize:1024 srcUSM:0 dstUSM:1 without events 113757.337 μs 108338.415000 μs 95.24% -4.76% -
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:0 dstUSM:1 27153.283 μs 25543.132000 μs 94.07% -5.93% -
Relative perf in group Velocity-Bench (9): 98.289%
Benchmark This PR baseline Relative perf Change -
Velocity-Bench dl-mnist 2.380000 s 2.720 s 114.29% 14.29% ++++
Velocity-Bench QuickSilver 118.360000 MMS/CTT 117.490 MMS/CTT 100.74% 0.74% .
Velocity-Bench svm 0.139700 s 0.140 s 100.14% 0.14% .
Velocity-Bench Bitcracker 35.185 s 35.129800 s 99.84% -0.16% .
Velocity-Bench dl-cifar 23.848 s 23.743900 s 99.56% -0.44% .
Velocity-Bench CudaSift 203.736 ms 201.142000 ms 98.73% -1.27% .
Velocity-Bench Hashtable 355.823 M keys/sec 362.504819 M keys/sec 98.16% -1.84% .
Velocity-Bench Sobel Filter 618.891 ms 602.045000 ms 97.28% -2.72% -
Velocity-Bench Easywave 289.000 ms 229.000000 ms 79.24% -20.76% -----
Relative perf in group Runtime (8): 100.672%
Benchmark This PR baseline Relative perf Change -
Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor 273.737000 ms 278.916 ms 101.89% 1.89% .
Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor 273.604000 ms 278.736 ms 101.88% 1.88% .
Runtime_IndependentDAGTaskThroughput_SingleTask 255.713000 ms 259.395 ms 101.44% 1.44% .
Runtime_IndependentDAGTaskThroughput_BasicParallelFor 273.291000 ms 275.382 ms 100.77% 0.77% .
Runtime_DAGTaskThroughput_SingleTask 1679.346 ms 1678.732000 ms 99.96% -0.04% .
Runtime_DAGTaskThroughput_NDRangeParallelFor 1696.629 ms 1695.816000 ms 99.95% -0.05% .
Runtime_DAGTaskThroughput_HierarchicalParallelFor 1726.628 ms 1725.256000 ms 99.92% -0.08% .
Runtime_DAGTaskThroughput_BasicParallelFor 1753.295 ms 1746.233000 ms 99.60% -0.40% .
Relative perf in group MicroBench (14): 95.453%
Benchmark This PR baseline Relative perf Change -
MicroBench_HostDeviceBandwidth_2D_D2H_Strided 617.446000 ms 617.523 ms 100.01% 0.01% .
MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous 618.095 ms 617.994000 ms 99.98% -0.02% .
MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous 618.113 ms 617.954000 ms 99.97% -0.03% .
MicroBench_HostDeviceBandwidth_3D_D2H_Strided 617.470 ms 617.254000 ms 99.97% -0.03% .
MicroBench_LocalMem_int32_4096 29.887 ms 29.866000 ms 99.93% -0.07% .
MicroBench_LocalMem_fp32_4096 29.897 ms 29.833000 ms 99.79% -0.21% .
MicroBench_HostDeviceBandwidth_1D_D2H_Strided 4.890 ms 4.702000 ms 96.16% -3.84% -
MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous 4.746 ms 4.414000 ms 93.00% -7.00% --
MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous 4.650 ms 4.317000 ms 92.84% -7.16% --
MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous 4.662 ms 4.322000 ms 92.71% -7.29% --
MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous 4.583 ms 4.238000 ms 92.47% -7.53% --
MicroBench_HostDeviceBandwidth_1D_H2D_Strided 4.923 ms 4.547000 ms 92.36% -7.64% --
MicroBench_HostDeviceBandwidth_2D_H2D_Strided 5.241 ms 4.781000 ms 91.22% -8.78% --
MicroBench_HostDeviceBandwidth_3D_H2D_Strided 5.244 ms 4.574000 ms 87.22% -12.78% ---
Relative perf in group Pattern (10): 100.237%
Benchmark This PR baseline Relative perf Change -
Pattern_Reduction_Hierarchical_int32 16.187000 ms 16.411 ms 101.38% 1.38% .
Pattern_Reduction_NDRange_int32 15.977000 ms 16.163 ms 101.16% 1.16% .
Pattern_SegmentedReduction_Hierarchical_int32 11.589000 ms 11.599 ms 100.09% 0.09% .
Pattern_SegmentedReduction_Hierarchical_fp32 11.586000 ms 11.589 ms 100.03% 0.03% .
Pattern_SegmentedReduction_Hierarchical_int16 11.800000 ms 11.801 ms 100.01% 0.01% .
Pattern_SegmentedReduction_NDRange_int32 2.164000 ms 2.164 ms 100.00% 0.00% .
Pattern_SegmentedReduction_Hierarchical_int64 11.781 ms 11.779000 ms 99.98% -0.02% .
Pattern_SegmentedReduction_NDRange_int16 2.265 ms 2.264000 ms 99.96% -0.04% .
Pattern_SegmentedReduction_NDRange_int64 2.338 ms 2.336000 ms 99.91% -0.09% .
Pattern_SegmentedReduction_NDRange_fp32 2.166 ms 2.163000 ms 99.86% -0.14% .
Relative perf in group ScalarProduct (6): 99.988%
Benchmark This PR baseline Relative perf Change -
ScalarProduct_NDRange_int64 5.445000 ms 5.456 ms 100.20% 0.20% .
ScalarProduct_NDRange_fp32 3.754000 ms 3.759 ms 100.13% 0.13% .
ScalarProduct_Hierarchical_int64 11.486000 ms 11.490 ms 100.03% 0.03% .
ScalarProduct_Hierarchical_fp32 10.174 ms 10.170000 ms 99.96% -0.04% .
ScalarProduct_Hierarchical_int32 10.529 ms 10.523000 ms 99.94% -0.06% .
ScalarProduct_NDRange_int32 3.746 ms 3.733000 ms 99.65% -0.35% .
Relative perf in group USM (7): 101.259%
Benchmark This PR baseline Relative perf Change -
USM_Allocation_latency_fp32_shared 0.056000 ms 0.066 ms 117.86% 17.86% ++++
USM_Allocation_latency_fp32_device 0.065000 ms 0.068 ms 104.62% 4.62% +
USM_Allocation_latency_fp32_host 37.781000 ms 37.899 ms 100.31% 0.31% .
USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch 1.701 ms 1.661000 ms 97.65% -2.35% -
USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch 1.868 ms 1.814000 ms 97.11% -2.89% -
USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch 1.084 ms 1.046000 ms 96.49% -3.51% -
USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch 1.239 ms 1.195000 ms 96.45% -3.55% -
Relative perf in group VectorAddition (3): 98.848%
Benchmark This PR baseline Relative perf Change -
VectorAddition_int64 3.107000 ms 3.139 ms 101.03% 1.03% .
VectorAddition_int32 1.464 ms 1.448000 ms 98.91% -1.09% .
VectorAddition_fp32 1.495 ms 1.445000 ms 96.66% -3.34% -
Relative perf in group Polybench (3): 100.099%
Benchmark This PR baseline Relative perf Change -
Polybench_Atax 6.851000 ms 6.880 ms 100.42% 0.42% .
Polybench_2mm 1.214000 ms 1.216 ms 100.16% 0.16% .
Polybench_3mm 1.732 ms 1.727000 ms 99.71% -0.29% .
Relative perf in group Kmeans (1): 99.907%
Benchmark This PR baseline Relative perf Change -
Kmeans_fp32 16.098 ms 16.083000 ms 99.91% -0.09% .
Relative perf in group MolecularDynamics (1): 96.552%
Benchmark This PR baseline Relative perf Change -
MolecularDynamics 0.029 ms 0.028000 ms 96.55% -3.45% -
Relative perf in group llama.cpp (6): 98.866%
Benchmark This PR baseline Relative perf Change -
llama.cpp Prompt Processing Batched 512 434.334 token/s 434.541716 token/s 99.95% -0.05% .
llama.cpp Prompt Processing Batched 256 867.383 token/s 872.377637 token/s 99.43% -0.57% .
llama.cpp Text Generation Batched 512 62.469 token/s 63.295460 token/s 98.69% -1.31% .
llama.cpp Text Generation Batched 256 62.518 token/s 63.361520 token/s 98.67% -1.33% .
llama.cpp Text Generation Batched 128 62.493 token/s 63.338561 token/s 98.66% -1.34% .
llama.cpp Prompt Processing Batched 128 820.452 token/s 838.869803 token/s 97.80% -2.20% -
Relative perf in group alloc/size:10000/0/4096/iterations:200000/threads:4 (4): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/size:10000/0/4096/iterations:200000/threads:4 glibc 2658.030000 ns -
alloc/size:10000/0/4096/iterations:200000/threads:4 os_provider 2303.340000 ns -
alloc/size:10000/0/4096/iterations:200000/threads:4 proxy_pool<os_provider> 3040.100000 ns -
alloc/size:10000/0/4096/iterations:200000/threads:4 scalable_pool<os_provider> 290.618000 ns -
Relative perf in group alloc/size:10000/0/4096/iterations:200000/threads:1 (4): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/size:10000/0/4096/iterations:200000/threads:1 glibc 698.246000 ns -
alloc/size:10000/0/4096/iterations:200000/threads:1 os_provider 201.409000 ns -
alloc/size:10000/0/4096/iterations:200000/threads:1 proxy_pool<os_provider> 266.979000 ns -
alloc/size:10000/0/4096/iterations:200000/threads:1 scalable_pool<os_provider> 217.624000 ns -
Relative perf in group alloc/size:10000/100000/4096/iterations:200000/threads:4 (4): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/size:10000/100000/4096/iterations:200000/threads:4 glibc 1236.290000 ns -
alloc/size:10000/100000/4096/iterations:200000/threads:4 os_provider 1955.980000 ns -
alloc/size:10000/100000/4096/iterations:200000/threads:4 proxy_pool<os_provider> 3471.010000 ns -
alloc/size:10000/100000/4096/iterations:200000/threads:4 scalable_pool<os_provider> 262.192000 ns -
Relative perf in group alloc/size:10000/100000/4096/iterations:200000/threads:1 (4): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/size:10000/100000/4096/iterations:200000/threads:1 glibc 727.605000 ns -
alloc/size:10000/100000/4096/iterations:200000/threads:1 os_provider 190.480000 ns -
alloc/size:10000/100000/4096/iterations:200000/threads:1 proxy_pool<os_provider> 305.486000 ns -
alloc/size:10000/100000/4096/iterations:200000/threads:1 scalable_pool<os_provider> 201.022000 ns -
Relative perf in group alloc/min (4): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4 glibc 792.249000 ns -
alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1 glibc 173.996000 ns -
alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4 scalable_pool<os_provider> 1105.120000 ns -
alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1 scalable_pool<os_provider> 973.144000 ns -
Relative perf in group multiple (24): cannot calculate
Benchmark This PR baseline Relative perf Change -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 glibc 32383.000000 ns -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 glibc 4236.290000 ns -
multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4 glibc 137759.000000 ns -
multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1 glibc 31771.100000 ns -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 proxy_pool<os_provider> 1179870.000000 ns -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 proxy_pool<os_provider> 165010.000000 ns -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 os_provider 1186630.000000 ns -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 os_provider 146181.000000 ns -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 scalable_pool<os_provider> 41351.400000 ns -
multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 scalable_pool<os_provider> 14804.300000 ns -
multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4 scalable_pool<os_provider> 76055.000000 ns -
multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1 scalable_pool<os_provider> 28517.100000 ns -
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 glibc - 32574.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 glibc - 4128.530000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 glibc - 138399.000000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 glibc - 28197.400000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 proxy_pool<os_provider> - 1161430.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 proxy_pool<os_provider> - 161766.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 os_provider - 1166110.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 os_provider - 141737.000000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 scalable_pool<os_provider> - 42212.800000 ns
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 scalable_pool<os_provider> - 14889.200000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 scalable_pool<os_provider> - 72778.500000 ns
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 scalable_pool<os_provider> - 27538.700000 ns
Relative perf in group alloc/max (20): cannot calculate
Benchmark This PR baseline Relative perf Change -
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 glibc - 2589.180000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 glibc - 710.936000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 glibc - 1188.310000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 glibc - 716.901000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 glibc - 861.597000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 glibc - 175.935000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 os_provider - 2246.790000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 os_provider - 187.819000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 os_provider - 1690.250000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 os_provider - 189.702000 ns
alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4 proxy_pool<os_provider> - 4441.700000 ns
alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1 proxy_pool<os_provider> - 256.696000 ns
alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4 proxy_pool<os_provider> - 3268.220000 ns
alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1 proxy_pool<os_provider> - 306.439000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 scalable_pool<os_provider> - 299.852000 ns
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 scalable_pool<os_provider> - 213.534000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 scalable_pool<os_provider> - 263.904000 ns
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 scalable_pool<os_provider> - 197.833000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 scalable_pool<os_provider> - 1051.720000 ns
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 scalable_pool<os_provider> - 952.492000 ns

Details

Benchmark details - environment, command, output...
api_overhead_benchmark_l0 SubmitKernel out of order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_l0 --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=l0 Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),11.909,11.868,2.40%,11.292,55.011,[CPU],[us]

api_overhead_benchmark_l0 SubmitKernel in order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_l0 --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=l0 Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),11.760,11.709,6.28%,11.115,223.968,[CPU],[us]

api_overhead_benchmark_sycl SubmitKernel out of order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=sycl Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),23.575,23.425,3.77%,22.696,257.133,[CPU],[us]

api_overhead_benchmark_sycl SubmitKernel in order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=sycl Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),24.584,24.437,3.74%,23.575,266.415,[CPU],[us]

memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=QueueInOrderMemcpy --csv --noHeaders --iterations=10000 --IsCopyOnly=0 --sourcePlacement=Device --destinationPlacement=Device --size=1024 --count=100

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
QueueInOrderMemcpy(api=sycl IsCopyOnly=0 sourcePlacement=Device destinationPlacement=Device size=1KB count=100),259.760,258.349,1.78%,254.003,530.512,[CPU],[us]

memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=QueueInOrderMemcpy --csv --noHeaders --iterations=10000 --IsCopyOnly=0 --sourcePlacement=Host --destinationPlacement=Device --size=1024 --count=100

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
QueueInOrderMemcpy(api=sycl IsCopyOnly=0 sourcePlacement=Host destinationPlacement=Device size=1KB count=100),233.250,222.228,18.25%,218.421,515.162,[CPU],[us]

memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=QueueMemcpy --csv --noHeaders --iterations=10000 --sourcePlacement=Device --destinationPlacement=Device --size=1024

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
QueueMemcpy(api=sycl sourcePlacement=Device destinationPlacement=Device size=1KB),6.201,5.859,14.80%,5.408,38.754,[CPU],[us]

memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=StreamMemory --csv --noHeaders --iterations=10000 --type=Triad --size=10240 --memoryPlacement=Device --useEvents=0 --contents=Zeros --multiplier=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
StreamMemory(api=sycl type=Triad size=10KB useEvents=0 contents=Zeros memoryPlacement=Device multiplier=1),2.999,3.059,6.43%,0.366,3.251,[CPU],[GB/s]

api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=ExecImmediateCopyQueue --csv --noHeaders --iterations=100000 --ioq=0 --IsCopyOnly=1 --MeasureCompletionTime=0 --src=Device --dst=Device --size=1024

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
ExecImmediateCopyQueue(api=sycl IsCopyOnly=1 MeasureCompletionTime=0 src=Device dst=Device size=1KB ioq=0),2.152,2.144,10.95%,1.934,67.172,[CPU],[us]

api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=ExecImmediateCopyQueue --csv --noHeaders --iterations=100000 --ioq=1 --IsCopyOnly=1 --MeasureCompletionTime=0 --src=Host --dst=Host --size=1024

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
ExecImmediateCopyQueue(api=sycl IsCopyOnly=1 MeasureCompletionTime=0 src=Host dst=Host size=1KB ioq=1),1.713,1.703,4.86%,1.587,7.604,[CPU],[us]

miscellaneous_benchmark_sycl VectorSum

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/miscellaneous_benchmark_sycl --test=VectorSum --csv --noHeaders --iterations=1000 --numberOfElementsX=512 --numberOfElementsY=256 --numberOfElementsZ=256

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
VectorSum(api=sycl numberOfElementsX=512 numberOfElementsY=256 numberOfElementsZ=256),857.556,858.316,0.52%,810.755,865.697,[GPU],bw [GB/s]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=1 --NumOpsPerThread=400 --iterations=10 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=1 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),6949.586,6927.714,0.79%,6895.019,7075.238,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=8 --NumOpsPerThread=100 --iterations=10 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=100 NumThreads=8 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),17186.917,17286.617,2.99%,15879.004,17735.479,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=8 --NumOpsPerThread=400 --iterations=1000 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=8 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),46932.711,46984.823,1.80%,43607.679,49684.659,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=16 --NumOpsPerThread=10 --iterations=10000 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=10 NumThreads=16 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),2143.999,2053.062,40.65%,1532.587,18040.770,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=1 --NumOpsPerThread=400 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=1 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),7813.349,7775.891,1.43%,7676.946,8075.266,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=8 --NumOpsPerThread=100 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=100 NumThreads=8 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),9016.518,8846.523,3.61%,8792.928,9909.426,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=8 --NumOpsPerThread=400 --iterations=1000 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=8 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),27295.871,27153.283,1.80%,26142.483,28876.172,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=16 --NumOpsPerThread=10 --iterations=10000 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=10 NumThreads=16 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),1329.420,1211.537,67.71%,941.432,18957.320,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:1, allocSize:1024 srcUSM:0 dstUSM:1 without events

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=0 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=1 --NumOpsPerThread=4096 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=4096 NumThreads=1 AllocSize=1024 MeasureCompletion=1 UseEvents=0 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),42554.844,42548.498,0.38%,42309.047,42841.107,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:4, allocSize:1024 srcUSM:0 dstUSM:1 without events

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=0 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=4 --NumOpsPerThread=4096 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=4096 NumThreads=4 AllocSize=1024 MeasureCompletion=1 UseEvents=0 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),113761.660,113757.337,0.32%,113164.859,114352.675,[CPU],[us]

api_overhead_benchmark_ur SubmitKernel out of order CPU count

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),105523.699,105463.000,6.01%,105355.000,2107735.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),19.070,18.802,255.15%,16.138,15403.115,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel out of order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),105523.699,105463.000,6.01%,105355.000,2107734.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),19.145,18.979,272.96%,16.730,16543.909,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel in order CPU count

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),110882.265,110815.000,3.98%,110815.000,1500622.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),16.737,16.573,276.47%,15.898,14645.554,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel in order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),110882.265,110815.000,3.98%,110815.000,1500622.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),16.737,16.573,276.47%,15.898,14645.554,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel in order with measure completion CPU count

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=1 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=1),124425.849,123991.000,3.80%,123364.000,1588989.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=1),22.052,21.474,325.13%,20.299,22690.039,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel in order with measure completion

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=1 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=1),124473.844,123991.000,3.80%,123677.000,1587105.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=1),21.810,21.478,209.81%,20.138,14486.366,[CPU],time [us]

Velocity-Bench Hashtable

Environment Variables:

Command:

/home/pmdk/bench_workdir/hashtable/hashtable_sycl --no-verify

Output:

hashtable - total time for whole calculation: 0.377204 s
355.822816 million keys/second

Velocity-Bench Bitcracker

Environment Variables:

Command:

/home/pmdk/bench_workdir/bitcracker/bitcracker -f /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt -d /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt -b 60000

Output:

---------> BitCracker: BitLocker password cracking tool <---------

==================================
Retrieving Info

Reading hash file "/home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt"

              Attack

================================================
Type of attack: User Password
Psw per thread: 1
max_num_pswd_per_read: 60000
Dictionary: /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt
MAC Comparison (-m): Yes

Iter: 1, num passwords read: 60000
Kernel execution:
Effective passwords: 60000
Passwords Range:
npknpByH7N2m3OnLNH1X9DJxLrzIFWk
.....
dL_7uuf3QCz-c6K3xDu0

================================================
Bitcracker attack completed
Total passwords evaluated: 60000
Password not found!

time to subtract from total: 0.00383903 s
bitcracker - total time for whole calculation: 35.1849 s

Velocity-Bench CudaSift

Environment Variables:

Command:

/home/pmdk/bench_workdir/cudaSift/cudaSift

Output:

UNKN:

UNKN: ==================================================
UNKN: User input parameters:
UNKN: Trace: ../../inputData
UNKN: ==================================================
UNKN:

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1222 1260 33.1795% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1220 1261 33.1252% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1116 1267 30.3014% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1229 1261 33.3695% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1225 1260 33.2609% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1242 1276 33.7225% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1223 1261 33.2066% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1164 1273 31.6047% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1139 1261 30.9259% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1102 1251 29.9213% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1223 1259 33.2066% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1144 1250 31.0616% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1213 1269 32.9351% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1210 1269 32.8537% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1105 1259 30.0027% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1239 1274 33.6411% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1229 1264 33.3695% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1231 1270 33.4238% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1089 1247 29.5683% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1218 1270 33.0709% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1127 1268 30.6001% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1230 1266 33.3967% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1049 1251 28.4822% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1109 1265 30.1113% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1109 1264 30.1113% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1231 1265 33.4238% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1235 1272 33.5324% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1232 1264 33.451% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1223 1260 33.2066% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1121 1266 30.4371% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1087 1270 29.514% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1125 1265 30.5458% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1236 1273 33.5596% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1096 1256 29.7583% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1240 1271 33.6682% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1220 1257 33.1252% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1066 1258 28.9438% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1110 1259 30.1385% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1111 1264 30.1656% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1229 1262 33.3695% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1057 1259 28.6994% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1225 1260 33.2609% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1176 1259 31.9305% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1206 1272 32.745% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1052 1261 28.5637% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1229 1265 33.3695% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1233 1268 33.4781% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1203 1254 32.6636% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1149 1265 31.1974% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1220 1254 33.1252% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Avg workload time = 203.736 ms

Velocity-Bench Easywave

Environment Variables:

Command:

/home/pmdk/bench_workdir/easywave/easyWave_sycl -grid /home/pmdk/bench_workdir/data/easywave/examples/e2Asean.grd -source /home/pmdk/bench_workdir/data/easywave/examples/BengkuluSept2007.flt -time 120

Output:

MAIN: Starting SYCL main program
MAIN: Attempting to clean up previous eWave tsunami files
MAIN: Clean up completed
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.6.0)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
MAIN: Program successfully completed

Velocity-Bench QuickSilver

Environment Variables:

QS_DEVICE=GPU

Command:

/home/pmdk/bench_workdir/QuickSilver/qs -i /home/pmdk/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp

Output:

Copyright (c) 2016
Lawrence Livermore National Security, LLC
All Rights Reserved
Quicksilver Version :
Quicksilver Git Hash :
MPI Version : 3.0
Number of MPI ranks : 1
Number of OpenMP Threads: 1
Number of OpenMP CPUs : 1

Loading params
Finished loading params
Simulation:
dt: 1e-08
fMax: 0.1
inputFile: /home/pmdk/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp
energySpectrum:
boundaryCondition: octant
loadBalance: 1
cycleTimers: 0
debugThreads: 0
lx: 100
ly: 100
lz: 100
nParticles: 10000000
batchSize: 0
nBatches: 10
nSteps: 10
nx: 10
ny: 10
nz: 10
seed: 1029384756
xDom: 0
yDom: 0
zDom: 0
eMax: 20
eMin: 1e-09
nGroups: 230
lowWeightCutoff: 0.001
bTally: 1
fTally: 1
cTally: 1
coralBenchmark: 0
crossSectionsOut:

Geometry:
material: sourceMaterial
shape: brick
xMax: 100
xMin: 0
yMax: 100
yMin: 0
zMax: 100
zMin: 0

Material:
name: sourceMaterial
mass: 1000
nIsotopes: 10
nReactions: 9
sourceRate: 1e+10
totalCrossSection: 0.1
absorptionCrossSection: flat
fissionCrossSection: flat
scatteringCrossSection: flat
absorptionCrossSectionRatio: 0
fissionCrossSectionRatio: 0
scatteringCrossSectionRatio: 1

CrossSection:
name: flat
A: 0
B: 0
C: 0
D: 0
E: 1
nuBar: 2.4
setting GPU
setting parameters
Building partition 0
Building partition 1
Building partition 2
Building partition 3
Building MC_Domain 0
Building MC_Domain 1
Building MC_Domain 2
Building MC_Domain 3
Starting Consistency Check
Finished Consistency Check
Finished initMesh
Started copyMaterialDatabase_device
Finished copyMaterialDatabase_device
Finished copyNuclearData_device
Finished copyDomainDevice
cycle start source rr split absorb scatter fission produce collisn escape census num_seg scalar_flux cycleInit cycleTracking cycleFinalize
0 0 1000000 0 9000000 0 18533189 0 0 18533189 1151780 8848220 55527935 1.854923e+09 4.324920e-01 6.081300e-01 0.000000e+00
1 8848220 1000000 0 151478 0 34281997 0 0 34281997 1664159 8335539 94633679 5.047651e+09 3.365440e-01 7.448300e-01 0.000000e+00
2 8335539 1000000 0 663717 0 34354432 0 0 34354432 1366771 8632485 95010375 7.705930e+09 3.357800e-01 7.619450e-01 0.000000e+00
3 8632485 1000000 0 367978 0 34302727 0 0 34302727 1242216 8758247 94953591 9.992076e+09 3.668010e-01 8.262260e-01 0.000000e+00
4 8758247 1000000 0 242076 0 34141236 0 0 34141236 1168452 8831871 94599337 1.199834e+10 3.424160e-01 7.907400e-01 0.000000e+00
5 8831871 1000000 0 168070 0 33948724 0 0 33948724 1121156 8878785 94148236 1.377636e+10 3.318040e-01 7.644380e-01 0.000000e+00
6 8878785 1000000 0 120572 0 33760567 0 0 33760567 1089103 8910254 93689264 1.535668e+10 3.318630e-01 7.632580e-01 0.000000e+00
7 8910254 1000000 0 89810 0 33552179 0 0 33552179 1065203 8934861 93216931 1.676993e+10 3.309150e-01 7.844910e-01 0.000000e+00
8 8934861 1000000 0 65491 0 33384605 0 0 33384605 1047720 8952632 92768273 1.804559e+10 3.309060e-01 7.912120e-01 0.000000e+00
9 8952632 1000000 0 47165 0 33198494 0 0 33198494 1033968 8965829 92324678 1.920208e+10 3.310230e-01 7.763190e-01 0.000000e+00

Timer Cumulative Cumulative Cumulative Cumulative Cumulative Cumulative
Name number microSecs microSecs microSecs microSecs Efficiency
of calls min avg max stddev Rating
main 1 1.108e+07 1.108e+07 1.108e+07 0.000e+00 100.00
cycleInit 10 3.471e+06 3.471e+06 3.471e+06 0.000e+00 100.00
cycleTracking 10 7.612e+06 7.612e+06 7.612e+06 0.000e+00 100.00
cycleTracking_Kernel 104 4.923e+06 4.923e+06 4.923e+06 0.000e+00 100.00
cycleTracking_MPI 117 2.016e+05 2.016e+05 2.016e+05 0.000e+00 100.00
cycleTracking_Test_Done 0 0.000e+00 0.000e+00 0.000e+00 0.000e+00 0.00
cycleFinalize 20 4.060e+02 4.060e+02 4.060e+02 0.000e+00 100.00
Figure Of Merit 118.36 [Num Mega Segments / Cycle Tracking Time]

Velocity-Bench Sobel Filter

Environment Variables:

OPENCV_IO_MAX_IMAGE_PIXELS=1677721600

Command:

/home/pmdk/bench_workdir/sobel_filter/sobel_filter -i /home/pmdk/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png -n 5

Output:

SYMN: Welcome to the SYCL version of Sobel filter workload.
SYMN: Input image file: /home/pmdk/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png
SYMN: Launching SYCL kernel with # of iterations: 5
time to subtract from total: 7.52563 s
sobelfilter - total time for whole calculation: 0.618891 s

Velocity-Bench dl-cifar

Environment Variables:

Command:

/home/pmdk/bench_workdir/dl-cifar/dl-cifar_sycl

Output:

	Welcome to DL-CIFAR workload: SYCL version.

=======================================================================
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.6.0)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.6.0)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero

WL PARAMS:

WL PARAMS: ==================================================
WL PARAMS: User input parameters:
WL PARAMS: Trace: notrace
WL PARAMS: DL NW size type: WORKLOAD_DEFAULT_SIZE
WL PARAMS: ==================================================
WL PARAMS:

dataFileReadTimer->getTotalOpTime(): 8.2e-05 s
dl-cifar - total time for whole calculation: 23.8482 s

Velocity-Bench dl-mnist

Environment Variables:

NEOReadDebugKeys=1
DisableScratchPages=0

Command:

/home/pmdk/bench_workdir/dl-mnist/dl-mnist-sycl -conv_algo ONEDNN_AUTO

Output:

	Welcome to DL-MNIST workload: SYCL version.

=======================================================================
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.6.0)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.6.0)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero

WL PARAMS:

WL PARAMS: ==================================================
WL PARAMS: User input parameters:
WL PARAMS: Trace: notrace
WL PARAMS: Tensor management policy: per_layer
WL PARAMS: Convolution algorithm: ONEDNN_AUTO
WL PARAMS: Dataset reader format: NCHW
WL PARAMS: Dry run: YES
WL PARAMS: OneDNN Conv PD memory format: ONEDNN_CONVPD_ANY
WL PARAMS: No of iterations for inference: 400
WL PARAMS: ==================================================
WL PARAMS:

dl-mnist - total time for whole calculation: 2.38 s

Velocity-Bench svm

Environment Variables:

Command:

/home/pmdk/bench_workdir/svm/svm_sycl /home/pmdk/bench_workdir/velocity-bench-repo/svm/SYCL/a9a /home/pmdk/bench_workdir/velocity-bench-repo/svm/SYCL/a.m

Output:

Number of args 3
Using cuSVM (Carpenter)...

Buffering input text file (6989624 B).
Load Done
Starting Training
_C 1.000000
Workgroup Size: 1024
nbrCtas 80
elemsPerCta 1248
threadsPerCta 128
Total run time: 0.068748 seconds
Iter:100
M:97683
N:123
Train done. Calulate Vector counts
Training done

Loading elapsed time : 0.0638 s
Processing elapsed time : 0.0737 s
Storing elapsed time : 0.0022 s
Total elapsed time : 0.1397 s
Result's are correct: 0.0551

Runtime_IndependentDAGTaskThroughput_SingleTask

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_SingleTask', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.255548', '0.255713', '0.254647', '0.254647 0.255713 0.256284', '0.000831', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_IndependentDAGTaskThroughput_BasicParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_BasicParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.277266', '0.273291', '0.271832', '0.271832 0.273291 0.286674', '0.008180', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.274833', '0.273737', '0.272863', '0.272863 0.273737 0.277899', '0.002691', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.284393', '0.273604', '0.271884', '0.271884 0.273604 0.307691', '0.020195', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_SingleTask

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_SingleTask', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.681926', '1.679346', '1.679122', '1.679122 1.679346 1.687310', '0.004664', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_BasicParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_BasicParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.753150', '1.753295', '1.749926', '1.749926 1.753295 1.756228', '0.003153', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_HierarchicalParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_HierarchicalParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.726054', '1.726628', '1.724615', '1.724615 1.726628 1.726918', '0.001255', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_NDRangeParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_NDRangeParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.695228', '1.696629', '1.688456', '1.688456 1.696629 1.700599', '0.006192', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.005654', '0.004583', '0.004510', '0.004510 0.004583 0.007870', '0.001919', '27.717280', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004694', '0.004650', '0.004640', '0.004640 0.004650 0.004794', '0.000086', '26.942042', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004688', '0.004662', '0.004625', '0.004625 0.004662 0.004777', '0.000079', '27.026618', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004751', '0.004746', '0.004745', '0.004745 0.004746 0.004763', '0.000010', '26.344441', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.618096', '0.618095', '0.618091', '0.618091 0.618095 0.618104', '0.000007', '0.202236', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.618110', '0.618113', '0.618080', '0.618080 0.618113 0.618136', '0.000028', '0.202239', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_1D_H2D_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_H2D_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004809', '0.004923', '0.004408', '0.004408 0.004923 0.005096', '0.000358', '28.358535', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_H2D_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_H2D_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.005155', '0.005241', '0.004966', '0.004966 0.005241 0.005259', '0.000164', '25.170779', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_H2D_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_H2D_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.005164', '0.005244', '0.004987', '0.004987 0.005244 0.005260', '0.000153', '25.064039', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_1D_D2H_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_D2H_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.005034', '0.004890', '0.004885', '0.004885 0.004890 0.005328', '0.000254', '25.587562', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_D2H_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_D2H_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.617443', '0.617446', '0.617418', '0.617418 0.617446 0.617465', '0.000024', '0.202456', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_D2H_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_D2H_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.617481', '0.617470', '0.617456', '0.617456 0.617470 0.617519', '0.000033', '0.202443', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_LocalMem_int32_4096

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/local_mem --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/LocalMem_multi.csv --size=10240000

Output:

['MicroBench_LocalMem_int32_4096', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.029856', '0.029887', '0.029787', '0.029787 0.029887 0.029894', '0.000060', '10474.412320', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '312.000000']

MicroBench_LocalMem_fp32_4096

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/local_mem --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/LocalMem_multi.csv --size=10240000

Output:

['MicroBench_LocalMem_fp32_4096', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.029909', '0.029897', '0.029896', '0.029896 0.029897 0.029934', '0.000021', '10436.198651', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '312.000000']

Pattern_Reduction_NDRange_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_Reduction_multi.csv --size=10240000

Output:

['Pattern_Reduction_NDRange_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.016069', '0.015977', '0.015888', '0.015888 0.015977 0.016344', '0.000242', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_Reduction_Hierarchical_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_Reduction_multi.csv --size=10240000

Output:

['Pattern_Reduction_Hierarchical_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.016296', '0.016187', '0.016105', '0.016105 0.016187 0.016595', '0.000263', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_NDRange_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_NDRange_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.003760', '0.003746', '0.003733', '0.003733 0.003746 0.003800', '0.000036', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_NDRange_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_NDRange_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.005447', '0.005445', '0.005439', '0.005439 0.005445 0.005457', '0.000009', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_NDRange_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_NDRange_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.003762', '0.003754', '0.003733', '0.003733 0.003754 0.003798', '0.000033', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_Hierarchical_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_Hierarchical_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.010539', '0.010529', '0.010521', '0.010521 0.010529 0.010565', '0.000023', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_Hierarchical_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_Hierarchical_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011503', '0.011486', '0.011481', '0.011481 0.011486 0.011542', '0.000034', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_Hierarchical_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_Hierarchical_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.010181', '0.010174', '0.010162', '0.010162 0.010174 0.010206', '0.000023', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_int16

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_int16', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002273', '0.002265', '0.002264', '0.002264 0.002265 0.002291', '0.000015', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002166', '0.002164', '0.002163', '0.002163 0.002164 0.002171', '0.000004', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002343', '0.002338', '0.002338', '0.002338 0.002338 0.002352', '0.000008', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002167', '0.002166', '0.002157', '0.002157 0.002166 0.002179', '0.000011', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_int16

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_int16', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011804', '0.011800', '0.011799', '0.011799 0.011800 0.011813', '0.000008', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011589', '0.011589', '0.011571', '0.011571 0.011589 0.011606', '0.000017', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011785', '0.011781', '0.011767', '0.011767 0.011781 0.011807', '0.000020', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011590', '0.011586', '0.011562', '0.011562 0.011586 0.011623', '0.000030', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Allocation_latency_fp32_device

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Allocation_latency_multi.csv --size=1024000000

Output:

['USM_Allocation_latency_fp32_device', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '1024000000', '0.000060', '0.000065', '0.000046', '0.000046 0.000065 0.000070', '0.000013', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Allocation_latency_fp32_host

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Allocation_latency_multi.csv --size=1024000000

Output:

['USM_Allocation_latency_fp32_host', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '1024000000', '0.037803', '0.037781', '0.037758', '0.037758 0.037781 0.037870', '0.000059', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Allocation_latency_fp32_shared

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Allocation_latency_multi.csv --size=1024000000

Output:

['USM_Allocation_latency_fp32_shared', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '1024000000', '0.000058', '0.000056', '0.000056', '0.000056 0.000056 0.000062', '0.000003', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.002291', '0.001701', '0.001671', '0.001671 0.001701 0.003502', '0.001049', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.001089', '0.001084', '0.001083', '0.001083 0.001084 0.001099', '0.000009', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.001875', '0.001868', '0.001832', '0.001832 0.001868 0.001924', '0.000046', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.001240', '0.001239', '0.001234', '0.001234 0.001239 0.001246', '0.000006', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

VectorAddition_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/VectorAddition_multi.csv --size=102400000

Output:

['VectorAddition_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.001471', '0.001464', '0.001448', '0.001448 0.001464 0.001502', '0.000028', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

VectorAddition_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/VectorAddition_multi.csv --size=102400000

Output:

['VectorAddition_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.003092', '0.003107', '0.003061', '0.003061 0.003107 0.003108', '0.000027', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

VectorAddition_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/VectorAddition_multi.csv --size=102400000

Output:

['VectorAddition_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.001501', '0.001495', '0.001463', '0.001463 0.001495 0.001545', '0.000041', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Polybench_2mm

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/2mm --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/2mm.csv --size=512

Output:

['Polybench_2mm', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.001219', '0.001214', '0.001213', '0.001213 0.001214 0.001230', '0.000009', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Polybench_3mm

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/3mm --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/3mm.csv --size=512

Output:

['Polybench_3mm', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.001731', '0.001732', '0.001724', '0.001724 0.001732 0.001738', '0.000007', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Polybench_Atax

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/atax --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Atax.csv --size=8192

Output:

['Polybench_Atax', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.006846', '0.006851', '0.006811', '0.006811 0.006851 0.006876', '0.000033', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Kmeans_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/kmeans --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Kmeans.csv --size=700000000

Output:

['Kmeans_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '700000000', '0.016097', '0.016098', '0.016089', '0.016089 0.016098 0.016102', '0.000007', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

MolecularDynamics

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/mol_dyn --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/MolecularDynamics.csv --size=8196

Output:

['MolecularDynamics', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8196', '0.000037', '0.000029', '0.000026', '0.000026 0.000029 0.000056', '0.000017', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

llama.cpp Prompt Processing Batched 128

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T15:23:52Z","624050880","1962122","820.452434","2.588568"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T15:23:59Z","2052079648","2790126","62.375839","0.084663"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T15:24:09Z","589379839","5582837","868.772591","8.295913"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T15:24:13Z","2047404863","1542492","62.518196","0.047048"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T15:24:23Z","1166560886","2229949","438.898219","0.838496"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T15:24:30Z","2049012774","2420356","62.469178","0.073722"

llama.cpp Text Generation Batched 128

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T15:23:04Z","617813589","625380","828.729616","0.838513"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T15:23:10Z","2048243329","3807220","62.492747","0.115901"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T15:23:20Z","590297291","3416368","867.382799","5.019326"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T15:23:23Z","2047712537","1690071","62.508808","0.051540"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T15:23:34Z","1186926317","2377804","431.367671","0.862937"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T15:23:41Z","2050688101","1816009","62.418112","0.055212"

llama.cpp Prompt Processing Batched 256

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T15:23:04Z","617813589","625380","828.729616","0.838513"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T15:23:10Z","2048243329","3807220","62.492747","0.115901"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T15:23:20Z","590297291","3416368","867.382799","5.019326"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T15:23:23Z","2047712537","1690071","62.508808","0.051540"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T15:23:34Z","1186926317","2377804","431.367671","0.862937"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T15:23:41Z","2050688101","1816009","62.418112","0.055212"

llama.cpp Text Generation Batched 256

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T15:23:52Z","624050880","1962122","820.452434","2.588568"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T15:23:59Z","2052079648","2790126","62.375839","0.084663"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T15:24:09Z","589379839","5582837","868.772591","8.295913"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T15:24:13Z","2047404863","1542492","62.518196","0.047048"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T15:24:23Z","1166560886","2229949","438.898219","0.838496"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T15:24:30Z","2049012774","2420356","62.469178","0.073722"

llama.cpp Prompt Processing Batched 512

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T15:24:41Z","630689473","14448267","812.145469","18.308719"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T15:24:48Z","2046818027","3888633","62.536272","0.118558"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T15:24:59Z","591048029","1901095","866.265012","2.780289"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T15:25:02Z","2045772695","2709634","62.568134","0.082731"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T15:25:13Z","1178821161","2881630","434.334288","1.062752"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T15:25:20Z","2043558023","1950145","62.635899","0.059671"

llama.cpp Text Generation Batched 512

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T15:23:52Z","624050880","1962122","820.452434","2.588568"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T15:23:59Z","2052079648","2790126","62.375839","0.084663"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T15:24:09Z","589379839","5582837","868.772591","8.295913"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T15:24:13Z","2047404863","1542492","62.518196","0.047048"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2025-01-17T15:24:23Z","1166560886","2229949","438.898219","0.838496"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2025-01-17T15:24:30Z","2049012774","2420356","62.469178","0.073722"

alloc/size:10000/0/4096/iterations:200000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2658.03,1827.24,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,698.246,698.245,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1220.42,1166.92,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,727.605,727.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,792.249,762.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.844,173.845,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2414.69,2411.63,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,208.606,208.6,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1971.29,1971.15,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.18,192.175,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3160.56,3109.3,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,265.381,265.372,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3558.06,3523.14,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,303.983,303.977,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,292.458,291.12,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,213.202,213.195,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,262.192,261.252,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,201.463,201.459,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1105.12,1091,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,973.144,973.13,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32383,30060.1,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4244.16,4244.07,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,139439,87930.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31771.1,31770.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.17987e+06,1.17945e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,163453,163449,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.20498e+06,1.20471e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,146181,146176,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41351.4,40627.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15092.3,15091.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,74197.3,73713.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,24895.7,24895.3,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2658.03,1827.24,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,698.246,698.245,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1220.42,1166.92,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,727.605,727.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,792.249,762.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.844,173.845,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2414.69,2411.63,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,208.606,208.6,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1971.29,1971.15,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.18,192.175,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3160.56,3109.3,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,265.381,265.372,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3558.06,3523.14,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,303.983,303.977,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,292.458,291.12,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,213.202,213.195,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,262.192,261.252,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,201.463,201.459,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1105.12,1091,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,973.144,973.13,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32383,30060.1,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4244.16,4244.07,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,139439,87930.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31771.1,31770.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.17987e+06,1.17945e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,163453,163449,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.20498e+06,1.20471e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,146181,146176,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41351.4,40627.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15092.3,15091.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,74197.3,73713.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,24895.7,24895.3,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2694.58,1936.97,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,694.401,694.399,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1236.29,1176.27,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,718.034,717.846,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,803.298,774.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.996,173.999,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2303.34,2300.97,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,201.409,201.402,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1955.98,1954.69,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,189.704,189.699,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3040.1,2995.02,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.016,270.008,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3471.01,3423.12,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,306.145,306.141,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,287.785,280.659,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,219.811,219.815,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,259.626,258.937,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.643,200.644,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1117.46,1101.67,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,975.645,975.607,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,33393.2,31864.6,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4203.56,4203.42,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137759,87750.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31840.4,31840.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21672e+06,1.21644e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,165010,165008,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18663e+06,1.18632e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,143109,143102,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,40579.8,40284.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14644.2,14643.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76055,75840.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29529.5,29529,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2658.03,1827.24,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,698.246,698.245,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1220.42,1166.92,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,727.605,727.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,792.249,762.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.844,173.845,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2414.69,2411.63,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,208.606,208.6,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1971.29,1971.15,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.18,192.175,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3160.56,3109.3,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,265.381,265.372,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3558.06,3523.14,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,303.983,303.977,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,292.458,291.12,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,213.202,213.195,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,262.192,261.252,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,201.463,201.459,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1105.12,1091,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,973.144,973.13,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32383,30060.1,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4244.16,4244.07,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,139439,87930.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31771.1,31770.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.17987e+06,1.17945e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,163453,163449,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.20498e+06,1.20471e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,146181,146176,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41351.4,40627.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15092.3,15091.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,74197.3,73713.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,24895.7,24895.3,ns,,,,,

alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2658.03,1827.24,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,698.246,698.245,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1220.42,1166.92,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,727.605,727.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,792.249,762.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.844,173.845,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2414.69,2411.63,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,208.606,208.6,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1971.29,1971.15,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.18,192.175,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3160.56,3109.3,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,265.381,265.372,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3558.06,3523.14,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,303.983,303.977,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,292.458,291.12,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,213.202,213.195,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,262.192,261.252,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,201.463,201.459,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1105.12,1091,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,973.144,973.13,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32383,30060.1,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4244.16,4244.07,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,139439,87930.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31771.1,31770.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.17987e+06,1.17945e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,163453,163449,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.20498e+06,1.20471e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,146181,146176,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41351.4,40627.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15092.3,15091.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,74197.3,73713.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,24895.7,24895.3,ns,,,,,

alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2694.58,1936.97,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,694.401,694.399,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1236.29,1176.27,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,718.034,717.846,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,803.298,774.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.996,173.999,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2303.34,2300.97,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,201.409,201.402,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1955.98,1954.69,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,189.704,189.699,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3040.1,2995.02,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.016,270.008,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3471.01,3423.12,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,306.145,306.141,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,287.785,280.659,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,219.811,219.815,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,259.626,258.937,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.643,200.644,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1117.46,1101.67,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,975.645,975.607,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,33393.2,31864.6,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4203.56,4203.42,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137759,87750.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31840.4,31840.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21672e+06,1.21644e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,165010,165008,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18663e+06,1.18632e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,143109,143102,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,40579.8,40284.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14644.2,14643.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76055,75840.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29529.5,29529,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:4 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2694.58,1936.97,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,694.401,694.399,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1236.29,1176.27,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,718.034,717.846,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,803.298,774.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.996,173.999,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2303.34,2300.97,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,201.409,201.402,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1955.98,1954.69,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,189.704,189.699,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3040.1,2995.02,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.016,270.008,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3471.01,3423.12,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,306.145,306.141,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,287.785,280.659,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,219.811,219.815,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,259.626,258.937,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.643,200.644,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1117.46,1101.67,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,975.645,975.607,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,33393.2,31864.6,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4203.56,4203.42,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137759,87750.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31840.4,31840.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21672e+06,1.21644e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,165010,165008,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18663e+06,1.18632e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,143109,143102,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,40579.8,40284.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14644.2,14643.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76055,75840.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29529.5,29529,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:1 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2694.58,1936.97,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,694.401,694.399,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1236.29,1176.27,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,718.034,717.846,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,803.298,774.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.996,173.999,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2303.34,2300.97,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,201.409,201.402,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1955.98,1954.69,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,189.704,189.699,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3040.1,2995.02,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.016,270.008,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3471.01,3423.12,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,306.145,306.141,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,287.785,280.659,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,219.811,219.815,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,259.626,258.937,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.643,200.644,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1117.46,1101.67,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,975.645,975.607,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,33393.2,31864.6,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4203.56,4203.42,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137759,87750.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31840.4,31840.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21672e+06,1.21644e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,165010,165008,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18663e+06,1.18632e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,143109,143102,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,40579.8,40284.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14644.2,14643.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76055,75840.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29529.5,29529,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:4 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2694.58,1936.97,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,694.401,694.399,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1236.29,1176.27,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,718.034,717.846,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,803.298,774.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.996,173.999,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2303.34,2300.97,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,201.409,201.402,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1955.98,1954.69,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,189.704,189.699,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3040.1,2995.02,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.016,270.008,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3471.01,3423.12,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,306.145,306.141,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,287.785,280.659,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,219.811,219.815,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,259.626,258.937,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.643,200.644,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1117.46,1101.67,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,975.645,975.607,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,33393.2,31864.6,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4203.56,4203.42,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137759,87750.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31840.4,31840.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21672e+06,1.21644e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,165010,165008,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18663e+06,1.18632e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,143109,143102,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,40579.8,40284.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14644.2,14643.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76055,75840.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29529.5,29529,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:1 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2214.69,1802.42,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,707.46,707.464,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1253.12,1205.35,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,731.641,731.644,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,783.132,755.561,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,180.129,180.128,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2094.73,2093.89,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,182.433,182.428,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1836.89,1836.02,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,190.48,190.474,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2974.34,2925.19,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,266.979,266.973,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3262.58,3213.53,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,305.486,305.479,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,290.618,286.766,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,217.624,217.617,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,273.484,272.369,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,201.022,201.016,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,989.795,983.79,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,953.613,953.604,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,31956.6,30055.2,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4236.29,4236.16,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,136051,86006,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30213.8,30213.7,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.15223e+06,1.15173e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,167135,167133,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18271e+06,1.18219e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,154165,154164,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42905.8,41871.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14804.3,14804,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,77513,77053,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,28517.1,28516.5,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:4 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2694.58,1936.97,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,694.401,694.399,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1236.29,1176.27,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,718.034,717.846,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,803.298,774.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.996,173.999,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2303.34,2300.97,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,201.409,201.402,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1955.98,1954.69,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,189.704,189.699,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3040.1,2995.02,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.016,270.008,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3471.01,3423.12,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,306.145,306.141,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,287.785,280.659,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,219.811,219.815,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,259.626,258.937,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.643,200.644,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1117.46,1101.67,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,975.645,975.607,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,33393.2,31864.6,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4203.56,4203.42,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137759,87750.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31840.4,31840.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21672e+06,1.21644e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,165010,165008,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18663e+06,1.18632e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,143109,143102,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,40579.8,40284.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14644.2,14643.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76055,75840.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29529.5,29529,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:1 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2214.69,1802.42,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,707.46,707.464,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1253.12,1205.35,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,731.641,731.644,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,783.132,755.561,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,180.129,180.128,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2094.73,2093.89,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,182.433,182.428,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1836.89,1836.02,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,190.48,190.474,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2974.34,2925.19,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,266.979,266.973,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3262.58,3213.53,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,305.486,305.479,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,290.618,286.766,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,217.624,217.617,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,273.484,272.369,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,201.022,201.016,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,989.795,983.79,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,953.613,953.604,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,31956.6,30055.2,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4236.29,4236.16,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,136051,86006,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30213.8,30213.7,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.15223e+06,1.15173e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,167135,167133,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18271e+06,1.18219e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,154165,154164,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42905.8,41871.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14804.3,14804,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,77513,77053,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,28517.1,28516.5,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:4 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2694.58,1936.97,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,694.401,694.399,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1236.29,1176.27,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,718.034,717.846,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,803.298,774.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.996,173.999,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2303.34,2300.97,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,201.409,201.402,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1955.98,1954.69,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,189.704,189.699,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3040.1,2995.02,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.016,270.008,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3471.01,3423.12,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,306.145,306.141,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,287.785,280.659,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,219.811,219.815,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,259.626,258.937,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.643,200.644,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1117.46,1101.67,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,975.645,975.607,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,33393.2,31864.6,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4203.56,4203.42,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137759,87750.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31840.4,31840.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21672e+06,1.21644e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,165010,165008,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18663e+06,1.18632e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,143109,143102,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,40579.8,40284.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14644.2,14643.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76055,75840.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29529.5,29529,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:1 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2214.69,1802.42,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,707.46,707.464,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1253.12,1205.35,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,731.641,731.644,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,783.132,755.561,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,180.129,180.128,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2094.73,2093.89,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,182.433,182.428,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1836.89,1836.02,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,190.48,190.474,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2974.34,2925.19,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,266.979,266.973,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3262.58,3213.53,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,305.486,305.479,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,290.618,286.766,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,217.624,217.617,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,273.484,272.369,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,201.022,201.016,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,989.795,983.79,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,953.613,953.604,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,31956.6,30055.2,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4236.29,4236.16,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,136051,86006,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30213.8,30213.7,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.15223e+06,1.15173e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,167135,167133,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18271e+06,1.18219e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,154165,154164,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42905.8,41871.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14804.3,14804,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,77513,77053,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,28517.1,28516.5,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2214.69,1802.42,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,707.46,707.464,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1253.12,1205.35,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,731.641,731.644,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,783.132,755.561,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,180.129,180.128,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2094.73,2093.89,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,182.433,182.428,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1836.89,1836.02,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,190.48,190.474,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2974.34,2925.19,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,266.979,266.973,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3262.58,3213.53,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,305.486,305.479,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,290.618,286.766,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,217.624,217.617,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,273.484,272.369,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,201.022,201.016,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,989.795,983.79,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,953.613,953.604,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,31956.6,30055.2,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4236.29,4236.16,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,136051,86006,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30213.8,30213.7,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.15223e+06,1.15173e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,167135,167133,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18271e+06,1.18219e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,154165,154164,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42905.8,41871.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14804.3,14804,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,77513,77053,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,28517.1,28516.5,ns,,,,,

alloc/size:10000/0/4096/iterations:200000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2214.69,1802.42,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,707.46,707.464,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1253.12,1205.35,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,731.641,731.644,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,783.132,755.561,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,180.129,180.128,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2094.73,2093.89,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,182.433,182.428,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1836.89,1836.02,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,190.48,190.474,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2974.34,2925.19,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,266.979,266.973,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3262.58,3213.53,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,305.486,305.479,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,290.618,286.766,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,217.624,217.617,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,273.484,272.369,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,201.022,201.016,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,989.795,983.79,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,953.613,953.604,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,31956.6,30055.2,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4236.29,4236.16,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,136051,86006,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30213.8,30213.7,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.15223e+06,1.15173e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,167135,167133,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18271e+06,1.18219e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,154165,154164,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42905.8,41871.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14804.3,14804,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,77513,77053,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,28517.1,28516.5,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2658.03,1827.24,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,698.246,698.245,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1220.42,1166.92,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,727.605,727.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,792.249,762.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.844,173.845,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2414.69,2411.63,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,208.606,208.6,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1971.29,1971.15,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.18,192.175,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3160.56,3109.3,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,265.381,265.372,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3558.06,3523.14,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,303.983,303.977,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,292.458,291.12,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,213.202,213.195,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,262.192,261.252,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,201.463,201.459,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1105.12,1091,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,973.144,973.13,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32383,30060.1,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4244.16,4244.07,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,139439,87930.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31771.1,31770.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.17987e+06,1.17945e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,163453,163449,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.20498e+06,1.20471e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,146181,146176,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41351.4,40627.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15092.3,15091.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,74197.3,73713.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,24895.7,24895.3,ns,,,,,

alloc/size:10000/100000/4096/iterations:200000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2214.69,1802.42,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,707.46,707.464,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1253.12,1205.35,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,731.641,731.644,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,783.132,755.561,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,180.129,180.128,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2094.73,2093.89,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,182.433,182.428,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1836.89,1836.02,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,190.48,190.474,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2974.34,2925.19,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,266.979,266.973,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3262.58,3213.53,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,305.486,305.479,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,290.618,286.766,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,217.624,217.617,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,273.484,272.369,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,201.022,201.016,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,989.795,983.79,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,953.613,953.604,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,31956.6,30055.2,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4236.29,4236.16,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,136051,86006,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30213.8,30213.7,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.15223e+06,1.15173e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,167135,167133,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18271e+06,1.18219e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,154165,154164,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42905.8,41871.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14804.3,14804,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,77513,77053,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,28517.1,28516.5,ns,,,,,

alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2658.03,1827.24,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,698.246,698.245,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1220.42,1166.92,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,727.605,727.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,792.249,762.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.844,173.845,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2414.69,2411.63,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,208.606,208.6,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1971.29,1971.15,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.18,192.175,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3160.56,3109.3,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,265.381,265.372,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3558.06,3523.14,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,303.983,303.977,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,292.458,291.12,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,213.202,213.195,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,262.192,261.252,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,201.463,201.459,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1105.12,1091,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,973.144,973.13,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32383,30060.1,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4244.16,4244.07,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,139439,87930.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31771.1,31770.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.17987e+06,1.17945e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,163453,163449,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.20498e+06,1.20471e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,146181,146176,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41351.4,40627.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15092.3,15091.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,74197.3,73713.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,24895.7,24895.3,ns,,,,,

alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2658.03,1827.24,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,698.246,698.245,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1220.42,1166.92,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,727.605,727.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,792.249,762.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.844,173.845,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2414.69,2411.63,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,208.606,208.6,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1971.29,1971.15,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.18,192.175,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3160.56,3109.3,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,265.381,265.372,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3558.06,3523.14,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,303.983,303.977,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,292.458,291.12,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,213.202,213.195,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,262.192,261.252,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,201.463,201.459,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1105.12,1091,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,973.144,973.13,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32383,30060.1,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4244.16,4244.07,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,139439,87930.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31771.1,31770.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.17987e+06,1.17945e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,163453,163449,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.20498e+06,1.20471e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,146181,146176,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41351.4,40627.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15092.3,15091.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,74197.3,73713.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,24895.7,24895.3,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2658.03,1827.24,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,698.246,698.245,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1220.42,1166.92,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,727.605,727.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,792.249,762.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.844,173.845,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2414.69,2411.63,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,208.606,208.6,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1971.29,1971.15,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.18,192.175,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3160.56,3109.3,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,265.381,265.372,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3558.06,3523.14,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,303.983,303.977,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,292.458,291.12,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,213.202,213.195,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,262.192,261.252,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,201.463,201.459,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1105.12,1091,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,973.144,973.13,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32383,30060.1,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4244.16,4244.07,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,139439,87930.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31771.1,31770.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.17987e+06,1.17945e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,163453,163449,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.20498e+06,1.20471e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,146181,146176,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41351.4,40627.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15092.3,15091.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,74197.3,73713.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,24895.7,24895.3,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2214.69,1802.42,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,707.46,707.464,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1253.12,1205.35,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,731.641,731.644,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,783.132,755.561,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,180.129,180.128,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2094.73,2093.89,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,182.433,182.428,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1836.89,1836.02,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,190.48,190.474,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2974.34,2925.19,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,266.979,266.973,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3262.58,3213.53,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,305.486,305.479,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,290.618,286.766,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,217.624,217.617,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,273.484,272.369,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,201.022,201.016,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,989.795,983.79,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,953.613,953.604,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,31956.6,30055.2,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4236.29,4236.16,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,136051,86006,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30213.8,30213.7,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.15223e+06,1.15173e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,167135,167133,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18271e+06,1.18219e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,154165,154164,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42905.8,41871.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14804.3,14804,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,77513,77053,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,28517.1,28516.5,ns,,,,,

multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2694.58,1936.97,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,694.401,694.399,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1236.29,1176.27,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,718.034,717.846,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,803.298,774.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.996,173.999,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2303.34,2300.97,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,201.409,201.402,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1955.98,1954.69,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,189.704,189.699,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3040.1,2995.02,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.016,270.008,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3471.01,3423.12,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,306.145,306.141,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,287.785,280.659,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,219.811,219.815,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,259.626,258.937,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.643,200.644,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1117.46,1101.67,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,975.645,975.607,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,33393.2,31864.6,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4203.56,4203.42,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137759,87750.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31840.4,31840.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21672e+06,1.21644e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,165010,165008,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18663e+06,1.18632e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,143109,143102,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,40579.8,40284.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14644.2,14643.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76055,75840.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29529.5,29529,ns,,,,,

multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2658.03,1827.24,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,698.246,698.245,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1220.42,1166.92,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,727.605,727.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,792.249,762.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.844,173.845,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2414.69,2411.63,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,208.606,208.6,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1971.29,1971.15,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.18,192.175,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3160.56,3109.3,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,265.381,265.372,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3558.06,3523.14,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,303.983,303.977,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,292.458,291.12,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,213.202,213.195,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,262.192,261.252,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,201.463,201.459,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1105.12,1091,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,973.144,973.13,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32383,30060.1,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4244.16,4244.07,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,139439,87930.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31771.1,31770.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.17987e+06,1.17945e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,163453,163449,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.20498e+06,1.20471e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,146181,146176,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41351.4,40627.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15092.3,15091.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,74197.3,73713.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,24895.7,24895.3,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2658.03,1827.24,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,698.246,698.245,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1220.42,1166.92,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,727.605,727.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,792.249,762.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.844,173.845,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2414.69,2411.63,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,208.606,208.6,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1971.29,1971.15,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.18,192.175,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3160.56,3109.3,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,265.381,265.372,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3558.06,3523.14,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,303.983,303.977,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,292.458,291.12,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,213.202,213.195,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,262.192,261.252,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,201.463,201.459,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1105.12,1091,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,973.144,973.13,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32383,30060.1,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4244.16,4244.07,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,139439,87930.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31771.1,31770.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.17987e+06,1.17945e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,163453,163449,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.20498e+06,1.20471e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,146181,146176,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41351.4,40627.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15092.3,15091.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,74197.3,73713.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,24895.7,24895.3,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2694.58,1936.97,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,694.401,694.399,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1236.29,1176.27,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,718.034,717.846,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,803.298,774.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.996,173.999,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2303.34,2300.97,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,201.409,201.402,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1955.98,1954.69,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,189.704,189.699,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3040.1,2995.02,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.016,270.008,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3471.01,3423.12,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,306.145,306.141,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,287.785,280.659,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,219.811,219.815,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,259.626,258.937,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.643,200.644,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1117.46,1101.67,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,975.645,975.607,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,33393.2,31864.6,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4203.56,4203.42,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137759,87750.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31840.4,31840.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21672e+06,1.21644e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,165010,165008,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18663e+06,1.18632e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,143109,143102,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,40579.8,40284.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14644.2,14643.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76055,75840.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29529.5,29529,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2694.58,1936.97,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,694.401,694.399,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1236.29,1176.27,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,718.034,717.846,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,803.298,774.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.996,173.999,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2303.34,2300.97,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,201.409,201.402,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1955.98,1954.69,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,189.704,189.699,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3040.1,2995.02,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.016,270.008,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3471.01,3423.12,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,306.145,306.141,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,287.785,280.659,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,219.811,219.815,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,259.626,258.937,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.643,200.644,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1117.46,1101.67,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,975.645,975.607,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,33393.2,31864.6,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4203.56,4203.42,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137759,87750.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31840.4,31840.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21672e+06,1.21644e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,165010,165008,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18663e+06,1.18632e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,143109,143102,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,40579.8,40284.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14644.2,14643.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76055,75840.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29529.5,29529,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2658.03,1827.24,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,698.246,698.245,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1220.42,1166.92,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,727.605,727.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,792.249,762.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.844,173.845,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2414.69,2411.63,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,208.606,208.6,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1971.29,1971.15,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.18,192.175,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3160.56,3109.3,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,265.381,265.372,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3558.06,3523.14,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,303.983,303.977,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,292.458,291.12,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,213.202,213.195,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,262.192,261.252,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,201.463,201.459,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1105.12,1091,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,973.144,973.13,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32383,30060.1,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4244.16,4244.07,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,139439,87930.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31771.1,31770.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.17987e+06,1.17945e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,163453,163449,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.20498e+06,1.20471e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,146181,146176,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41351.4,40627.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15092.3,15091.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,74197.3,73713.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,24895.7,24895.3,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2658.03,1827.24,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,698.246,698.245,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1220.42,1166.92,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,727.605,727.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,792.249,762.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.844,173.845,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2414.69,2411.63,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,208.606,208.6,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1971.29,1971.15,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,192.18,192.175,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3160.56,3109.3,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,265.381,265.372,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3558.06,3523.14,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,303.983,303.977,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,292.458,291.12,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,213.202,213.195,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,262.192,261.252,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,201.463,201.459,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1105.12,1091,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,973.144,973.13,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,32383,30060.1,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4244.16,4244.07,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,139439,87930.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31771.1,31770.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.17987e+06,1.17945e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,163453,163449,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.20498e+06,1.20471e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,146181,146176,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,41351.4,40627.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,15092.3,15091.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,74197.3,73713.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,24895.7,24895.3,ns,,,,,

multiple_malloc_free/size:10000/4096/iterations:2000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2214.69,1802.42,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,707.46,707.464,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1253.12,1205.35,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,731.641,731.644,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,783.132,755.561,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,180.129,180.128,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2094.73,2093.89,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,182.433,182.428,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1836.89,1836.02,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,190.48,190.474,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2974.34,2925.19,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,266.979,266.973,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3262.58,3213.53,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,305.486,305.479,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,290.618,286.766,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,217.624,217.617,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,273.484,272.369,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,201.022,201.016,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,989.795,983.79,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,953.613,953.604,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,31956.6,30055.2,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4236.29,4236.16,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,136051,86006,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30213.8,30213.7,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.15223e+06,1.15173e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,167135,167133,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18271e+06,1.18219e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,154165,154164,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42905.8,41871.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14804.3,14804,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,77513,77053,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,28517.1,28516.5,ns,,,,,

multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2694.58,1936.97,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,694.401,694.399,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1236.29,1176.27,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,718.034,717.846,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,803.298,774.602,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,173.996,173.999,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2303.34,2300.97,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,201.409,201.402,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1955.98,1954.69,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,189.704,189.699,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,3040.1,2995.02,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,270.016,270.008,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3471.01,3423.12,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,306.145,306.141,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,287.785,280.659,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,219.811,219.815,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,259.626,258.937,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,200.643,200.644,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,1117.46,1101.67,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,975.645,975.607,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,33393.2,31864.6,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4203.56,4203.42,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,137759,87750.8,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,31840.4,31840.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.21672e+06,1.21644e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,165010,165008,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18663e+06,1.18632e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,143109,143102,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,40579.8,40284.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14644.2,14643.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,76055,75840.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,29529.5,29529,ns,,,,,

multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2214.69,1802.42,ns,,,,,
"glibc/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,707.46,707.464,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1253.12,1205.35,ns,,,,,
"glibc/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,731.641,731.644,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,783.132,755.561,ns,,,,,
"glibc/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,180.129,180.128,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2094.73,2093.89,ns,,,,,
"os_provider/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,182.433,182.428,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,1836.89,1836.02,ns,,,,,
"os_provider/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,190.48,190.474,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,2974.34,2925.19,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,266.979,266.973,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,3262.58,3213.53,ns,,,,,
"proxy_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,305.486,305.479,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:4",800000,290.618,286.766,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/0/4096/iterations:200000/threads:1",200000,217.624,217.617,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:4",800000,273.484,272.369,ns,,,,,
"scalable_pool<os_provider>/alloc/size:10000/100000/4096/iterations:200000/threads:1",200000,201.022,201.016,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:4",800000,989.795,983.79,ns,,,,,
"scalable_pool<os_provider>/alloc/min size:10000/max size:0/granularity:8/65536/8/iterations:200000/threads:1",200000,953.613,953.604,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,31956.6,30055.2,ns,,,,,
"glibc/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,4236.29,4236.16,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,136051,86006,ns,,,,,
"glibc/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,30213.8,30213.7,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.15223e+06,1.15173e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,167135,167133,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,1.18271e+06,1.18219e+06,ns,,,,,
"os_provider/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,154165,154164,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:4",8000,42905.8,41871.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/size:10000/4096/iterations:2000/threads:1",2000,14804.3,14804,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:4",8000,77513,77053,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/min size:10000/max size:8/granularity:65536/8/iterations:2000/threads:1",2000,28517.1,28516.5,ns,,,,,

@igchor igchor merged commit ed09541 into main Jan 17, 2025
26 of 149 checks passed
@pbalcer pbalcer deleted the bench-build-umd branch January 20, 2025 09:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ci/cd Continuous integration/devliery
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants