Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[L0 v2] implement deferred kernel deallocation #2451

Open
wants to merge 3 commits into
base: main
Choose a base branch
from

Conversation

igchor
Copy link
Member

@igchor igchor commented Dec 11, 2024

No description provided.

@github-actions github-actions bot added conformance Conformance test suite issues. level-zero L0 adapter specific issues labels Dec 11, 2024
@igchor igchor force-pushed the deffered_kernel_keep_submitted branch 4 times, most recently from 6eba5c9 to c09361a Compare December 12, 2024 19:06
@oneapi-src oneapi-src deleted a comment from github-actions bot Dec 12, 2024
@oneapi-src oneapi-src deleted a comment from github-actions bot Dec 12, 2024
Copy link

Compute Benchmarks level_zero_v2 run (with params: --compare baseline-v2):
https://github.com/oneapi-src/unified-runtime/actions/runs/12303108388

@oneapi-src oneapi-src deleted a comment from github-actions bot Dec 12, 2024
@oneapi-src oneapi-src deleted a comment from github-actions bot Dec 12, 2024
@oneapi-src oneapi-src deleted a comment from github-actions bot Dec 12, 2024
@oneapi-src oneapi-src deleted a comment from github-actions bot Dec 12, 2024
Copy link

Compute Benchmarks level_zero_v2 run (--compare baseline-v2):
https://github.com/oneapi-src/unified-runtime/actions/runs/12303108388
Job status: success. Test status: success.

Summary

No diffs to calculate performance change

(result is better)

Performance change in benchmark groups

Relative perf in group api (11): cannot calculate
Benchmark This PR baseline baseline-v2 Relative perf Change -
api_overhead_benchmark_l0 SubmitKernel out of order 11.689 μs 15.106 μs 11.480000 μs
api_overhead_benchmark_sycl SubmitKernel out of order 24.790 μs 26.622 μs 21.754000 μs
api_overhead_benchmark_sycl SubmitKernel in order 22.172 μs 24.625 μs 22.118000 μs
api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024 1.850000 μs 2.438 μs 1.902 μs
api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024 2.144 μs 1.660000 μs 1.862 μs
api_overhead_benchmark_ur SubmitKernel out of order CPU count 95094.000 instr 101653.000 instr 94784.000000 instr
api_overhead_benchmark_ur SubmitKernel out of order 16.050 μs 18.566 μs 13.468000 μs
api_overhead_benchmark_ur SubmitKernel in order CPU count 95094.000 instr 106771.000 instr 94784.000000 instr
api_overhead_benchmark_ur SubmitKernel in order 16.559 μs 16.326000 μs 16.789 μs
api_overhead_benchmark_ur SubmitKernel in order with measure completion CPU count 96602.000000 instr - -
api_overhead_benchmark_ur SubmitKernel in order with measure completion 20.410000 μs - -
Relative perf in group memory (4): cannot calculate
Benchmark This PR baseline baseline-v2 Relative perf Change -
memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024 204.241 μs 251.714 μs 200.797000 μs
memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024 85.519000 μs 133.281 μs 86.622 μs
memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024 6.082 μs 5.537000 μs 6.070 μs
memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240 2.934 GB/s 3.178000 GB/s 2.967 GB/s
Relative perf in group miscellaneous (1): cannot calculate
Benchmark This PR baseline baseline-v2 Relative perf Change -
miscellaneous_benchmark_sycl VectorSum 803.250 bw GB/s 802.226 bw GB/s 729.022000 bw GB/s
Relative perf in group multithread (10): cannot calculate
Benchmark This PR baseline baseline-v2 Relative perf Change -
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:1 dstUSM:1 3580.598 μs 6962.444 μs 3554.575000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:1 dstUSM:1 8176.248000 μs 17543.043 μs 8376.730 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:1 dstUSM:1 26293.733000 μs 48014.641 μs 26351.892 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:1 dstUSM:1 1131.730 μs 2046.635 μs 1095.315000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:0 dstUSM:1 4525.514 μs 7363.204 μs 4489.265000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:0 dstUSM:1 6769.547 μs 8577.774 μs 6697.709000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:0 dstUSM:1 26547.173 μs 25823.197000 μs 27030.041 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:0 dstUSM:1 1124.053 μs 1182.072 μs 1112.265000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:1, allocSize:1024 srcUSM:0 dstUSM:1 without events 29470.166 μs 41125.560 μs 28897.706000 μs
multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:4, allocSize:1024 srcUSM:0 dstUSM:1 without events 114922.030 μs 110418.895000 μs 114981.529 μs
Relative perf in group Velocity-Bench (9): cannot calculate
Benchmark This PR baseline baseline-v2 Relative perf Change -
Velocity-Bench Hashtable 381.957 M keys/sec 379.484 M keys/sec 384.922230 M keys/sec
Velocity-Bench Bitcracker 35.228 s 35.201 s 35.145600 s
Velocity-Bench CudaSift 202.851000 ms 203.857 ms -
Velocity-Bench Easywave 234.000000 ms 241.000 ms 236.000 ms
Velocity-Bench QuickSilver 121.340 MMS/CTT 118.600 MMS/CTT 121.360000 MMS/CTT
Velocity-Bench Sobel Filter 514.208 ms 530.614 ms 513.216000 ms
Velocity-Bench dl-cifar 17.457400 s 24.483 s 17.690 s
Velocity-Bench dl-mnist 2.700 s 2.730 s 2.690000 s
Velocity-Bench svm - 0.135800 s -
Relative perf in group Runtime (8): cannot calculate
Benchmark This PR baseline baseline-v2 Relative perf Change -
Runtime_IndependentDAGTaskThroughput_SingleTask 175.898 ms 260.763 ms 173.622000 ms
Runtime_IndependentDAGTaskThroughput_BasicParallelFor 183.271000 ms 269.665 ms 187.100 ms
Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor 185.821 ms 272.768 ms 184.117000 ms
Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor 183.160 ms 273.458 ms 181.955000 ms
Runtime_DAGTaskThroughput_SingleTask 1289.630000 ms 1640.062 ms 1292.755 ms
Runtime_DAGTaskThroughput_BasicParallelFor 1371.226 ms 1708.507 ms 1369.885000 ms
Runtime_DAGTaskThroughput_HierarchicalParallelFor 1351.879000 ms 1719.016 ms 1352.726 ms
Runtime_DAGTaskThroughput_NDRangeParallelFor 1318.376000 ms 1651.204 ms 1336.026 ms
Relative perf in group MicroBench (14): cannot calculate
Benchmark This PR baseline baseline-v2 Relative perf Change -
MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous 4.454 ms 4.383 ms 4.313000 ms
MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous 4.474 ms 4.402000 ms 4.473 ms
MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous 4.342000 ms 4.564 ms 4.468 ms
MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous 3.647000 ms 4.582 ms 3.724 ms
MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous 618.094 ms 618.085 ms 618.046000 ms
MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous 618.095000 ms 618.138 ms 618.138 ms
MicroBench_HostDeviceBandwidth_1D_H2D_Strided 4.371000 ms 4.376 ms 4.439 ms
MicroBench_HostDeviceBandwidth_2D_H2D_Strided 4.584 ms 4.543 ms 4.491000 ms
MicroBench_HostDeviceBandwidth_3D_H2D_Strided 4.554 ms 4.492 ms 4.430000 ms
MicroBench_HostDeviceBandwidth_1D_D2H_Strided 3.754000 ms 4.614 ms 3.864 ms
MicroBench_HostDeviceBandwidth_2D_D2H_Strided 617.434 ms 617.458 ms 617.433000 ms
MicroBench_HostDeviceBandwidth_3D_D2H_Strided 617.448 ms 617.469 ms 617.384000 ms
MicroBench_LocalMem_int32_4096 29.913 ms 29.912000 ms 29.916 ms
MicroBench_LocalMem_fp32_4096 29.926 ms 29.820000 ms 29.887 ms
Relative perf in group Pattern (10): cannot calculate
Benchmark This PR baseline baseline-v2 Relative perf Change -
Pattern_Reduction_NDRange_int32 17.032 ms 16.581000 ms 16.662 ms
Pattern_Reduction_Hierarchical_int32 17.045 ms 16.994 ms 16.977000 ms
Pattern_SegmentedReduction_NDRange_int16 2.251000 ms 2.267 ms 2.254 ms
Pattern_SegmentedReduction_NDRange_int32 2.166000 ms 2.170 ms 2.168 ms
Pattern_SegmentedReduction_NDRange_int64 2.347 ms 2.343000 ms 2.345 ms
Pattern_SegmentedReduction_NDRange_fp32 2.163000 ms 2.173 ms 2.163 ms
Pattern_SegmentedReduction_Hierarchical_int16 11.800 ms 11.809 ms 11.796000 ms
Pattern_SegmentedReduction_Hierarchical_int32 11.600 ms 11.592000 ms 11.599 ms
Pattern_SegmentedReduction_Hierarchical_int64 11.782 ms 11.779000 ms 11.788 ms
Pattern_SegmentedReduction_Hierarchical_fp32 11.587000 ms 11.596 ms 11.602 ms
Relative perf in group ScalarProduct (6): cannot calculate
Benchmark This PR baseline baseline-v2 Relative perf Change -
ScalarProduct_NDRange_int32 3.959 ms 3.888000 ms 3.955 ms
ScalarProduct_NDRange_int64 5.605 ms 5.454000 ms 5.610 ms
ScalarProduct_NDRange_fp32 3.873 ms 3.752000 ms 3.889 ms
ScalarProduct_Hierarchical_int32 10.333 ms 10.332 ms 10.320000 ms
ScalarProduct_Hierarchical_int64 11.378 ms 11.327000 ms 11.360 ms
ScalarProduct_Hierarchical_fp32 10.006 ms 9.960000 ms 9.969 ms
Relative perf in group USM (7): cannot calculate
Benchmark This PR baseline baseline-v2 Relative perf Change -
USM_Allocation_latency_fp32_host 37.507 ms 37.576 ms 37.372000 ms
USM_Allocation_latency_fp32_shared 0.063000 ms 0.064 ms 0.067 ms
USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch 1.334000 ms 1.674 ms 1.335 ms
USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch 1.039 ms 1.057 ms 1.017000 ms
USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch 1.585 ms 1.817 ms 1.568000 ms
USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch 1.192 ms 1.196 ms 1.168000 ms
USM_Allocation_latency_fp32_device - 0.066000 ms -
Relative perf in group VectorAddition (3): cannot calculate
Benchmark This PR baseline baseline-v2 Relative perf Change -
VectorAddition_int32 1.684 ms 1.597000 ms 1.604 ms
VectorAddition_int64 3.125000 ms 3.128 ms 3.229 ms
VectorAddition_fp32 1.491000 ms 1.566 ms 1.659 ms
Relative perf in group Polybench (3): cannot calculate
Benchmark This PR baseline baseline-v2 Relative perf Change -
Polybench_2mm 1.226 ms 1.222000 ms 1.223 ms
Polybench_3mm 1.815 ms 1.728000 ms 1.803 ms
Polybench_Atax 6.845000 ms 6.865 ms 6.875 ms
Relative perf in group Kmeans (1): cannot calculate
Benchmark This PR baseline baseline-v2 Relative perf Change -
Kmeans_fp32 16.056000 ms 16.056 ms 16.057 ms
Relative perf in group LinearRegressionCoeff (1): cannot calculate
Benchmark This PR baseline baseline-v2 Relative perf Change -
LinearRegressionCoeff_fp32 687.151000 ms 844.076 ms 717.509 ms
Relative perf in group MolecularDynamics (1): cannot calculate
Benchmark This PR baseline baseline-v2 Relative perf Change -
MolecularDynamics 0.030 ms 0.031 ms 0.029000 ms
Relative perf in group llama.cpp (6): cannot calculate
Benchmark This PR baseline baseline-v2 Relative perf Change -
llama.cpp Prompt Processing Batched 128 865.683142 token/s 840.657 token/s 809.359 token/s
llama.cpp Text Generation Batched 128 65.184 token/s 62.642 token/s 65.422755 token/s
llama.cpp Prompt Processing Batched 256 939.670724 token/s 894.023 token/s 938.272 token/s
llama.cpp Text Generation Batched 256 65.131 token/s 62.623 token/s 65.353065 token/s
llama.cpp Prompt Processing Batched 512 488.138258 token/s 455.270 token/s 483.743 token/s
llama.cpp Text Generation Batched 512 65.271 token/s 62.636 token/s 65.393527 token/s
Relative perf in group alloc/max (20): cannot calculate
Benchmark This PR baseline baseline-v2 Relative perf Change -
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 glibc 2696.670 ns 2403.900000 ns -
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 glibc 699.753000 ns 707.017 ns -
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 glibc 1241.320000 ns 1271.710 ns -
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 glibc 752.731 ns 745.948000 ns -
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 glibc 883.178 ns 863.918000 ns -
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 glibc 174.332 ns 174.292000 ns -
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 os_provider 2164.190 ns 2162.040000 ns -
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 os_provider 187.841 ns 186.889000 ns -
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 os_provider 1942.740 ns 1897.660000 ns -
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 os_provider 192.017 ns 191.382000 ns -
alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4 proxy_pool<os_provider> 4682.930 ns 4305.000000 ns -
alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1 proxy_pool<os_provider> 270.297 ns 263.433000 ns -
alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4 proxy_pool<os_provider> 4634.400 ns 3865.570000 ns -
alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1 proxy_pool<os_provider> 302.373 ns 301.460000 ns -
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 scalable_pool<os_provider> 281.535 ns 269.766000 ns -
alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 scalable_pool<os_provider> 216.620 ns 214.796000 ns -
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 scalable_pool<os_provider> 245.713000 ns 261.526 ns -
alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 scalable_pool<os_provider> 209.871 ns 206.096000 ns -
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 scalable_pool<os_provider> 1010.610 ns 1002.850000 ns -
alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 scalable_pool<os_provider> 964.081000 ns 967.182 ns -
Relative perf in group multiple (12): cannot calculate
Benchmark This PR baseline baseline-v2 Relative perf Change -
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 glibc 32015.000000 ns 32646.200 ns -
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 glibc 4170.530 ns 4138.910000 ns -
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 glibc 138847.000 ns 137197.000000 ns -
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 glibc 30802.100000 ns 30826.900 ns -
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 proxy_pool<os_provider> 1187820.000000 ns 1189740.000 ns -
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 proxy_pool<os_provider> 164323.000 ns 158272.000000 ns -
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 os_provider 1190450.000 ns 1179140.000000 ns -
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 os_provider 144905.000 ns 140583.000000 ns -
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 scalable_pool<os_provider> 45963.500 ns 41640.200000 ns -
multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 scalable_pool<os_provider> 15157.000 ns 14614.500000 ns -
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 scalable_pool<os_provider> 72189.100 ns 70492.600000 ns -
multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 scalable_pool<os_provider> 25804.100 ns 25434.100000 ns -

Details

Benchmark details - environment, command, output...
api_overhead_benchmark_l0 SubmitKernel out of order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_l0 --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=l0 Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),11.783,11.689,4.70%,10.974,63.486,[CPU],[us]

api_overhead_benchmark_sycl SubmitKernel out of order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=sycl Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),24.782,24.790,3.87%,21.994,290.647,[CPU],[us]

api_overhead_benchmark_sycl SubmitKernel in order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=sycl Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),22.254,22.172,3.75%,21.243,229.448,[CPU],[us]

memory_benchmark_sycl QueueInOrderMemcpy from Device to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=QueueInOrderMemcpy --csv --noHeaders --iterations=10000 --IsCopyOnly=0 --sourcePlacement=Device --destinationPlacement=Device --size=1024 --count=100

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
QueueInOrderMemcpy(api=sycl IsCopyOnly=0 sourcePlacement=Device destinationPlacement=Device size=1KB count=100),204.103,204.241,1.66%,193.525,396.984,[CPU],[us]

memory_benchmark_sycl QueueInOrderMemcpy from Host to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=QueueInOrderMemcpy --csv --noHeaders --iterations=10000 --IsCopyOnly=0 --sourcePlacement=Host --destinationPlacement=Device --size=1024 --count=100

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
QueueInOrderMemcpy(api=sycl IsCopyOnly=0 sourcePlacement=Host destinationPlacement=Device size=1KB count=100),85.749,85.519,1.38%,83.628,138.701,[CPU],[us]

memory_benchmark_sycl QueueMemcpy from Device to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=QueueMemcpy --csv --noHeaders --iterations=10000 --sourcePlacement=Device --destinationPlacement=Device --size=1024

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
QueueMemcpy(api=sycl sourcePlacement=Device destinationPlacement=Device size=1KB),5.958,6.082,14.46%,4.801,58.365,[CPU],[us]

memory_benchmark_sycl StreamMemory, placement Device, type Triad, size 10240

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/memory_benchmark_sycl --test=StreamMemory --csv --noHeaders --iterations=10000 --type=Triad --size=10240 --memoryPlacement=Device --useEvents=0 --contents=Zeros --multiplier=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
StreamMemory(api=sycl type=Triad size=10KB useEvents=0 contents=Zeros memoryPlacement=Device multiplier=1),2.935,2.934,3.57%,0.445,3.363,[CPU],[GB/s]

api_overhead_benchmark_sycl ExecImmediateCopyQueue out of order from Device to Device, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=ExecImmediateCopyQueue --csv --noHeaders --iterations=100000 --ioq=0 --IsCopyOnly=1 --MeasureCompletionTime=0 --src=Device --dst=Device --size=1024

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
ExecImmediateCopyQueue(api=sycl IsCopyOnly=1 MeasureCompletionTime=0 src=Device dst=Device size=1KB ioq=0),1.871,1.850,7.70%,1.658,25.987,[CPU],[us]

api_overhead_benchmark_sycl ExecImmediateCopyQueue in order from Device to Host, size 1024

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_sycl --test=ExecImmediateCopyQueue --csv --noHeaders --iterations=100000 --ioq=1 --IsCopyOnly=1 --MeasureCompletionTime=0 --src=Host --dst=Host --size=1024

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
ExecImmediateCopyQueue(api=sycl IsCopyOnly=1 MeasureCompletionTime=0 src=Host dst=Host size=1KB ioq=1),2.145,2.144,5.59%,1.867,8.959,[CPU],[us]

miscellaneous_benchmark_sycl VectorSum

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/miscellaneous_benchmark_sycl --test=VectorSum --csv --noHeaders --iterations=1000 --numberOfElementsX=512 --numberOfElementsY=256 --numberOfElementsZ=256

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
VectorSum(api=sycl numberOfElementsX=512 numberOfElementsY=256 numberOfElementsZ=256),802.624,803.250,0.55%,770.304,811.539,[GPU],bw [GB/s]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=1 --NumOpsPerThread=400 --iterations=10 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=1 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),3583.756,3580.598,0.16%,3578.538,3594.178,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=8 --NumOpsPerThread=100 --iterations=10 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=100 NumThreads=8 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),8277.433,8176.248,6.24%,7282.922,8932.419,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=8 --NumOpsPerThread=400 --iterations=1000 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=8 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),26531.427,26293.733,5.59%,23249.827,33395.791,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:1 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=16 --NumOpsPerThread=10 --iterations=10000 --SrcUSM=1 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=10 NumThreads=16 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=1 DstUSM=1),1139.565,1131.730,6.53%,910.813,1824.168,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:1, allocSize:102400 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=1 --NumOpsPerThread=400 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=1 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),4541.978,4525.514,1.23%,4485.188,4685.898,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:100, numThreads:8, allocSize:102400 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=102400 --NumThreads=8 --NumOpsPerThread=100 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=100 NumThreads=8 AllocSize=102400 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),6799.789,6769.547,3.09%,6525.580,7089.452,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:400, numThreads:8, allocSize:1024 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=8 --NumOpsPerThread=400 --iterations=1000 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=400 NumThreads=8 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),26735.732,26547.173,5.59%,23670.383,33226.903,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:10, numThreads:16, allocSize:1024 srcUSM:0 dstUSM:1

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=1 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=16 --NumOpsPerThread=10 --iterations=10000 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=10 NumThreads=16 AllocSize=1024 MeasureCompletion=1 UseEvents=1 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),1133.465,1124.053,6.53%,946.337,1919.421,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:1, allocSize:1024 srcUSM:0 dstUSM:1 without events

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=0 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=1 --NumOpsPerThread=4096 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=4096 NumThreads=1 AllocSize=1024 MeasureCompletion=1 UseEvents=0 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),29515.166,29470.166,1.65%,28861.026,30392.291,[CPU],[us]

multithread_benchmark_ur MemcpyExecute opsPerThread:4096, numThreads:4, allocSize:1024 srcUSM:0 dstUSM:1 without events

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/multithread_benchmark_ur --test=MemcpyExecute --csv --noHeaders --Ioq=1 --UseEvents=0 --MeasureCompletion=1 --UseQueuePerThread=1 --AllocSize=1024 --NumThreads=4 --NumOpsPerThread=4096 --iterations=10 --SrcUSM=0 --DstUSM=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
MemcpyExecute(api=ur Ioq=1 NumOpsPerThread=4096 NumThreads=4 AllocSize=1024 MeasureCompletion=1 UseEvents=0 UseQueuePerThread=1 SrcUSM=0 DstUSM=1),114883.759,114922.030,0.62%,113831.496,116041.563,[CPU],[us]

api_overhead_benchmark_ur SubmitKernel out of order CPU count

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),95134.546,95094.000,0.36%,95094.000,101321.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),16.635,16.488,262.64%,15.000,13827.249,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel out of order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),95134.544,95094.000,0.36%,95094.000,101318.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=0 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),16.225,16.050,300.21%,14.602,15416.177,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel in order CPU count

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),95134.544,95094.000,0.36%,95094.000,101318.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),16.489,16.198,528.14%,15.267,27555.517,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel in order

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),95134.546,95094.000,0.36%,95094.000,101327.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=0),16.717,16.559,260.60%,15.605,13790.099,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel in order with measure completion CPU count

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=1 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=1),97150.553,96602.000,132.72%,96209.000,40869175.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=1),21.019,20.410,620.94%,19.013,41290.221,[CPU],time [us]

api_overhead_benchmark_ur SubmitKernel in order with measure completion

Environment Variables:

Command:

/home/pmdk/bench_workdir/compute-benchmarks-build/bin/api_overhead_benchmark_ur --test=SubmitKernel --csv --noHeaders --Ioq=1 --DiscardEvents=0 --MeasureCompletion=1 --iterations=100000 --Profiling=0 --NumKernels=10 --KernelExecTime=1

Output:

TestCase,Mean,Median,StdDev,Min,Max,Type
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=1),97150.553,96602.000,132.72%,96209.000,40869175.000,[CPU],hw instructions [count]
SubmitKernel(api=ur Profiling=0 Ioq=1 DiscardEvents=0 NumKernels=10 KernelExecTime=1 MeasureCompletion=1),21.019,20.410,620.94%,19.013,41290.221,[CPU],time [us]

Velocity-Bench Hashtable

Environment Variables:

Command:

/home/pmdk/bench_workdir/hashtable/hashtable_sycl --no-verify

Output:

hashtable - total time for whole calculation: 0.351395 s
381.956611 million keys/second

Velocity-Bench Bitcracker

Environment Variables:

Command:

/home/pmdk/bench_workdir/bitcracker/bitcracker -f /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt -d /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt -b 60000

Output:

---------> BitCracker: BitLocker password cracking tool <---------

==================================
Retrieving Info

Reading hash file "/home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/img_win8_user_hash.txt"

              Attack

================================================
Type of attack: User Password
Psw per thread: 1
max_num_pswd_per_read: 60000
Dictionary: /home/pmdk/bench_workdir/velocity-bench-repo/bitcracker/hash_pass/user_passwords_60000.txt
MAC Comparison (-m): Yes

Iter: 1, num passwords read: 60000
Kernel execution:
Effective passwords: 60000
Passwords Range:
npknpByH7N2m3OnLNH1X9DJxLrzIFWk
.....
dL_7uuf3QCz-c6K3xDu0

================================================
Bitcracker attack completed
Total passwords evaluated: 60000
Password not found!

time to subtract from total: 0.00408108 s
bitcracker - total time for whole calculation: 35.2284 s

Velocity-Bench CudaSift

Environment Variables:

Command:

/home/pmdk/bench_workdir/cudaSift/cudaSift

Output:

UNKN:

UNKN: ==================================================
UNKN: User input parameters:
UNKN: Trace: ../../inputData
UNKN: ==================================================
UNKN:

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1229 1264 33.3695% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1210 1247 32.8537% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1227 1261 33.3152% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1244 1280 33.7768% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1226 1258 33.2881% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1230 1261 33.3967% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1232 1266 33.451% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1220 1255 33.1252% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1226 1263 33.2881% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1232 1268 33.451% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1234 1269 33.5053% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1054 1255 28.618% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1115 1254 30.2742% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1225 1260 33.2609% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1248 1282 33.8854% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1229 1264 33.3695% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1226 1261 33.2881% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1224 1257 33.2338% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1233 1270 33.4781% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1225 1260 33.2609% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1022 1259 27.7491% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1106 1263 30.0299% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1228 1260 33.3424% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1082 1269 29.3782% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1224 1262 33.2338% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1230 1265 33.3967% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1128 1279 30.6272% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1228 1262 33.3424% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1116 1262 30.3014% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1225 1260 33.2609% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1116 1268 30.3014% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1090 1248 29.5954% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1226 1262 33.2881% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1223 1257 33.2066% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1226 1261 33.2881% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1056 1268 28.6723% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1094 1262 29.704% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1235 1269 33.5324% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1131 1268 30.7087% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1232 1269 33.451% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1225 1261 33.2609% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1225 1263 33.2609% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1239 1271 33.6411% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1226 1264 33.2881% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1236 1272 33.5596% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1234 1269 33.5053% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1232 1268 33.451% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1230 1264 33.3967% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1229 1262 33.3695% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Image size = (1920,1080)
Initializing data...
Number of original features: 3683 3933
Number of matching features: 1226 1261 33.2881% 1 2

Performing data verification
Data verification is SUCCESSFUL.

Avg workload time = 202.851 ms

Velocity-Bench Easywave

Environment Variables:

Command:

/home/pmdk/bench_workdir/easywave/easyWave_sycl -grid /home/pmdk/bench_workdir/data/easywave/examples/e2Asean.grd -source /home/pmdk/bench_workdir/data/easywave/examples/BengkuluSept2007.flt -time 120

Output:

MAIN: Starting SYCL main program
MAIN: Attempting to clean up previous eWave tsunami files
MAIN: Clean up completed
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.3.30049+10)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
MAIN: Program successfully completed

Velocity-Bench QuickSilver

Environment Variables:

QS_DEVICE=GPU

Command:

/home/pmdk/bench_workdir/QuickSilver/qs -i /home/pmdk/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp

Output:

Copyright (c) 2016
Lawrence Livermore National Security, LLC
All Rights Reserved
Quicksilver Version :
Quicksilver Git Hash :
MPI Version : 3.0
Number of MPI ranks : 1
Number of OpenMP Threads: 1
Number of OpenMP CPUs : 1

Loading params
Finished loading params
Simulation:
dt: 1e-08
fMax: 0.1
inputFile: /home/pmdk/bench_workdir/velocity-bench-repo/QuickSilver/Examples/AllScattering/scatteringOnly.inp
energySpectrum:
boundaryCondition: octant
loadBalance: 1
cycleTimers: 0
debugThreads: 0
lx: 100
ly: 100
lz: 100
nParticles: 10000000
batchSize: 0
nBatches: 10
nSteps: 10
nx: 10
ny: 10
nz: 10
seed: 1029384756
xDom: 0
yDom: 0
zDom: 0
eMax: 20
eMin: 1e-09
nGroups: 230
lowWeightCutoff: 0.001
bTally: 1
fTally: 1
cTally: 1
coralBenchmark: 0
crossSectionsOut:

Geometry:
material: sourceMaterial
shape: brick
xMax: 100
xMin: 0
yMax: 100
yMin: 0
zMax: 100
zMin: 0

Material:
name: sourceMaterial
mass: 1000
nIsotopes: 10
nReactions: 9
sourceRate: 1e+10
totalCrossSection: 0.1
absorptionCrossSection: flat
fissionCrossSection: flat
scatteringCrossSection: flat
absorptionCrossSectionRatio: 0
fissionCrossSectionRatio: 0
scatteringCrossSectionRatio: 1

CrossSection:
name: flat
A: 0
B: 0
C: 0
D: 0
E: 1
nuBar: 2.4
setting GPU
setting parameters
Building partition 0
Building partition 1
Building partition 2
Building partition 3
Building MC_Domain 0
Building MC_Domain 1
Building MC_Domain 2
Building MC_Domain 3
Starting Consistency Check
Finished Consistency Check
Finished initMesh
Started copyMaterialDatabase_device
Finished copyMaterialDatabase_device
Finished copyNuclearData_device
Finished copyDomainDevice
cycle start source rr split absorb scatter fission produce collisn escape census num_seg scalar_flux cycleInit cycleTracking cycleFinalize
0 0 1000000 0 9000000 0 18533189 0 0 18533189 1151780 8848220 55527935 1.854923e+09 3.709160e-01 5.919190e-01 0.000000e+00
1 8848220 1000000 0 151478 0 34281997 0 0 34281997 1664159 8335539 94633679 5.047651e+09 3.461740e-01 7.314940e-01 0.000000e+00
2 8335539 1000000 0 663717 0 34354432 0 0 34354432 1366771 8632485 95010375 7.705930e+09 3.449820e-01 7.480650e-01 0.000000e+00
3 8632485 1000000 0 367978 0 34302727 0 0 34302727 1242216 8758247 94953591 9.992076e+09 3.768180e-01 8.001410e-01 0.000000e+00
4 8758247 1000000 0 242076 0 34141236 0 0 34141236 1168452 8831871 94599337 1.199834e+10 3.445430e-01 7.731980e-01 0.000000e+00
5 8831871 1000000 0 168070 0 33948724 0 0 33948724 1121156 8878785 94148236 1.377636e+10 3.432180e-01 7.524900e-01 0.000000e+00
6 8878785 1000000 0 120572 0 33760567 0 0 33760567 1089103 8910254 93689264 1.535668e+10 3.452420e-01 7.489790e-01 0.000000e+00
7 8910254 1000000 0 89810 0 33552179 0 0 33552179 1065203 8934861 93216931 1.676993e+10 3.428500e-01 7.677120e-01 0.000000e+00
8 8934861 1000000 0 65491 0 33384605 0 0 33384605 1047720 8952632 92768273 1.804559e+10 3.449690e-01 7.671870e-01 0.000000e+00
9 8952632 1000000 0 47165 0 33198494 0 0 33198494 1033968 8965829 92324678 1.920208e+10 3.411250e-01 7.434150e-01 0.000000e+00

Timer Cumulative Cumulative Cumulative Cumulative Cumulative Cumulative
Name number microSecs microSecs microSecs microSecs Efficiency
of calls min avg max stddev Rating
main 1 1.093e+07 1.093e+07 1.093e+07 0.000e+00 100.00
cycleInit 10 3.501e+06 3.501e+06 3.501e+06 0.000e+00 100.00
cycleTracking 10 7.425e+06 7.425e+06 7.425e+06 0.000e+00 100.00
cycleTracking_Kernel 104 4.930e+06 4.930e+06 4.930e+06 0.000e+00 100.00
cycleTracking_MPI 117 1.945e+05 1.945e+05 1.945e+05 0.000e+00 100.00
cycleTracking_Test_Done 0 0.000e+00 0.000e+00 0.000e+00 0.000e+00 0.00
cycleFinalize 20 3.960e+02 3.960e+02 3.960e+02 0.000e+00 100.00
Figure Of Merit 121.34 [Num Mega Segments / Cycle Tracking Time]

Velocity-Bench Sobel Filter

Environment Variables:

OPENCV_IO_MAX_IMAGE_PIXELS=1677721600

Command:

/home/pmdk/bench_workdir/sobel_filter/sobel_filter -i /home/pmdk/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png -n 5

Output:

SYMN: Welcome to the SYCL version of Sobel filter workload.
SYMN: Input image file: /home/pmdk/bench_workdir/data/sobel_filter/sobel_filter_data/silverfalls_32Kx32K.png
SYMN: Launching SYCL kernel with # of iterations: 5
time to subtract from total: 7.42915 s
sobelfilter - total time for whole calculation: 0.514208 s

Velocity-Bench dl-cifar

Environment Variables:

Command:

/home/pmdk/bench_workdir/dl-cifar/dl-cifar_sycl

Output:

	Welcome to DL-CIFAR workload: SYCL version.

=======================================================================
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.3.30049+10)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.3.30049+10)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero

WL PARAMS:

WL PARAMS: ==================================================
WL PARAMS: User input parameters:
WL PARAMS: Trace: notrace
WL PARAMS: DL NW size type: WORKLOAD_DEFAULT_SIZE
WL PARAMS: ==================================================
WL PARAMS:

dataFileReadTimer->getTotalOpTime(): 9.4e-05 s
dl-cifar - total time for whole calculation: 17.4574 s

Velocity-Bench dl-mnist

Environment Variables:

NEOReadDebugKeys=1
DisableScratchPages=0

Command:

/home/pmdk/bench_workdir/dl-mnist/dl-mnist-sycl -conv_algo ONEDNN_AUTO

Output:

	Welcome to DL-MNIST workload: SYCL version.

=======================================================================
SYCL: SYCL Queue initialization successful
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.3.30049+10)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero
SYCL: Using SYCL device : Intel(R) Data Center GPU Max 1100 (Driver version 1.3.30049+10)
SYCL: Platform : Intel(R) oneAPI Unified Runtime over Level-Zero

WL PARAMS:

WL PARAMS: ==================================================
WL PARAMS: User input parameters:
WL PARAMS: Trace: notrace
WL PARAMS: Tensor management policy: per_layer
WL PARAMS: Convolution algorithm: ONEDNN_AUTO
WL PARAMS: Dataset reader format: NCHW
WL PARAMS: Dry run: YES
WL PARAMS: OneDNN Conv PD memory format: ONEDNN_CONVPD_ANY
WL PARAMS: No of iterations for inference: 400
WL PARAMS: ==================================================
WL PARAMS:

dl-mnist - total time for whole calculation: 2.7 s

Runtime_IndependentDAGTaskThroughput_SingleTask

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_SingleTask', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.191001', '0.175898', '0.175588', '0.175588 0.175898 0.221518', '0.026429', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_IndependentDAGTaskThroughput_BasicParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_BasicParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.187392', '0.183271', '0.183236', '0.183236 0.183271 0.195668', '0.007167', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_HierarchicalParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.187276', '0.185821', '0.184475', '0.184475 0.185821 0.191532', '0.003747', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_independent --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/IndependentDAGTaskThroughput_multi.csv --size=32768

Output:

['Runtime_IndependentDAGTaskThroughput_NDRangeParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '32768', '0.184450', '0.183160', '0.182597', '0.182597 0.183160 0.187593', '0.002737', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_SingleTask

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_SingleTask', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.289329', '1.289630', '1.285739', '1.285739 1.289630 1.292618', '0.003449', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_BasicParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_BasicParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.371626', '1.371226', '1.370737', '1.370737 1.371226 1.372915', '0.001143', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_HierarchicalParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_HierarchicalParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.352070', '1.351879', '1.351712', '1.351712 1.351879 1.352620', '0.000483', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Runtime_DAGTaskThroughput_NDRangeParallelFor

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/dag_task_throughput_sequential --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/DAGTaskThroughput_multi.csv --size=327680

Output:

['Runtime_DAGTaskThroughput_NDRangeParallelFor', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '327680', '1.319348', '1.318376', '1.317586', '1.317586 1.318376 1.322081', '0.002400', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_H2D_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004893', '0.004454', '0.004370', '0.004370 0.004454 0.005856', '0.000834', '28.601023', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_H2D_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004467', '0.004474', '0.004441', '0.004441 0.004474 0.004485', '0.000023', '28.145077', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_H2D_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004389', '0.004342', '0.004320', '0.004320 0.004342 0.004505', '0.000101', '28.936545', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_D2H_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.003645', '0.003647', '0.003586', '0.003586 0.003647 0.003701', '0.000058', '34.861329', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_D2H_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.618072', '0.618094', '0.618014', '0.618014 0.618094 0.618107', '0.000050', '0.202261', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_D2H_Contiguous', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.618093', '0.618095', '0.617986', '0.617986 0.618095 0.618198', '0.000106', '0.202270', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_1D_H2D_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_H2D_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004325', '0.004371', '0.004233', '0.004233 0.004371 0.004372', '0.000080', '29.529501', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_H2D_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_H2D_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004600', '0.004584', '0.004577', '0.004577 0.004584 0.004639', '0.000034', '27.311653', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_H2D_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_H2D_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.004551', '0.004554', '0.004540', '0.004540 0.004554 0.004558', '0.000009', '27.531554', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_1D_D2H_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_1D_D2H_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.003798', '0.003754', '0.003745', '0.003745 0.003754 0.003896', '0.000085', '33.376759', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_2D_D2H_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_2D_D2H_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.617444', '0.617434', '0.617429', '0.617429 0.617434 0.617469', '0.000022', '0.202453', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_HostDeviceBandwidth_3D_D2H_Strided

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/host_device_bandwidth --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/HostDeviceBandwidth_multi.csv --size=512

Output:

['MicroBench_HostDeviceBandwidth_3D_D2H_Strided', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.617466', '0.617448', '0.617437', '0.617437 0.617448 0.617512', '0.000040', '0.202450', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '0.125000']

MicroBench_LocalMem_int32_4096

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/local_mem --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/LocalMem_multi.csv --size=10240000

Output:

['MicroBench_LocalMem_int32_4096', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.029880', '0.029913', '0.029784', '0.029784 0.029913 0.029943', '0.000084', '10475.355869', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '312.000000']

MicroBench_LocalMem_fp32_4096

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/local_mem --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/LocalMem_multi.csv --size=10240000

Output:

['MicroBench_LocalMem_fp32_4096', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.029891', '0.029926', '0.029799', '0.029799 0.029926 0.029949', '0.000081', '10470.106085', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '312.000000']

Pattern_Reduction_NDRange_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_Reduction_multi.csv --size=10240000

Output:

['Pattern_Reduction_NDRange_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.016957', '0.017032', '0.016790', '0.016790 0.017032 0.017047', '0.000144', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_Reduction_Hierarchical_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/reduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_Reduction_multi.csv --size=10240000

Output:

['Pattern_Reduction_Hierarchical_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '10240000', '0.017037', '0.017045', '0.016747', '0.016747 0.017045 0.017318', '0.000285', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_NDRange_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_NDRange_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.003919', '0.003959', '0.003798', '0.003798 0.003959 0.004001', '0.000107', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_NDRange_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_NDRange_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.005609', '0.005605', '0.005501', '0.005501 0.005605 0.005722', '0.000110', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_NDRange_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_NDRange_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.003885', '0.003873', '0.003795', '0.003795 0.003873 0.003986', '0.000096', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_Hierarchical_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_Hierarchical_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.010350', '0.010333', '0.010319', '0.010319 0.010333 0.010397', '0.000042', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_Hierarchical_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_Hierarchical_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011393', '0.011378', '0.011353', '0.011353 0.011378 0.011448', '0.000049', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

ScalarProduct_Hierarchical_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/scalar_prod --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/ScalarProduct_multi.csv --size=102400000

Output:

['ScalarProduct_Hierarchical_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.010007', '0.010006', '0.009990', '0.009990 0.010006 0.010026', '0.000018', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_int16

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_int16', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002254', '0.002251', '0.002249', '0.002249 0.002251 0.002262', '0.000007', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002168', '0.002166', '0.002165', '0.002165 0.002166 0.002172', '0.000004', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002346', '0.002347', '0.002341', '0.002341 0.002347 0.002350', '0.000005', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_NDRange_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_NDRange_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.002165', '0.002163', '0.002160', '0.002160 0.002163 0.002171', '0.000006', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_int16

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_int16', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011800', '0.011800', '0.011798', '0.011798 0.011800 0.011804', '0.000003', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011601', '0.011600', '0.011597', '0.011597 0.011600 0.011607', '0.000005', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011783', '0.011782', '0.011756', '0.011756 0.011782 0.011809', '0.000027', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Pattern_SegmentedReduction_Hierarchical_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/segmentedreduction --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Pattern_SegmentedReduction_multi.csv --size=102400000

Output:

['Pattern_SegmentedReduction_Hierarchical_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.011598', '0.011587', '0.011578', '0.011578 0.011587 0.011629', '0.000027', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Allocation_latency_fp32_host

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Allocation_latency_multi.csv --size=1024000000

Output:

['USM_Allocation_latency_fp32_host', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '1024000000', '0.037497', '0.037507', '0.037328', '0.037328 0.037507 0.037655', '0.000164', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Allocation_latency_fp32_shared

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_allocation_latency --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Allocation_latency_multi.csv --size=1024000000

Output:

['USM_Allocation_latency_fp32_shared', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '1024000000', '0.000063', '0.000063', '0.000055', '0.000055 0.000063 0.000071', '0.000008', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_device_1:1mix_with_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.001831', '0.001334', '0.001333', '0.001333 0.001334 0.002827', '0.000863', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_host_1:1mix_with_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.001041', '0.001039', '0.001030', '0.001030 0.001039 0.001056', '0.000014', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_device_1:1mix_no_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.001595', '0.001585', '0.001580', '0.001580 0.001585 0.001619', '0.000021', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/usm_instr_mix --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/USM_Instr_Mix_multi.csv --size=8192

Output:

['USM_Instr_Mix_fp32_host_1:1mix_no_init_no_prefetch', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.001193', '0.001192', '0.001190', '0.001190 0.001192 0.001196', '0.000003', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

VectorAddition_int32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/VectorAddition_multi.csv --size=102400000

Output:

['VectorAddition_int32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.001622', '0.001684', '0.001494', '0.001494 0.001684 0.001688', '0.000111', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

VectorAddition_int64

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/VectorAddition_multi.csv --size=102400000

Output:

['VectorAddition_int64', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.003153', '0.003125', '0.003118', '0.003118 0.003125 0.003214', '0.000054', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

VectorAddition_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/vec_add --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/VectorAddition_multi.csv --size=102400000

Output:

['VectorAddition_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '102400000', '0.001550', '0.001491', '0.001487', '0.001487 0.001491 0.001671', '0.000105', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Polybench_2mm

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/2mm --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/2mm.csv --size=512

Output:

['Polybench_2mm', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.001224', '0.001226', '0.001213', '0.001213 0.001226 0.001234', '0.000010', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Polybench_3mm

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/3mm --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/3mm.csv --size=512

Output:

['Polybench_3mm', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '512', '0.001814', '0.001815', '0.001809', '0.001809 0.001815 0.001816', '0.000004', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Polybench_Atax

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/atax --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Atax.csv --size=8192

Output:

['Polybench_Atax', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8192', '0.006804', '0.006845', '0.006709', '0.006709 0.006845 0.006858', '0.000083', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

Kmeans_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/kmeans --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/Kmeans.csv --size=700000000

Output:

['Kmeans_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '700000000', '0.016060', '0.016056', '0.016054', '0.016054 0.016056 0.016071', '0.000009', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

LinearRegressionCoeff_fp32

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/lin_reg_coeff --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/LinearRegressionCoeff.csv --size=1638400000

Output:

['LinearRegressionCoeff_fp32', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '1638400000', '0.695345', '0.687151', '0.687065', '0.687065 0.687151 0.711818', '0.014266', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

MolecularDynamics

Environment Variables:

Command:

/home/pmdk/bench_workdir/sycl-bench-build/mol_dyn --warmup-run --num-runs=3 --output=/home/pmdk/bench_workdir/MolecularDynamics.csv --size=8196

Output:

['MolecularDynamics', 'PASS', 'Intel(R) Data Center GPU Max 1100', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', '256', '8196', '0.000037', '0.000030', '0.000027', '0.000027 0.000030 0.000055', '0.000015', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'LLVM (Intel DPC++)', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']

llama.cpp Prompt Processing Batched 128

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2024-12-12T19:41:12Z","592033487","21119981","865.683142","30.388480"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2024-12-12T19:41:16Z","1963680633","1590422","65.183750","0.052723"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2024-12-12T19:41:25Z","547564288","9302907","935.261367","15.549792"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2024-12-12T19:41:29Z","1965257380","806441","65.131427","0.026716"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2024-12-12T19:41:39Z","1046996305","2077078","489.019501","0.969627"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2024-12-12T19:41:45Z","1961065466","628720","65.270646","0.020852"

llama.cpp Text Generation Batched 128

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2024-12-12T19:41:12Z","592033487","21119981","865.683142","30.388480"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2024-12-12T19:41:16Z","1963680633","1590422","65.183750","0.052723"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2024-12-12T19:41:25Z","547564288","9302907","935.261367","15.549792"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2024-12-12T19:41:29Z","1965257380","806441","65.131427","0.026716"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2024-12-12T19:41:39Z","1046996305","2077078","489.019501","0.969627"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2024-12-12T19:41:45Z","1961065466","628720","65.270646","0.020852"

llama.cpp Prompt Processing Batched 256

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2024-12-12T19:40:28Z","581680926","2479054","880.220421","3.746234"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2024-12-12T19:40:31Z","1974965132","1932971","64.811320","0.063345"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2024-12-12T19:40:41Z","544871781","211132","939.670724","0.363018"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2024-12-12T19:40:45Z","1976344906","1210674","64.766043","0.039643"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2024-12-12T19:40:55Z","1060000815","10535058","483.056671","4.801350"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2024-12-12T19:41:01Z","1972776623","659783","64.883175","0.021628"

llama.cpp Text Generation Batched 256

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2024-12-12T19:41:12Z","592033487","21119981","865.683142","30.388480"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2024-12-12T19:41:16Z","1963680633","1590422","65.183750","0.052723"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2024-12-12T19:41:25Z","547564288","9302907","935.261367","15.549792"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2024-12-12T19:41:29Z","1965257380","806441","65.131427","0.026716"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2024-12-12T19:41:39Z","1046996305","2077078","489.019501","0.969627"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2024-12-12T19:41:45Z","1961065466","628720","65.270646","0.020852"

llama.cpp Prompt Processing Batched 512

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2024-12-12T19:39:44Z","601198475","14545220","852.027952","20.448075"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2024-12-12T19:39:47Z","1958430370","3976888","65.358679","0.132395"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2024-12-12T19:39:57Z","541290745","4243832","945.933543","7.385219"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2024-12-12T19:40:01Z","1956054278","2477208","65.437941","0.082818"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2024-12-12T19:40:10Z","1048914725","6416160","488.138258","3.001569"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2024-12-12T19:40:17Z","1956941541","1042493","65.408203","0.034814"

llama.cpp Text Generation Batched 512

Environment Variables:

Command:

/home/pmdk/bench_workdir/llamacpp-build/bin/llama-bench --output csv -n 128 -p 512 -b 128,256,512 --numa isolate -t 56 --model /home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf

Output:

build_commit,build_number,cuda,vulkan,kompute,metal,sycl,rpc,gpu_blas,blas,cpu_info,gpu_info,model_filename,model_type,model_size,model_n_params,n_batch,n_ubatch,n_threads,cpu_mask,cpu_strict,poll,type_k,type_v,n_gpu_layers,split_mode,main_gpu,no_kv_offload,flash_attn,tensor_split,use_mmap,embeddings,n_prompt,n_gen,test_time,avg_ns,stddev_ns,avg_ts,stddev_ts
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2024-12-12T19:41:12Z","592033487","21119981","865.683142","30.388480"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","128","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2024-12-12T19:41:16Z","1963680633","1590422","65.183750","0.052723"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2024-12-12T19:41:25Z","547564288","9302907","935.261367","15.549792"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","256","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2024-12-12T19:41:29Z","1965257380","806441","65.131427","0.026716"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","512","0","2024-12-12T19:41:39Z","1046996305","2077078","489.019501","0.969627"
"1ee9eea0","4073","0","0","0","0","1","0","1","1","INTEL(R) XEON(R) PLATINUM 8580","Intel(R) Data Center GPU Max 1100","/home/pmdk/bench_workdir/models/Phi-3-mini-4k-instruct-q4.gguf","phi3 3B Q4_K - Medium","2392493568","3821079552","512","512","56","0x0","0","50","f16","f16","99","layer","0","0","0","0.00","1","0","0","128","2024-12-12T19:41:45Z","1961065466","628720","65.270646","0.020852"

alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2696.67,1909.93,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,699.271,699.268,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1252.78,1168.32,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,754.049,754.047,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,872.254,842.95,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,176.674,176.674,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2209.08,2207.89,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,184.237,184.232,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2285.17,2285.01,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,194.028,194.021,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4207.38,4201.05,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,270.297,270.242,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3283.36,3276,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,303.922,303.913,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,293.118,292.477,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,218.538,218.536,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,267.954,267.077,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,195.443,195.437,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1010.61,992.626,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,964.081,963.813,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32164.2,30572.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4136.25,4136.12,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,140694,89099.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30272.9,30269.3,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.20645e+06,1.20563e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,160597,160590,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.25634e+06,1.25589e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,140579,140578,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,45963.5,45509.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15021,15020.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,74039,73145,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,26127.6,26127.1,ns,,,,,

alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2594.7,1787.57,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,699.753,699.753,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1237.74,1178.73,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,736.786,736.783,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,878.432,830.573,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,179.651,179.645,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2292.71,2291.77,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,189.432,189.426,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1737.65,1737.14,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,192.017,192.013,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4682.93,4678.35,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,252.383,252.375,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3602.38,3596.02,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,299.742,299.737,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,259.004,257.146,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,208.292,208.292,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,245.713,239.679,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,209.871,209.865,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1056.34,1035.26,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,999.33,999.323,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32015,30470.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4112.96,4112.81,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,139313,89132.7,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31815.4,31815.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.18782e+06,1.18645e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,163354,163349,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.14193e+06,1.14079e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,146207,146206,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,43078.2,42928,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15109.6,15109.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70968.6,70956,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25985.3,25984.8,ns,,,,,

alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2809.48,1842.45,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,703.323,703.32,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1241.32,1176.5,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,734.362,734.358,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,917.409,803.232,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,172.501,172.502,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2164.19,2163.31,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,187.841,187.836,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2128.58,2127.94,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,191.484,191.477,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,5849.31,5840.14,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,276.333,276.33,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,4634.4,4630.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,302.373,302.366,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,281.535,280.432,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,211.141,211.139,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,244.009,243.522,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,225.308,225.3,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,909.149,905.687,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,955.897,955.886,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,33357.2,31455.5,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4170.53,4170.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135936,87508.8,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31761.2,31760.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.21671e+06,1.21549e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,164323,164321,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.49171e+06,1.49114e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,146789,146787,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,46552.8,46132.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,14494.2,14493.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70562.2,70008.3,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,27112.6,27112.2,ns,,,,,

alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2632.92,1890.58,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,715.641,715.643,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1314.26,1183.09,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,752.731,752.729,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,883.178,824.458,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,168.186,168.182,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2005.6,2004.89,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,184.862,184.857,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1794.17,1793.24,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,187.749,187.744,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3869.14,3857.25,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,254.446,254.397,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,4925.7,4917.13,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,293.168,293.159,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,247.157,246.041,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,224.053,224.047,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,280.095,255.033,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,207.537,207.53,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,854.205,846.075,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,949.007,948.998,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31029.7,29348.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4088.81,4088.65,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,138847,88541.6,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,28233.2,28232.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.17476e+06,1.17438e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,163656,163653,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19045e+06,1.18955e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,144905,144904,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,43394,42834.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15531.6,15531.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,73869.6,73151.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25781.2,25780.7,ns,,,,,

alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2632.92,1890.58,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,715.641,715.643,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1314.26,1183.09,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,752.731,752.729,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,883.178,824.458,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,168.186,168.182,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2005.6,2004.89,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,184.862,184.857,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1794.17,1793.24,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,187.749,187.744,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3869.14,3857.25,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,254.446,254.397,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,4925.7,4917.13,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,293.168,293.159,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,247.157,246.041,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,224.053,224.047,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,280.095,255.033,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,207.537,207.53,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,854.205,846.075,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,949.007,948.998,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31029.7,29348.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4088.81,4088.65,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,138847,88541.6,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,28233.2,28232.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.17476e+06,1.17438e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,163656,163653,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19045e+06,1.18955e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,144905,144904,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,43394,42834.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15531.6,15531.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,73869.6,73151.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25781.2,25780.7,ns,,,,,

alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2623.44,1864.54,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,718.033,717.998,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1220.35,1175.53,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,754.215,754.158,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,841.419,813.58,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,174.332,174.331,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2147.61,2147.58,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,188.325,188.32,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1942.74,1942.16,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,192.715,192.709,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4986.6,4980.93,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,258.077,258.071,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,4865.86,4860.02,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,307.718,307.713,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,266.863,264.644,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,216.62,216.612,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,232.556,231.988,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,193.671,193.663,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1107.53,1098.31,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,982.148,982.13,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31178.5,29493.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4172.29,4172.19,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,137976,87314.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,29566.2,29565.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.173e+06,1.17117e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,164839,164837,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.25077e+06,1.25035e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,144517,144512,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,47015.9,46457.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15157,15156.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72189.1,72170.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25200.5,25200.1,ns,,,,,

alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2809.48,1842.45,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,703.323,703.32,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1241.32,1176.5,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,734.362,734.358,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,917.409,803.232,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,172.501,172.502,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2164.19,2163.31,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,187.841,187.836,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2128.58,2127.94,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,191.484,191.477,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,5849.31,5840.14,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,276.333,276.33,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,4634.4,4630.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,302.373,302.366,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,281.535,280.432,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,211.141,211.139,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,244.009,243.522,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,225.308,225.3,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,909.149,905.687,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,955.897,955.886,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,33357.2,31455.5,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4170.53,4170.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135936,87508.8,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31761.2,31760.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.21671e+06,1.21549e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,164323,164321,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.49171e+06,1.49114e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,146789,146787,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,46552.8,46132.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,14494.2,14493.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70562.2,70008.3,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,27112.6,27112.2,ns,,,,,

alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2809.48,1842.45,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,703.323,703.32,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1241.32,1176.5,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,734.362,734.358,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,917.409,803.232,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,172.501,172.502,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2164.19,2163.31,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,187.841,187.836,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2128.58,2127.94,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,191.484,191.477,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,5849.31,5840.14,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,276.333,276.33,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,4634.4,4630.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,302.373,302.366,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,281.535,280.432,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,211.141,211.139,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,244.009,243.522,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,225.308,225.3,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,909.149,905.687,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,955.897,955.886,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,33357.2,31455.5,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4170.53,4170.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135936,87508.8,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31761.2,31760.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.21671e+06,1.21549e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,164323,164321,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.49171e+06,1.49114e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,146789,146787,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,46552.8,46132.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,14494.2,14493.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70562.2,70008.3,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,27112.6,27112.2,ns,,,,,

alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2623.44,1864.54,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,718.033,717.998,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1220.35,1175.53,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,754.215,754.158,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,841.419,813.58,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,174.332,174.331,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2147.61,2147.58,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,188.325,188.32,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1942.74,1942.16,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,192.715,192.709,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4986.6,4980.93,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,258.077,258.071,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,4865.86,4860.02,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,307.718,307.713,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,266.863,264.644,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,216.62,216.612,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,232.556,231.988,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,193.671,193.663,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1107.53,1098.31,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,982.148,982.13,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31178.5,29493.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4172.29,4172.19,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,137976,87314.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,29566.2,29565.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.173e+06,1.17117e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,164839,164837,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.25077e+06,1.25035e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,144517,144512,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,47015.9,46457.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15157,15156.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72189.1,72170.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25200.5,25200.1,ns,,,,,

alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2594.7,1787.57,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,699.753,699.753,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1237.74,1178.73,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,736.786,736.783,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,878.432,830.573,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,179.651,179.645,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2292.71,2291.77,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,189.432,189.426,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1737.65,1737.14,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,192.017,192.013,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4682.93,4678.35,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,252.383,252.375,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3602.38,3596.02,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,299.742,299.737,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,259.004,257.146,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,208.292,208.292,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,245.713,239.679,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,209.871,209.865,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1056.34,1035.26,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,999.33,999.323,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32015,30470.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4112.96,4112.81,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,139313,89132.7,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31815.4,31815.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.18782e+06,1.18645e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,163354,163349,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.14193e+06,1.14079e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,146207,146206,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,43078.2,42928,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15109.6,15109.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70968.6,70956,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25985.3,25984.8,ns,,,,,

alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2594.7,1787.57,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,699.753,699.753,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1237.74,1178.73,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,736.786,736.783,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,878.432,830.573,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,179.651,179.645,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2292.71,2291.77,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,189.432,189.426,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1737.65,1737.14,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,192.017,192.013,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4682.93,4678.35,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,252.383,252.375,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3602.38,3596.02,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,299.742,299.737,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,259.004,257.146,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,208.292,208.292,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,245.713,239.679,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,209.871,209.865,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1056.34,1035.26,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,999.33,999.323,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32015,30470.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4112.96,4112.81,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,139313,89132.7,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31815.4,31815.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.18782e+06,1.18645e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,163354,163349,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.14193e+06,1.14079e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,146207,146206,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,43078.2,42928,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15109.6,15109.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70968.6,70956,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25985.3,25984.8,ns,,,,,

alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2696.67,1909.93,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,699.271,699.268,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1252.78,1168.32,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,754.049,754.047,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,872.254,842.95,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,176.674,176.674,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2209.08,2207.89,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,184.237,184.232,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2285.17,2285.01,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,194.028,194.021,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4207.38,4201.05,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,270.297,270.242,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3283.36,3276,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,303.922,303.913,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,293.118,292.477,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,218.538,218.536,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,267.954,267.077,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,195.443,195.437,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1010.61,992.626,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,964.081,963.813,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32164.2,30572.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4136.25,4136.12,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,140694,89099.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30272.9,30269.3,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.20645e+06,1.20563e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,160597,160590,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.25634e+06,1.25589e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,140579,140578,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,45963.5,45509.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15021,15020.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,74039,73145,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,26127.6,26127.1,ns,,,,,

alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2809.48,1842.45,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,703.323,703.32,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1241.32,1176.5,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,734.362,734.358,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,917.409,803.232,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,172.501,172.502,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2164.19,2163.31,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,187.841,187.836,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2128.58,2127.94,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,191.484,191.477,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,5849.31,5840.14,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,276.333,276.33,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,4634.4,4630.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,302.373,302.366,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,281.535,280.432,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,211.141,211.139,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,244.009,243.522,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,225.308,225.3,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,909.149,905.687,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,955.897,955.886,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,33357.2,31455.5,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4170.53,4170.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135936,87508.8,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31761.2,31760.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.21671e+06,1.21549e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,164323,164321,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.49171e+06,1.49114e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,146789,146787,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,46552.8,46132.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,14494.2,14493.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70562.2,70008.3,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,27112.6,27112.2,ns,,,,,

alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2809.48,1842.45,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,703.323,703.32,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1241.32,1176.5,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,734.362,734.358,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,917.409,803.232,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,172.501,172.502,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2164.19,2163.31,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,187.841,187.836,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2128.58,2127.94,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,191.484,191.477,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,5849.31,5840.14,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,276.333,276.33,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,4634.4,4630.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,302.373,302.366,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,281.535,280.432,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,211.141,211.139,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,244.009,243.522,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,225.308,225.3,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,909.149,905.687,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,955.897,955.886,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,33357.2,31455.5,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4170.53,4170.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135936,87508.8,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31761.2,31760.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.21671e+06,1.21549e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,164323,164321,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.49171e+06,1.49114e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,146789,146787,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,46552.8,46132.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,14494.2,14493.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70562.2,70008.3,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,27112.6,27112.2,ns,,,,,

alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2809.48,1842.45,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,703.323,703.32,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1241.32,1176.5,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,734.362,734.358,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,917.409,803.232,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,172.501,172.502,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2164.19,2163.31,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,187.841,187.836,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2128.58,2127.94,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,191.484,191.477,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,5849.31,5840.14,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,276.333,276.33,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,4634.4,4630.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,302.373,302.366,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,281.535,280.432,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,211.141,211.139,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,244.009,243.522,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,225.308,225.3,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,909.149,905.687,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,955.897,955.886,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,33357.2,31455.5,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4170.53,4170.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135936,87508.8,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31761.2,31760.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.21671e+06,1.21549e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,164323,164321,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.49171e+06,1.49114e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,146789,146787,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,46552.8,46132.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,14494.2,14493.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70562.2,70008.3,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,27112.6,27112.2,ns,,,,,

alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2623.44,1864.54,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,718.033,717.998,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1220.35,1175.53,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,754.215,754.158,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,841.419,813.58,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,174.332,174.331,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2147.61,2147.58,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,188.325,188.32,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1942.74,1942.16,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,192.715,192.709,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4986.6,4980.93,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,258.077,258.071,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,4865.86,4860.02,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,307.718,307.713,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,266.863,264.644,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,216.62,216.612,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,232.556,231.988,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,193.671,193.663,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1107.53,1098.31,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,982.148,982.13,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31178.5,29493.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4172.29,4172.19,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,137976,87314.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,29566.2,29565.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.173e+06,1.17117e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,164839,164837,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.25077e+06,1.25035e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,144517,144512,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,47015.9,46457.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15157,15156.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72189.1,72170.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25200.5,25200.1,ns,,,,,

alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2594.7,1787.57,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,699.753,699.753,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1237.74,1178.73,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,736.786,736.783,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,878.432,830.573,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,179.651,179.645,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2292.71,2291.77,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,189.432,189.426,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1737.65,1737.14,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,192.017,192.013,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4682.93,4678.35,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,252.383,252.375,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3602.38,3596.02,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,299.742,299.737,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,259.004,257.146,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,208.292,208.292,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,245.713,239.679,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,209.871,209.865,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1056.34,1035.26,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,999.33,999.323,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32015,30470.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4112.96,4112.81,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,139313,89132.7,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31815.4,31815.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.18782e+06,1.18645e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,163354,163349,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.14193e+06,1.14079e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,146207,146206,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,43078.2,42928,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15109.6,15109.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70968.6,70956,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25985.3,25984.8,ns,,,,,

alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2594.7,1787.57,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,699.753,699.753,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1237.74,1178.73,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,736.786,736.783,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,878.432,830.573,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,179.651,179.645,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2292.71,2291.77,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,189.432,189.426,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1737.65,1737.14,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,192.017,192.013,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4682.93,4678.35,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,252.383,252.375,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3602.38,3596.02,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,299.742,299.737,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,259.004,257.146,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,208.292,208.292,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,245.713,239.679,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,209.871,209.865,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1056.34,1035.26,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,999.33,999.323,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32015,30470.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4112.96,4112.81,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,139313,89132.7,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31815.4,31815.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.18782e+06,1.18645e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,163354,163349,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.14193e+06,1.14079e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,146207,146206,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,43078.2,42928,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15109.6,15109.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70968.6,70956,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25985.3,25984.8,ns,,,,,

alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2696.67,1909.93,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,699.271,699.268,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1252.78,1168.32,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,754.049,754.047,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,872.254,842.95,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,176.674,176.674,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2209.08,2207.89,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,184.237,184.232,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2285.17,2285.01,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,194.028,194.021,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4207.38,4201.05,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,270.297,270.242,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3283.36,3276,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,303.922,303.913,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,293.118,292.477,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,218.538,218.536,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,267.954,267.077,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,195.443,195.437,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1010.61,992.626,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,964.081,963.813,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32164.2,30572.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4136.25,4136.12,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,140694,89099.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30272.9,30269.3,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.20645e+06,1.20563e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,160597,160590,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.25634e+06,1.25589e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,140579,140578,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,45963.5,45509.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15021,15020.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,74039,73145,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,26127.6,26127.1,ns,,,,,

alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2696.67,1909.93,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,699.271,699.268,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1252.78,1168.32,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,754.049,754.047,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,872.254,842.95,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,176.674,176.674,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2209.08,2207.89,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,184.237,184.232,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2285.17,2285.01,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,194.028,194.021,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4207.38,4201.05,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,270.297,270.242,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3283.36,3276,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,303.922,303.913,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,293.118,292.477,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,218.538,218.536,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,267.954,267.077,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,195.443,195.437,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1010.61,992.626,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,964.081,963.813,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32164.2,30572.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4136.25,4136.12,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,140694,89099.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30272.9,30269.3,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.20645e+06,1.20563e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,160597,160590,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.25634e+06,1.25589e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,140579,140578,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,45963.5,45509.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15021,15020.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,74039,73145,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,26127.6,26127.1,ns,,,,,

multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2594.7,1787.57,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,699.753,699.753,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1237.74,1178.73,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,736.786,736.783,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,878.432,830.573,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,179.651,179.645,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2292.71,2291.77,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,189.432,189.426,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1737.65,1737.14,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,192.017,192.013,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4682.93,4678.35,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,252.383,252.375,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3602.38,3596.02,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,299.742,299.737,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,259.004,257.146,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,208.292,208.292,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,245.713,239.679,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,209.871,209.865,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1056.34,1035.26,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,999.33,999.323,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32015,30470.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4112.96,4112.81,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,139313,89132.7,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31815.4,31815.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.18782e+06,1.18645e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,163354,163349,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.14193e+06,1.14079e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,146207,146206,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,43078.2,42928,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15109.6,15109.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70968.6,70956,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25985.3,25984.8,ns,,,,,

multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2809.48,1842.45,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,703.323,703.32,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1241.32,1176.5,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,734.362,734.358,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,917.409,803.232,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,172.501,172.502,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2164.19,2163.31,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,187.841,187.836,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2128.58,2127.94,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,191.484,191.477,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,5849.31,5840.14,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,276.333,276.33,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,4634.4,4630.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,302.373,302.366,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,281.535,280.432,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,211.141,211.139,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,244.009,243.522,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,225.308,225.3,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,909.149,905.687,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,955.897,955.886,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,33357.2,31455.5,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4170.53,4170.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135936,87508.8,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31761.2,31760.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.21671e+06,1.21549e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,164323,164321,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.49171e+06,1.49114e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,146789,146787,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,46552.8,46132.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,14494.2,14493.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70562.2,70008.3,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,27112.6,27112.2,ns,,,,,

multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2632.92,1890.58,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,715.641,715.643,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1314.26,1183.09,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,752.731,752.729,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,883.178,824.458,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,168.186,168.182,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2005.6,2004.89,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,184.862,184.857,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1794.17,1793.24,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,187.749,187.744,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3869.14,3857.25,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,254.446,254.397,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,4925.7,4917.13,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,293.168,293.159,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,247.157,246.041,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,224.053,224.047,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,280.095,255.033,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,207.537,207.53,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,854.205,846.075,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,949.007,948.998,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31029.7,29348.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4088.81,4088.65,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,138847,88541.6,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,28233.2,28232.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.17476e+06,1.17438e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,163656,163653,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19045e+06,1.18955e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,144905,144904,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,43394,42834.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15531.6,15531.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,73869.6,73151.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25781.2,25780.7,ns,,,,,

multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 glibc

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2774.34,1895.19,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,698.885,698.887,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1246.51,1149.84,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,738.639,738.634,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,885.771,817.665,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,174.331,174.33,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2086.78,2076.81,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.263,186.219,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1869.65,1869.58,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,191.455,191.448,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4162.01,4156.49,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,278.961,278.909,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,4269.6,4261.88,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,297.892,297.885,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,320.624,311.523,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,214.378,214.376,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,234.559,233.782,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,210.212,210.206,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,964.615,930.993,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,950.187,950.175,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31383.3,29645.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4265.24,4265.15,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,136303,87836.6,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30802.1,30801.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.18406e+06,1.18334e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,165881,165879,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.18401e+06,1.18273e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,142819,142814,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,43152.4,43051.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15320.3,15320,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,68288.3,68251.3,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25804.1,25803.6,ns,,,,,

multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2594.7,1787.57,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,699.753,699.753,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1237.74,1178.73,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,736.786,736.783,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,878.432,830.573,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,179.651,179.645,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2292.71,2291.77,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,189.432,189.426,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1737.65,1737.14,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,192.017,192.013,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4682.93,4678.35,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,252.383,252.375,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3602.38,3596.02,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,299.742,299.737,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,259.004,257.146,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,208.292,208.292,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,245.713,239.679,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,209.871,209.865,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1056.34,1035.26,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,999.33,999.323,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32015,30470.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4112.96,4112.81,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,139313,89132.7,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31815.4,31815.2,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.18782e+06,1.18645e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,163354,163349,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.14193e+06,1.14079e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,146207,146206,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,43078.2,42928,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15109.6,15109.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70968.6,70956,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25985.3,25984.8,ns,,,,,

multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 proxy_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2809.48,1842.45,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,703.323,703.32,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1241.32,1176.5,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,734.362,734.358,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,917.409,803.232,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,172.501,172.502,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2164.19,2163.31,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,187.841,187.836,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2128.58,2127.94,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,191.484,191.477,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,5849.31,5840.14,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,276.333,276.33,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,4634.4,4630.51,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,302.373,302.366,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,281.535,280.432,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,211.141,211.139,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,244.009,243.522,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,225.308,225.3,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,909.149,905.687,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,955.897,955.886,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,33357.2,31455.5,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4170.53,4170.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,135936,87508.8,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,31761.2,31760.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.21671e+06,1.21549e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,164323,164321,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.49171e+06,1.49114e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,146789,146787,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,46552.8,46132.9,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,14494.2,14493.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,70562.2,70008.3,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,27112.6,27112.2,ns,,,,,

multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2632.92,1890.58,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,715.641,715.643,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1314.26,1183.09,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,752.731,752.729,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,883.178,824.458,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,168.186,168.182,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2005.6,2004.89,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,184.862,184.857,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1794.17,1793.24,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,187.749,187.744,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3869.14,3857.25,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,254.446,254.397,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,4925.7,4917.13,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,293.168,293.159,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,247.157,246.041,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,224.053,224.047,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,280.095,255.033,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,207.537,207.53,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,854.205,846.075,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,949.007,948.998,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31029.7,29348.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4088.81,4088.65,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,138847,88541.6,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,28233.2,28232.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.17476e+06,1.17438e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,163656,163653,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19045e+06,1.18955e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,144905,144904,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,43394,42834.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15531.6,15531.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,73869.6,73151.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25781.2,25780.7,ns,,,,,

multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 os_provider

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2632.92,1890.58,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,715.641,715.643,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1314.26,1183.09,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,752.731,752.729,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,883.178,824.458,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,168.186,168.182,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2005.6,2004.89,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,184.862,184.857,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1794.17,1793.24,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,187.749,187.744,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,3869.14,3857.25,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,254.446,254.397,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,4925.7,4917.13,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,293.168,293.159,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,247.157,246.041,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,224.053,224.047,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,280.095,255.033,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,207.537,207.53,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,854.205,846.075,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,949.007,948.998,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31029.7,29348.1,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4088.81,4088.65,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,138847,88541.6,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,28233.2,28232.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.17476e+06,1.17438e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,163656,163653,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.19045e+06,1.18955e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,144905,144904,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,43394,42834.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15531.6,15531.2,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,73869.6,73151.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25781.2,25780.7,ns,,,,,

multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2696.67,1909.93,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,699.271,699.268,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1252.78,1168.32,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,754.049,754.047,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,872.254,842.95,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,176.674,176.674,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2209.08,2207.89,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,184.237,184.232,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,2285.17,2285.01,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,194.028,194.021,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4207.38,4201.05,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,270.297,270.242,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,3283.36,3276,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,303.922,303.913,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,293.118,292.477,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,218.538,218.536,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,267.954,267.077,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,195.443,195.437,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1010.61,992.626,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,964.081,963.813,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,32164.2,30572.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4136.25,4136.12,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,140694,89099.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30272.9,30269.3,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.20645e+06,1.20563e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,160597,160590,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.25634e+06,1.25589e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,140579,140578,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,45963.5,45509.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15021,15020.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,74039,73145,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,26127.6,26127.1,ns,,,,,

multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2623.44,1864.54,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,718.033,717.998,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1220.35,1175.53,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,754.215,754.158,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,841.419,813.58,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,174.332,174.331,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2147.61,2147.58,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,188.325,188.32,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1942.74,1942.16,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,192.715,192.709,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4986.6,4980.93,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,258.077,258.071,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,4865.86,4860.02,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,307.718,307.713,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,266.863,264.644,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,216.62,216.612,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,232.556,231.988,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,193.671,193.663,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1107.53,1098.31,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,982.148,982.13,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31178.5,29493.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4172.29,4172.19,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,137976,87314.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,29566.2,29565.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.173e+06,1.17117e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,164839,164837,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.25077e+06,1.25035e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,144517,144512,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,47015.9,46457.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15157,15156.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72189.1,72170.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25200.5,25200.1,ns,,,,,

multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2623.44,1864.54,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,718.033,717.998,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1220.35,1175.53,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,754.215,754.158,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,841.419,813.58,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,174.332,174.331,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2147.61,2147.58,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,188.325,188.32,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1942.74,1942.16,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,192.715,192.709,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4986.6,4980.93,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,258.077,258.071,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,4865.86,4860.02,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,307.718,307.713,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,266.863,264.644,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,216.62,216.612,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,232.556,231.988,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,193.671,193.663,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,1107.53,1098.31,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,982.148,982.13,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31178.5,29493.2,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4172.29,4172.19,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,137976,87314.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,29566.2,29565.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.173e+06,1.17117e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,164839,164837,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.25077e+06,1.25035e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,144517,144512,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,47015.9,46457.1,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15157,15156.7,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,72189.1,72170.6,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25200.5,25200.1,ns,,,,,

multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1 scalable_pool

Environment Variables:

Command:

/home/pmdk/ur-actions-runner/_work/unified-runtime/unified-runtime/umf_build/benchmark/umf-benchmark --benchmark_format=csv

Output:

name,iterations,real_time,cpu_time,time_unit,bytes_per_second,items_per_second,label,error_occurred,error_message
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2774.34,1895.19,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,698.885,698.887,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1246.51,1149.84,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,738.639,738.634,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,885.771,817.665,ns,,,,,
"glibc/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,174.331,174.33,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,2086.78,2076.81,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,186.263,186.219,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,1869.65,1869.58,ns,,,,,
"os_provider/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,191.455,191.448,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,4162.01,4156.49,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,278.961,278.909,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,4269.6,4261.88,ns,,,,,
"proxy_pool<os_provider>/alloc/max_allocs:1000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,297.892,297.885,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:4",800000,320.624,311.523,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/size:4096/iterations:200000/threads:1",200000,214.378,214.376,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:4",800000,234.559,233.782,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:100000/size:4096/iterations:200000/threads:1",200000,210.212,210.206,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:4",800000,964.615,930.993,ns,,,,,
"scalable_pool<os_provider>/alloc/max_allocs:10000/pre_allocs:0/min size:8/max size:65536/granularity:8/iterations:200000/threads:1",200000,950.187,950.175,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,31383.3,29645.4,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,4265.24,4265.15,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,136303,87836.6,ns,,,,,
"glibc/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,30802.1,30801.9,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.18406e+06,1.18334e+06,ns,,,,,
"proxy_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,165881,165879,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,1.18401e+06,1.18273e+06,ns,,,,,
"os_provider/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,142819,142814,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:4",8000,43152.4,43051.8,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/size:4096/iterations:2000/threads:1",2000,15320.3,15320,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:4",8000,68288.3,68251.3,ns,,,,,
"scalable_pool<os_provider>/multiple_malloc_free/max_allocs:10000/min size:8/max size:65536/granularity:8/iterations:2000/threads:1",2000,25804.1,25803.6,ns,,,,,

@igchor igchor force-pushed the deffered_kernel_keep_submitted branch from c09361a to 1fb113e Compare December 12, 2024 20:35
@igchor igchor marked this pull request as ready for review December 12, 2024 21:04
@igchor igchor requested review from a team as code owners December 12, 2024 21:04
Keep vector of submitted kernels in the queue and
retain kerenels when enqueuing the kernel. Release
the kernels on queueFinish.
@igchor igchor force-pushed the deffered_kernel_keep_submitted branch from 1fb113e to 5fc4e53 Compare December 13, 2024 17:53
@github-actions github-actions bot added the ci/cd Continuous integration/devliery label Dec 13, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ci/cd Continuous integration/devliery conformance Conformance test suite issues. level-zero L0 adapter specific issues
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant