xe: sdpa: add support for reusable sdpa #3322

syurkevi · 2025-05-23T03:29:52Z

Description

This PR introduces support for reusable sdpa so recompilation can be skipped for different sequence and query lengths. Head size is still baked into the kernel. The configuration for microkernel headers has been extracted and made part of the headers since this must now be done ahead of execution for caching.

Checklist

General

Do all unit and benchdnn tests (make test and make test_benchdnn_*) pass locally for each commit?
Have you formatted the code using clang-format?

syurkevi · 2025-05-29T03:35:36Z

<3% geomean regression from reusable params in kernel.
Example of reusable cache hit:

onednn_verbose,v1,primitive,create:cache_miss,gpu,sdpa,ocl:micro:any,undef,query:f16::blocked:abcd::f0 key:f16::blocked:abdc::f0 val:f16::blocked:abcd::f0 msk:f16::blocked:abcd::f0 dst:f16::blocked:abcd::f0,,msk:1d,1x4x1x128:1x4x128x386:1x4x386x128,6.40015

onednn_verbose,v1,primitive,create:kernel_cache_hit,gpu,sdpa,ocl:micro:any,undef,query:f16::blocked:abcd::f0 key:f16::blocked:abdc::f0 val:f16::blocked:abcd::f0 msk:f16::blocked:abcd::f0 dst:f16::blocked:abcd::f0,,msk:1d,1x4x1x128:1x4x128x391:1x4x391x128,0.00683594

syurkevi · 2025-05-29T03:52:31Z

make test
disable benchdnn_all
enable benchdnn_graph
enable test_device_gpu
enable arch_gpu_xe-hpc
enable arch_gpu_xe-hpg-atsm
enable arch_gpu_xe-hpg-dg2
enable arch_gpu_xe-lpg+
enable arch_gpu_xe2-hpg-bmg
enable arch_gpu_xe2-lpg

syurkevi · 2025-05-29T17:15:20Z

make test
disable benchdnn_all
enable benchdnn_graph
enable test_device_gpu
enable arch_gpu_xe-hpc
enable arch_gpu_xe-hpg-atsm
enable arch_gpu_xe-hpg-dg2
enable arch_gpu_xe-lpg+
enable arch_gpu_xe2-hpg-bmg
enable arch_gpu_xe2-lpg

syurkevi requested review from a team as code owners May 23, 2025 03:29

github-actions bot added platform:gpu-intel Codeowner: @oneapi-src/onednn-gpu-intel component:tests Codeowner: @oneapi-src/onednn-arch labels May 23, 2025

syurkevi force-pushed the syurkevi/reusable_sdpa branch from cee7f45 to 66a9d8b Compare May 29, 2025 03:31

syurkevi changed the title ~~xe: sdpa: add support for reusable sdpa [WIP]~~ xe: sdpa: add support for reusable sdpa May 29, 2025

syurkevi requested a review from umar456 May 29, 2025 03:33

xe: sdpa: add support for reusable sdpa

f1f7cce

syurkevi force-pushed the syurkevi/reusable_sdpa branch from 66a9d8b to f1f7cce Compare May 29, 2025 17:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

xe: sdpa: add support for reusable sdpa #3322

xe: sdpa: add support for reusable sdpa #3322

Uh oh!

syurkevi commented May 23, 2025

Uh oh!

syurkevi commented May 29, 2025

Uh oh!

syurkevi commented May 29, 2025

Uh oh!

syurkevi commented May 29, 2025

Uh oh!

Uh oh!

xe: sdpa: add support for reusable sdpa #3322

Are you sure you want to change the base?

xe: sdpa: add support for reusable sdpa #3322

Uh oh!

Conversation

syurkevi commented May 23, 2025

Description

Checklist

General

Uh oh!

syurkevi commented May 29, 2025

Uh oh!

syurkevi commented May 29, 2025

Uh oh!

syurkevi commented May 29, 2025

Uh oh!

Uh oh!