xe: sdpa: add support for reusable sdpa #3322
Open
Description
This PR adds support for reusable sdpa kernels, so recompilation can be skipped when the sequence length or query length changes. Head size is still baked into the kernel. The configuration for the microkernel headers has been extracted and made part of the headers, because it must now be done ahead of execution for the kernel to be cached.
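To illustrate the reuse idea described above, here is a minimal sketch, not the PR's actual implementation. It assumes a hypothetical cache keyed only by head size, with sequence and query lengths treated as runtime arguments that never trigger a recompile. All names (`sdpa_kernel_t`, `sdpa_kernel_cache_t`, `sdpa_shape_t`, `count_compilations`) are invented for illustration and are not oneDNN types or APIs.

```cpp
#include <cassert>
#include <cstddef>
#include <unordered_map>

// Hypothetical stand-in for a compiled SDPA kernel (not a oneDNN type).
struct sdpa_kernel_t {
    int head_size; // baked into the kernel at compile time
};

// Hypothetical problem shape: only head_size affects compilation here.
struct sdpa_shape_t {
    int head_size;
    int seq_len;   // runtime argument in this sketch
    int query_len; // runtime argument in this sketch
};

// Hypothetical cache keyed by head size only.
class sdpa_kernel_cache_t {
public:
    // Returns the kernel for the given shape, "compiling" only on a miss.
    // seq_len and query_len are deliberately not part of the key.
    const sdpa_kernel_t &get(const sdpa_shape_t &shape) {
        assert(shape.head_size > 0);
        auto it = cache_.find(shape.head_size);
        if (it == cache_.end()) {
            ++compile_count_; // stands in for an expensive kernel build
            it = cache_.emplace(shape.head_size,
                                sdpa_kernel_t{shape.head_size}).first;
        }
        return it->second;
    }

    std::size_t compile_count() const { return compile_count_; }

private:
    std::unordered_map<int, sdpa_kernel_t> cache_;
    std::size_t compile_count_ = 0;
};

// Counts how many compilations a sequence of requests would trigger.
inline std::size_t count_compilations(const sdpa_shape_t *shapes,
                                      std::size_t n) {
    sdpa_kernel_cache_t cache;
    for (std::size_t i = 0; i < n; ++i) cache.get(shapes[i]);
    return cache.compile_count();
}
```

In this sketch, requests that differ only in `seq_len` or `query_len` share one kernel, while a new `head_size` still forces a rebuild, mirroring the limitation noted in the description.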
Checklist
General
Do all unit and benchdnn tests (make test and make test_benchdnn_*) pass locally for each commit?