[WIP]Enable assymetric heads for D_qk and D_v for micro kernel sdpa by h-sadia · Pull Request #5301 · uxlfoundation/oneDNN

h-sadia · 2026-06-11T21:04:57Z

Description

Fused micro kernel doesn't allow different head sizes of QK and V tensors. This PR focuses on enabling that as per this request here: https://jira.devtools.intel.com/browse/MFDNN-14385

N.B: Will continue testing it with the tests present in Graph API and extend testing starting ww25.2

Fixes # (github issue)

Checklist

General

Do all unit and benchdnn tests (make test and make test_benchdnn_*) pass locally for each commit?
Have you formatted the code using clang-format?

TaoLv · 2026-06-12T00:58:58Z

 # f16 inputs + f32 intermediates + f16 outputs
 --reset --op-kind=1:Multiply,1:Divide --case=complex_fusion/mha/sdpa-plain-simplified-f16-f32.json
+# Asymmetric heads: Q/K head_size=64, V head_size=128
+--reset --op-kind=1:Multiply,1:Divide --case=complex_fusion/mha/sdpa-plain-asymm-heads-f16-f32.json


We already have test cases to cover d_qk != d_v.

oneDNN/tests/benchdnn/inputs/graph/complex_fusion/harness_mha_all

Lines 211 to 215 in 8cde797

# d_qk != d_v

--reset --in-shapes=8:1x16x384x32,8:1x16x384x64,8:1x16x384x128 --case=complex_fusion/mha/sdpa-plain-simplified-f32.json

--reset --in-shapes=3:1x16x384x32,3:1x16x384x64,3:1x16x384x128 --case=complex_fusion/mha/sdpa-plain-simplified-f16-f32.json

--reset --in-shapes=3:1x16x384x32,3:1x16x384x64,3:1x16x384x128 --case=complex_fusion/mha/sdpa-plain-implicit-causal-mask-fp32-bs1.json

--reset --in-shapes=24:1x16x384x32,24:1x16x384x64,24:1x16x384x128 --case=complex_fusion/mha/sdpa-plain-bottom-right-implicit-causal-mask-f16-f32.json

h-sadia requested review from a team as code owners June 11, 2026 21:04

github-actions Bot added platform:gpu-intel Codeowner: @oneapi-src/onednn-gpu-intel component:tests Codeowner: @oneapi-src/onednn-arch labels Jun 11, 2026

h-sadia added 4 commits June 11, 2026 14:38

src: gpu: intel: sdpa: split head between kq and v tensors

6722f6c

src: gpu: intel: sdpa: div_up the group count

795f18d

src: gpu: intel: sdpa: decouple D_MAX for QK and V

8c3d272

tests: benchdnn: input: graph: add test for assym head

0fa2486

h-sadia force-pushed the hsadia/assym_heads_sdpa branch from 4a844b2 to 0fa2486 Compare June 11, 2026 21:39

src: gpu: intel: sdpa: fix config dispatch for ugemm_vs

f3a67be

h-sadia force-pushed the hsadia/assym_heads_sdpa branch from f3a67be to 943a4cc Compare June 11, 2026 21:43

h-sadia changed the title ~~[WIP] Enable assymetric heads for D_qk and D_v for micro kernel sdpa~~ Enable assymetric heads for D_qk and D_v for micro kernel sdpa Jun 11, 2026

h-sadia force-pushed the hsadia/assym_heads_sdpa branch from 943a4cc to f3a67be Compare June 11, 2026 21:47

h-sadia changed the title ~~Enable assymetric heads for D_qk and D_v for micro kernel sdpa~~ [WIP]Enable assymetric heads for D_qk and D_v for micro kernel sdpa Jun 11, 2026

TaoLv reviewed Jun 12, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP]Enable assymetric heads for D_qk and D_v for micro kernel sdpa#5301

[WIP]Enable assymetric heads for D_qk and D_v for micro kernel sdpa#5301
h-sadia wants to merge 5 commits into
mainfrom
hsadia/assym_heads_sdpa

h-sadia commented Jun 11, 2026 •

edited

Loading

Uh oh!

TaoLv Jun 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	# d_qk != d_v
	--reset --in-shapes=8:1x16x384x32,8:1x16x384x64,8:1x16x384x128 --case=complex_fusion/mha/sdpa-plain-simplified-f32.json
	--reset --in-shapes=3:1x16x384x32,3:1x16x384x64,3:1x16x384x128 --case=complex_fusion/mha/sdpa-plain-simplified-f16-f32.json
	--reset --in-shapes=3:1x16x384x32,3:1x16x384x64,3:1x16x384x128 --case=complex_fusion/mha/sdpa-plain-implicit-causal-mask-fp32-bs1.json
	--reset --in-shapes=24:1x16x384x32,24:1x16x384x64,24:1x16x384x128 --case=complex_fusion/mha/sdpa-plain-bottom-right-implicit-causal-mask-f16-f32.json

Conversation

h-sadia commented Jun 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklist

General

Uh oh!

TaoLv Jun 12, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

h-sadia commented Jun 11, 2026 •

edited

Loading