[GPU] fixup matmul ref implementation by dyoussif · Pull Request #4958 · uxlfoundation/oneDNN

dyoussif · 2026-04-06T22:31:08Z

Ref implementation incorrectly uses dst layout for dynamic scales layout; scales layout should always be plain.
Ref gemm does not properly support runtime dims; defer to ref matmul.
jit gemm expects plain layout for dst when applying dynamic scales

dyoussif · 2026-04-06T22:38:21Z

make test
set test_scope=NIGHTLY
disable test_device_cpu
disable benchdnn_all
enable benchdnn_matmul
enable benchdnn_ip

kealan-barbieri · 2026-04-06T22:40:09Z

src/gpu/intel/dynamic_scale.cl

            c_stride_d3);
 #else
 #if NDIMS == 5
-    scale_off = DST_SCALE_OFF(


It looks like DST_SCALE_OFF is only used here so either we should fix that macro or drop it from src/gpu/intel/include/types_specific.h.

rjoursler · 2026-04-07T18:07:45Z

src/gpu/intel/dynamic_scale.cl

+    scale_off = scale_off_dst(d0 % DST_D0, m, n, groupSize);
 #else
-    scale_off = DST_SCALE_OFF(m, n, 0, 0, 0, groupSize, 1);
+    scale_off = scale_off_dst(m, n, groupSize);


nit: we could normalize these calls to scale_off = scale_off_dst(n, m, d0 % DST_D0, d1 %DST_D1, ...) and consolidate all the #ifdef logic at the function definition.

rjoursler · 2026-04-07T18:10:00Z

src/gpu/intel/gemm/ref.hpp

+                    !utils::one_of(DNNL_RUNTIME_DIM_VAL, desc()->m(),
+                            desc()->n(), desc()->k(), desc()->lda(),
+                            desc()->ldb(), desc()->ldc(), desc()->batch()),
+                    VERBOSE_RUNTIMEDIM_UNSUPPORTED);


Comment: The OpenCL kernel looks like it supports runtime dimensions, so we are likely just missing some logic in the execute function, it might be good to explain why were are disabling it here (in general, I think ref_gemm should just be removed, but RNN relies on it as I recall).

dyoussif added 2 commits March 31, 2026 11:27

xe: ocl: fixup dynamic scale offset calculation

99101c8

xe: gemm: jit: restrict supported layouts for dynamic scale

8698446

dyoussif requested review from a team as code owners April 6, 2026 22:31

github-actions bot added platform:gpu-intel Codeowner: @oneapi-src/onednn-gpu-intel component:tests Codeowner: @oneapi-src/onednn-arch labels Apr 6, 2026

dyoussif changed the title ~~Dyoussif/gemm ref scales~~ [GPU] fixup matmul ref implementation Apr 6, 2026

dyoussif added 2 commits April 6, 2026 15:37

xe: gemm: ref: return unimplemented for runtime dims cases

45e9e85

tests: benchdnn: inputs: matmul: add ref regression case

b1d324b

dyoussif force-pushed the dyoussif/gemm_ref_scales branch from a3665fa to b1d324b Compare April 6, 2026 22:37

kealan-barbieri reviewed Apr 6, 2026

View reviewed changes

kealan-barbieri approved these changes Apr 6, 2026

View reviewed changes

rjoursler approved these changes Apr 7, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[GPU] fixup matmul ref implementation#4958

[GPU] fixup matmul ref implementation#4958
dyoussif wants to merge 4 commits intomainfrom
dyoussif/gemm_ref_scales

dyoussif commented Apr 6, 2026

Uh oh!

dyoussif commented Apr 6, 2026

Uh oh!

kealan-barbieri Apr 6, 2026

Uh oh!

rjoursler Apr 7, 2026

Uh oh!

rjoursler Apr 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

dyoussif commented Apr 6, 2026

Uh oh!

dyoussif commented Apr 6, 2026

Uh oh!

kealan-barbieri Apr 6, 2026

Choose a reason for hiding this comment

Uh oh!

rjoursler Apr 7, 2026

Choose a reason for hiding this comment

Uh oh!

rjoursler Apr 7, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants