Enable VECT_MUL for blackwell attention #536

neildhar · 2025-10-09T05:55:18Z

Enabling `VECT_MUL` previously caused a regression, but that appears to come from the FMA vectorisation specifically. Enable it only for the multiplication for now, which appears to be a performance win.

manman-ren · 2025-10-09T14:40:19Z

tritonbench/kernels/blackwell_triton_fused_attention_dp.py

-        if VECT_MUL:
+        # TODO: Figure out why vector FMA slows things down.
+        if VECT_MUL and False:
            qk = _fma_f32x2(qk, qk_scale, -m_ij[:, None])


Wondering if we should make `VECT_MUL` an integer bitmask, with each bit enabling one vectorization, so we can autotune over the combinations.

I like that idea.
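A minimal sketch of the bitmask idea discussed above, in plain Python so it runs without Triton. The names `VECT_MUL_BIT`, `VECT_FMA_BIT`, `uses`, and `candidate_masks` are hypothetical and do not appear in the PR. How the mask would be passed into the kernel (for example as a compile-time constant) is also an assumption.

```python
from itertools import product

# Hypothetical bit assignments; the PR does not define these names.
VECT_MUL_BIT = 1 << 0  # vectorise the plain multiplication
VECT_FMA_BIT = 1 << 1  # vectorise the fused multiply-add (disabled by this PR)


def uses(vect_mask: int, bit: int) -> bool:
    """Return True if the given vectorisation is enabled in the mask."""
    return (vect_mask & bit) != 0


def candidate_masks(bits=(VECT_MUL_BIT, VECT_FMA_BIT)):
    """Enumerate every on/off combination of the given bits for an autotuner to sweep."""
    masks = []
    for choice in product((False, True), repeat=len(bits)):
        mask = 0
        for enabled, bit in zip(choice, bits):
            if enabled:
                mask |= bit
        masks.append(mask)
    return sorted(masks)


if __name__ == "__main__":
    for mask in candidate_masks():
        print(mask, "mul:", uses(mask, VECT_MUL_BIT), "fma:", uses(mask, VECT_FMA_BIT))
```

Under these assumed bit assignments, the behaviour in this PR (vector multiply on, vector FMA off) corresponds to a mask of `VECT_MUL_BIT` alone. The `if VECT_MUL and False:` guard in the diff would become a check on the FMA bit instead.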

Enable VECT_MUL for blackwell attention

90eda66


neildhar temporarily deployed to docker-s3-upload October 9, 2025 05:55 — with GitHub Actions Inactive

meta-cla bot added the cla signed label Oct 9, 2025

neildhar requested review from njriasan, manman-ren and htyu October 9, 2025 05:56

manman-ren reviewed Oct 9, 2025
