Add Multi-Head Attention support for Vitis #1163
base: main
Conversation
Thank you so much for merging it into main!
Hi @rianbrooksflynn! Great work on the Multi-Head Attention implementation. Could you consider adding usage examples? The examples could demonstrate the important PyTorch requirements you mentioned (`batch_first=True`, `channels_last_conversion='off'`, same key/value inputs) and basic Keras usage. Thanks!
As far as I can tell, masking (e.g., causal masking) is not supported here. Would it be OK if I build on top of this PR and add it?
Description
This PR adds support for Multi-Head Attention using either Keras or PyTorch with the Vitis backend in `io_parallel` mode. Tests have been added for both Keras and PyTorch parsing.
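For reference, a minimal Keras usage sketch (layer sizes, the toy model, and the output directory are illustrative assumptions, not taken from this PR):

```python
# Sketch only: layer sizes, the toy model, and the output directory are
# illustrative assumptions, not taken from this PR.
from tensorflow import keras

import hls4ml

seq_len, embed_dim = 8, 16

inp = keras.layers.Input(shape=(seq_len, embed_dim))
# Self-attention: the same tensor serves as query, key, and value.
attn = keras.layers.MultiHeadAttention(num_heads=2, key_dim=embed_dim)(inp, inp)
model = keras.Model(inputs=inp, outputs=attn)

config = hls4ml.utils.config_from_keras_model(model, granularity='name')
hls_model = hls4ml.converters.convert_from_keras_model(
    model,
    hls_config=config,
    backend='Vitis',
    io_type='io_parallel',
    output_dir='hls4ml_prj_mha_keras',
)
hls_model.compile()
```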
Credit is due to @Ethan0Jiang and @LostEcho365 (Zhixing Jiang and Dennis Yin) for their original implementation and Keras parsing support; my contributions were implementing PyTorch support and adding unit tests. (Here's a link to their pre-print.) The original code authors have given permission for their code to be merged into hls4ml.
There are some important notes for PyTorch (TODO: add documentation to this effect); see the sketch after this list:

- Use `batch_first=True` when instantiating `nn.MultiheadAttention` so that the inputs match up (`(batch_size, seq_len, embed_dim)` instead of `(seq_len, batch_size, embed_dim)`).
- Use `channels_last_conversion='off'` when calling `config_from_pytorch_model()`, since batch-first PyTorch and Keras use the same input shape.
- `nn.MultiheadAttention` takes three inputs, `query`, `key`, and `value`; hls4ml currently only supports the case where `key` and `value` are the same, so you must give PyTorch the same data for the second input and the third input.
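A minimal PyTorch sketch of these constraints (shapes, module names, and the exact `config_from_pytorch_model()` / `convert_from_pytorch_model()` signatures are assumptions and may differ between hls4ml versions):

```python
# Sketch only: shapes, module names, and conversion-call signatures are
# illustrative assumptions, not taken from this PR.
import torch
import torch.nn as nn

import hls4ml

seq_len, embed_dim, num_heads = 8, 16, 2


class MHAModel(nn.Module):
    def __init__(self):
        super().__init__()
        # batch_first=True so inputs are (batch_size, seq_len, embed_dim).
        self.mha = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)

    def forward(self, query, key, value):
        out, _ = self.mha(query, key, value)
        return out


model = MHAModel().eval()

# hls4ml only supports key == value, so pass the same tensor for both.
x = torch.randn(1, seq_len, embed_dim)
_ = model(x, x, x)

# channels_last_conversion='off' because batch-first PyTorch and Keras
# already use the same (seq_len, embed_dim) input layout.
config = hls4ml.utils.config_from_pytorch_model(
    model,
    [(seq_len, embed_dim)] * 3,
    channels_last_conversion='off',
)
hls_model = hls4ml.converters.convert_from_pytorch_model(
    model,
    hls_config=config,
    backend='Vitis',
    io_type='io_parallel',
    output_dir='hls4ml_prj_mha_pytorch',
)
hls_model.compile()
```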
Type of change
Tests
Two unit tests added: `test/pytest/test_multiheadattention.py` and `test/pytest/test_multiheadattention_pytorch.py`.
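A hypothetical way to run both tests through pytest's Python API (assumes pytest and the hls4ml test dependencies are installed, running from the repository root):

```python
# Hypothetical runner: invoke the two new unit tests via pytest's Python API.
# Assumes pytest and the hls4ml test dependencies are installed and this is
# run from the repository root.
import pytest

pytest.main([
    "test/pytest/test_multiheadattention.py",
    "test/pytest/test_multiheadattention_pytorch.py",
])
```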
Checklist
- I have run `pre-commit` on the files I edited or added.