
⚠️ 🔴 Add ministral model #40247


Open · wants to merge 20 commits into main

Conversation

manueldeprada
Contributor

@manueldeprada manueldeprada commented Aug 18, 2025

Coming from #39799, adding Ministral model to support its interleaved attention.
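For context, a minimal sketch of what "interleaved attention" means here (illustrative values only, not Ministral's actual hyperparameters): the model alternates sliding-window attention layers with occasional full-attention layers, which configs in this style typically encode as a per-layer pattern.

```python
# Illustrative sketch only; the window size, layer count, and 3:1 ratio are
# assumptions for demonstration, not Ministral's real configuration.
sliding_window = 4096
num_hidden_layers = 8

# Every 4th layer attends over the full context; the rest use a sliding window.
layer_types = [
    "full_attention" if (i + 1) % 4 == 0 else "sliding_attention"
    for i in range(num_hidden_layers)
]
print(layer_types)
# ['sliding_attention', 'sliding_attention', 'sliding_attention', 'full_attention', ...]
```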

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@manueldeprada
Contributor Author

manueldeprada commented Aug 20, 2025

Still have to check the slow tests, but the rest is ready @ArthurZucker

@manueldeprada manueldeprada marked this pull request as ready for review August 20, 2025 17:16
Collaborator

@ArthurZucker ArthurZucker left a comment


Nice!

@manueldeprada manueldeprada removed the request for review from Rocketknight1 August 21, 2025 13:27
@huggingface huggingface deleted a comment from github-actions bot Aug 21, 2025
@manueldeprada
Contributor Author

run-slow: ministral

Contributor

This comment contains run-slow, running the specified jobs:

models: ['models/ministral']
quantizations: [] ...

@manueldeprada
Contributor Author

run-slow: ministral

@huggingface huggingface deleted a comment from github-actions bot Aug 22, 2025
@manueldeprada
Contributor Author

@ArthurZucker tests are ready. One note: after this merges, if I open a PR in mistralai/Ministral-8B-Instruct-2410 to change the architecture to MinistralForCausalLM, older transformers versions won't load the model. Are we OK with this? Is there a preferred way of handling these situations? Maybe a warning in the README?

Collaborator

@ArthurZucker ArthurZucker left a comment


Okay!
Regarding BC, we are kinda obliged to break; a proper ⚠️ 🔴 on the PR title should be alright

@manueldeprada manueldeprada changed the title Add ministral model ⚠️ 🔴 Add ministral model Aug 25, 2025
Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: auto, ministral

@manueldeprada
Contributor Author

run-slow: auto, ministral

Contributor

This comment contains run-slow, running the specified jobs:

models: ['models/auto', 'models/ministral']
quantizations: [] ...

Comment on lines +375 to +377

```python
        # The sliding window alternating layers are not always activated depending on the config
        if self.has_sliding_layers:
            causal_mask_mapping["sliding_attention"] = create_sliding_window_causal_mask(**mask_kwargs)
```
Collaborator


Ministral must have sliding and non-sliding! Let's update the forward to make sure it always creates it!
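A minimal sketch of the requested change, assuming the mask helpers from transformers.masking_utils and the mask_kwargs dict the surrounding forward already builds: create both mask variants unconditionally instead of gating the sliding one on self.has_sliding_layers.

```python
from transformers.masking_utils import (
    create_causal_mask,
    create_sliding_window_causal_mask,
)

# Sketch only: Ministral always interleaves sliding and full attention, so the
# forward can build both masks unconditionally. `mask_kwargs` stands for the
# mask arguments the surrounding forward already assembles.
causal_mask_mapping = {
    "full_attention": create_causal_mask(**mask_kwargs),
    "sliding_attention": create_sliding_window_causal_mask(**mask_kwargs),
}
```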

@ArthurZucker
Collaborator

Maybe a warning in the README?

Yep, we can have that!
Older versions will load Mistral instead, but that should be fine; it's not breaking

@manueldeprada
Contributor Author

manueldeprada commented Aug 25, 2025

Yep, we can have that! Older versions will load Mistral instead, but that should be fine; it's not breaking

It is breaking: once we change mistralai/Ministral-8B-Instruct-2410/config.json to be

```json
{
  "architectures": [
    "MinistralForCausalLM"
  ],
  ...
}
```

old versions will not recognize that architecture. That's why I was worried 😅
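To make the failure mode concrete, here is a hypothetical illustration (not from the PR) of what an older transformers release does once the checkpoint's config declares the new architecture:

```python
from transformers import AutoModelForCausalLM

# On a release that predates the Ministral class, neither the
# "MinistralForCausalLM" architecture nor a "ministral" model_type can be
# resolved, so loading raises instead of silently falling back to Mistral.
try:
    model = AutoModelForCausalLM.from_pretrained("mistralai/Ministral-8B-Instruct-2410")
except (KeyError, ValueError) as err:
    print(f"Unsupported on this transformers version: {err}")
```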
