Skip to content

Conversation

zucchini-nlp
Copy link
Member

@zucchini-nlp zucchini-nlp commented Jul 30, 2025

What does this PR do?

As per title. TBH the helper is same for most models (excluding Qwen or other special arch), so we could just move it to the base modeling class. I thought it's against transformers philosophy, so that's why we have near-same code copied everywhere

Half of these are anyway handled by modular and copied from LLaVA

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Copy link
Collaborator

@ArthurZucker ArthurZucker left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think its fine: modular makes it obvious that its re-used, and its still core part of the modeling file!

@zucchini-nlp zucchini-nlp enabled auto-merge (squash) August 1, 2025 07:48
Copy link
Contributor

github-actions bot commented Aug 1, 2025

[For maintainers] Suggested jobs to run (before merge)

run-slow: aria, aya_vision, blip_2, chameleon, cohere2_vision, deepseek_vl, deepseek_vl_hybrid, emu3, fuyu, gemma3, gemma3n, glm4v, got_ocr2, instructblip, instructblipvideo, internvl

@zucchini-nlp zucchini-nlp merged commit d3b8627 into huggingface:main Aug 1, 2025
25 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants