Pull requests: huggingface/transformers
Pull requests list
disable deepspeed when setting up fake trainer
#38101
opened May 13, 2025 by
winglian
5 tasks
In Llama4 fix wrongly inverted causal attention mask when using SDPA implementation
#38094
opened May 12, 2025 by
sogartar
Fix InternVL interpolate_pos_encoding and add to video_processing_auto
#38092
opened May 12, 2025 by
yonigozlan
Omit creation of positional IDs within ESM if applicable
#38089
opened May 12, 2025 by
simonlevine
Draft
Refactor MambaCache to modeling_mamba.py (parity with Zamba)
#38086
opened May 12, 2025 by
manueldeprada
Cache System Refactor: Layered Architecture
#38077
opened May 12, 2025 by
manueldeprada
Draft
7 of 23 tasks
Fix temporal padding in Qwen2VLImageProcessor when the number of frames is not divisible by temporal_patch_size
#38076
opened May 12, 2025 by
ritwickchaudhry
Fix description and formatting errors in code docs
#38074
opened May 12, 2025 by
bilibili12433014
1 of 5 tasks
Updated the Model docs - for the ALIGN model
#38072
opened May 11, 2025 by
1himan
3 of 5 tasks
Fix bug in prefill_chunk_size that ignores disable_compile flag
#38067
opened May 11, 2025 by
xmarva
3 of 5 tasks
Added scores in the streamer classes based on generation flag
#38064
opened May 10, 2025 by
LuisCarlos-104171
Handling Overlapping Annotations in Mask2Former by A Small Trick
#38054
opened May 9, 2025 by
Ahmed-G-ElTaher