
Conversation

@wbruna (Contributor) commented on Oct 5, 2025

For #851. Allow the model loading logic to tolerate missing layers, which is enough to run the 12B Pruning variant:

https://huggingface.co/OPPOer/Qwen-Image-Pruning
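
For reference, the general idea looks roughly like this (an illustrative sketch, not the actual diff; `TensorStorage`, `present_blocks`, and the probe tensor name are made-up identifiers): instead of assuming a fixed transformer block count, probe which blocks actually have tensors in the file and instantiate only those, so a pruned checkpoint loads without "missing tensor" errors.

```cpp
#include <cstdio>
#include <map>
#include <string>
#include <vector>

// Placeholder for per-tensor metadata (offset, type, shape, ...).
struct TensorStorage {};

// Hypothetical view of the tensors actually present in the GGUF file.
using TensorStorageMap = std::map<std::string, TensorStorage>;

// Probe one representative tensor name per transformer block and keep
// only the blocks that are actually stored; missing blocks are treated
// as pruned rather than as a load failure.
static std::vector<int> present_blocks(const TensorStorageMap& tensors,
                                       int max_blocks) {
    std::vector<int> blocks;
    for (int i = 0; i < max_blocks; i++) {
        std::string probe =
            "transformer_blocks." + std::to_string(i) + ".attn.to_q.weight";
        if (tensors.count(probe) > 0) {
            blocks.push_back(i);  // block i survived pruning
        }
    }
    return blocks;
}

int main() {
    // Fake file contents: blocks 0 and 2 exist, block 1 was pruned away.
    TensorStorageMap tensors = {
        {"transformer_blocks.0.attn.to_q.weight", {}},
        {"transformer_blocks.2.attn.to_q.weight", {}},
    };
    for (int i : present_blocks(tensors, 3)) {
        std::printf("loading block %d\n", i);
    }
    return 0;
}
```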

Tested with the Q4_K_M quant from https://huggingface.co/wsbagnsv1/Qwen-Image-Pruning-GGUF :

(attached image: teste_1759693079)

@wbruna (Contributor, Author) commented on Oct 5, 2025

Quality seems a little worse than with the Lightning model, with ~30% less peak VRAM usage and similar speed gains.

wbruna added a commit to wbruna/llama.cpp that referenced this pull request Oct 6, 2025
wbruna added a commit to wbruna/llama.cpp that referenced this pull request Oct 9, 2025