Replies: 3 comments
-
If you guys need me to learn how to use GitHub, figure out how to submit a proper PR, and then research how to properly implement and code the solution I'm proposing, I'm willing to give it a shot because I think the quality-of-life improvement is worth it. But it's going to take me a cool minute, as I'm not a coder, I'm more of a dabbler.
-
Yeah, this would be very useful! I ran into this problem earlier this week when testing speculative decoding: OpenWebUI was quietly overriding my command-line settings without me realising it.
-
FWIW: I added support for stripping sampling params to llama-swap while I was also waiting on this feature. I was going to PR it here, but after looking through llama-server's code it felt like a code smell to put it in there. I think it fits better in a layer outside of llama-server.
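To give a feel for the "outer layer" approach, here is a hypothetical sketch, not llama-swap's actual code: delete the sampler fields from the request JSON before forwarding it, so llama-server falls back to its command-line defaults. It uses nlohmann::json, which llama.cpp itself bundles; the field list is an assumption and a real proxy would need to cover every sampling field llama-server accepts.

```cpp
// Hypothetical sketch of stripping client-sent sampler fields from a request
// body before proxying it to llama-server, so the server's command-line
// defaults win. The field names are assumptions for illustration.
#include <nlohmann/json.hpp>

nlohmann::json strip_sampling_params(nlohmann::json body) {
    for (const char * key : {"top_k", "top_p", "min_p", "temperature",
                             "repeat_penalty", "samplers"}) {
        body.erase(key); // erasing a missing key is a harmless no-op
    }
    return body;
}
```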
-
Would it be possible to add a flag to llama-server that ignores any samplers sent by the client and uses the samplers you set when running llama-server? Something like `--ignore-client-samplers 1`.
Alternatively, maybe the default GUI could have a setting to avoid sending the samplers (and sampler order) to the server. I would prefer the first option, though, as it's the most flexible: it's client-agnostic and would work for any client that uses the API but sends its own samplers to the model (even when you don't want it to).
I ask because lately every model comes with developer-recommended sampler settings, so I have to tweak them on the client for every model I run, since the client overrides the server-side settings from the server's launch parameters. It would be nice to not have to worry about it and (optionally) let the server control samplers, just as it currently controls the prompt template from the GGUF metadata. The default behavior for samplers should stay as it is now, with an option at launch to ignore the client's.
This also makes it difficult to let non-technical people play with the local models I serve for them through the default web client, because they don't know how or why to change samplers.
I'm not a coder and I've never submitted a PR (I don't even know how to use git, lol), but the relevant code is in llama.cpp/tools/server/server.cpp, line 253 at commit 2baf077.
You would take all of the lines like this one:
```cpp
params.sampling.top_k = json_value(data, "top_k", defaults.sampling.top_k);
```
And essentially change them to:
```cpp
params.sampling.top_k = defaults.sampling.top_k;
```
Including the sampler order:
```cpp
params.sampling.samplers = defaults.sampling.samplers;
```
Except this would unconditionally ignore the client entirely. So the proper solution would be to create a flag/parameter and wrap the above in an if statement, depending on which way the flag is set when running llama-server. Default behavior would stay as it is now, but with the flag set, the json_value lookups would be skipped (see the sketch below).
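To make the idea concrete, here is a minimal sketch of that if statement, built from the json_value pattern above. The ignore_client_samplers option is a made-up name, and only a few fields are shown; a real patch would need to cover every sampling parameter the server reads from the request.

```cpp
// Hypothetical sketch: gate the client-supplied sampling params behind a
// server-side flag. `ignore_client_samplers` is an assumed new option; it
// does not exist in llama.cpp today.
if (defaults.ignore_client_samplers) {
    // Keep whatever was set on the llama-server command line.
    params.sampling.top_k    = defaults.sampling.top_k;
    params.sampling.top_p    = defaults.sampling.top_p;
    params.sampling.temp     = defaults.sampling.temp;
    params.sampling.samplers = defaults.sampling.samplers;
    // ... same for the remaining sampling fields
} else {
    // Current behavior: the request body overrides the defaults.
    params.sampling.top_k = json_value(data, "top_k",       defaults.sampling.top_k);
    params.sampling.top_p = json_value(data, "top_p",       defaults.sampling.top_p);
    params.sampling.temp  = json_value(data, "temperature", defaults.sampling.temp);
    // ... same for the remaining sampling fields
}
```

Keeping the current behavior as the else branch means the flag is purely opt-in, matching the "default behavior stays as it is now" requirement.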
I could also be entirely wrong about the code part :D