
Support streaming delta.reasoning_content in WebUI #15052


Open: wants to merge 2 commits into base: master


@mostlygeek (Contributor) commented Aug 3, 2025

When llama-server sends reasoning content in choices[0].delta.reasoning_content, the WebUI adds that content within <think>...</think> tags.

Additionally, this fixes an issue with Qwen3 thinking models that do not send a <think> tag by default.

Ref: Addresses feedback from @ngxson in #14997 (comment) by using Message.content instead of introducing a new reasoningContent?: string field.
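A minimal sketch of the wrapping described above, assuming each streamed chunk carries a delta with optional content and reasoning_content fields. The names Delta and ThinkingAccumulator are hypothetical and are not taken from this PR's diff; the actual WebUI code may be structured differently.

```typescript
// Hypothetical shape of one streamed delta; only the two fields
// discussed in this PR are modeled.
interface Delta {
  content?: string | null;
  reasoning_content?: string | null;
}

// Hypothetical accumulator that folds reasoning into the message text,
// so no separate reasoningContent field is needed.
class ThinkingAccumulator {
  private text = "";
  private inThink = false;

  // Append one delta and return the message content accumulated so far.
  push(delta: Delta): string {
    if (delta.reasoning_content) {
      if (!this.inThink) {
        // Open the tag on the first reasoning chunk, even if the
        // model itself never emitted a <think> tag.
        this.text += "<think>";
        this.inThink = true;
      }
      this.text += delta.reasoning_content;
    }
    if (delta.content) {
      if (this.inThink) {
        // The first regular content chunk ends the reasoning section.
        this.text += "</think>";
        this.inThink = false;
      }
      this.text += delta.content;
    }
    return this.text;
  }
}

// Usage: two reasoning chunks followed by two regular content chunks.
const acc = new ThinkingAccumulator();
const deltas: Delta[] = [
  { reasoning_content: "Let me " },
  { reasoning_content: "think." },
  { content: "Hello" },
  { content: "!" },
];
let result = "";
for (const d of deltas) {
  result = acc.push(d);
}
// result === "<think>Let me think.</think>Hello!"
```

Because the tags are synthesized on the client in this sketch, it would behave the same whether or not the model sends its own opening tag, which is the situation described for the Qwen3 thinking models.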

When llama-server sends reasoning content in
choices[0].delta.reasoning_content, add that content within
<think></think> tags.

Additionally fixes an issue with Qwen3 thinking models that do not send a
<think> tag by default.
@mostlygeek (Contributor, Author)

Tested with Qwen3-30B-A3B-Thinking-2507-UD-Q4_K_XL.gguf. With this PR, thinking content from the model is correctly handled in Message.content:

image
