Add escaping of whitespaces in qwen3coder tool output #3691

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Merged

atobiszei merged 10 commits into main from atobisze_check_qwen3coder_fix

Oct 17, 2025

Collaborator

atobiszei commented Oct 8, 2025 •

edited

Loading

Without escaping several multi turn scenarios in BFCL had problem parsing model response. While this escaping is shallow (does not differentiate at which level of JSON/array we are) it improves results, and we don't have for now examples of this being to eager with escaping.

Additionally:
-> treat improper begining of tool (<function=...>) as an additional start of tool section. Several issues in BFCL were due to improper model output (lack of <tool_call>), but there were <function=...>

Ticket:CVS-174650

atobiszei added the WIP label

atobiszei force-pushed the atobisze_check_qwen3coder_fix branch from 0b9b92d to cca1f42 Compare

October 8, 2025 13:52

atobiszei added 2 commits

October 13, 2025 16:28


          Add escaping of whitespaces in qwen3coder tool output

dd9714c


          Extend parsing with incomplete tool call entry tag

8b7f8f6

atobiszei force-pushed the atobisze_check_qwen3coder_fix branch from 6b13486 to 8b7f8f6 Compare

October 13, 2025 14:28

atobiszei requested a review from Copilot

October 14, 2025 07:16

Copilot AI reviewed

View reviewed changes

Contributor

Copilot AI left a comment

Pull Request Overview

This PR adds whitespace escaping functionality to the Qwen3Coder tool output parser and improves handling of improperly formatted tool calls that begin with <function=...> tags instead of the expected <tool_call> tags.

Introduces an escapeString function to properly escape newlines in tool call arguments
Adds support for parsing tool calls that start with <function=...> directly (without <tool_call>)
Updates test cases to reflect the new escaped output format and additional parsing scenarios

Reviewed Changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 3 comments.

File	Description
qwen3coder_tool_parser.cpp	Implements whitespace escaping function and enhanced parsing logic for malformed tool calls
qwen3coder_tool_parser.hpp	Adds `<function=` tag to special parsing start tags set
qwen3coder_output_parser_test.cpp	Updates tests for escaped output and adds test for improper tag handling
spelling-whitelist.txt	Removes specific line reference for test file spelling issues

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

src/llm/io_processing/qwen3coder/qwen3coder_tool_parser.cpp Show resolved Hide resolved

src/test/llm/output_parsers/qwen3coder_output_parser_test.cpp Outdated Show resolved Hide resolved

src/test/llm/output_parsers/qwen3coder_output_parser_test.cpp Outdated Show resolved Hide resolved

atobiszei and others added 2 commits

October 14, 2025 10:58


          Apply suggestion from @Copilot

9c15e60

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>


          Apply suggestion from @Copilot

6b6214b

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

atobiszei removed the WIP label

atobiszei commented

View reviewed changes

src/llm/io_processing/qwen3coder/qwen3coder_tool_parser.cpp Outdated Show resolved Hide resolved


          Apply suggestion from @atobiszei

7fc0563

atobiszei requested review from dkalinowski and mzegla

October 15, 2025 10:02

atobiszei added 2 commits

October 16, 2025 10:50


          Fix start tags


          Merge branch 'main' into atobisze_check_qwen3coder_fix

72b54fc

atobiszei commented

View reviewed changes

src/llm/io_processing/output_parser.cpp Outdated Show resolved Hide resolved


          Apply suggestion from @atobiszei

0e4abd9

dtrawins approved these changes

View reviewed changes

mzegla reviewed

View reviewed changes

src/llm/io_processing/qwen3coder/qwen3coder_tool_parser.cpp Outdated

    
                      this->lastProcessedPosition = pos + Qwen3CoderToolParser::TOOL_START_TAG.length();

                      this->currentState = State::InsideToolCall;

                      this->toolCallPositions.begin.push(pos);

                      // normally we expect <tool_call> tag but we observerd that sometimes model generates <function=...> directly

Collaborator

mzegla Oct 16, 2025

Suggested change

      
                    // normally we expect <tool_call> tag but we observerd that sometimes model generates <function=...> directly
          
                    // normally we expect <tool_call> tag but we observed that sometimes model generates <function=...> directly

src/llm/io_processing/qwen3coder/qwen3coder_tool_parser.cpp Outdated

    
                          this->currentState = State::InsideToolCall;

                          this->toolCallPositions.begin.push(posTool);

                      } else {

                          // found <function=...> first, we will assume <tool_call> is missing and we will add it

Collaborator

mzegla Oct 16, 2025

"we will add it" - do you inject "<tool_call>" somewhere?

Collaborator Author

atobiszei Oct 17, 2025

misleading comment - i will remove it.

src/llm/io_processing/qwen3coder/qwen3coder_tool_parser.cpp

Comment on lines +182 to +187

    
                      } else if (posTool < posFunc) {

                          // found <tool_call> first

                          this->lastProcessedPosition = posTool + Qwen3CoderToolParser::TOOL_START_TAG.length();

                          this->currentState = State::InsideToolCall;

                          this->toolCallPositions.begin.push(posTool);

                      } else {

Collaborator

mzegla Oct 16, 2025

Does it cover well cases when posTool == std::string::npos or posFunc == std::string::npos?

Collaborator Author

atobiszei Oct 17, 2025

then posTool != posFunc so we hit either else if or else

atobiszei added 2 commits

October 17, 2025 08:37


          Merge remote-tracking branch 'origin/main' into atobisze_check_qwen3c…

f30be1d

…oder_fix


          Review fix

c92d05b

atobiszei requested a review from mzegla

October 17, 2025 06:39

mzegla approved these changes

View reviewed changes

atobiszei merged commit 18fb0fa into main

1 check passed

atobiszei deleted the atobisze_check_qwen3coder_fix branch

October 17, 2025 12:32

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet