
Conversation

@yieldsurfer yieldsurfer commented Oct 2, 2025

Closes #8256

Changes

  • Added DeepSeek-V3.1-Terminus and DeepSeek-V3.1-turbo model variants to Chutes provider
  • Added GLM-4.6-FP8 model with 200K context window (204,800 tokens)
  • Fixed reasoning implementation to use chat_template_kwargs with thinking: true parameter
  • Parse reasoning_content field for hybrid reasoning models (DeepSeek V3.1, GLM-4.5, GLM-4.6)
  • Updated tests to verify reasoning mode functionality with proper API response parsing

Related GitHub issue

Roo Code task context (optional)

  • N/A — First contribution

Description

This PR adds support for the DeepSeek V3.1 model variants (Terminus and turbo) and the GLM-4.6-FP8 model to the Chutes provider.

Key implementation details

  • Reasoning mode is triggered by adding chat_template_kwargs: { thinking: true } to the API request when enableReasoningEffort is true
  • Hybrid reasoning models (DeepSeek V3.1, GLM-4.5, GLM-4.6) are detected and handled separately from DeepSeek R1 models
  • GLM-4.6 uses the same reasoning mechanism as GLM-4.5
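A minimal sketch of the mechanism described above, assuming simplified shapes — the names `buildParams` and `ChutesParams` are illustrative, and the actual wiring in `chutes.ts` may differ:

```typescript
// Illustrative sketch only; real request/option types in chutes.ts differ.
type ChutesParams = {
	model: string
	messages: { role: string; content: string }[]
	chat_template_kwargs?: { thinking: boolean }
}

// DeepSeek V3.1 and GLM-4.5/4.6 expose a "thinking" switch via the chat template.
function isHybridReasoningModel(modelId: string): boolean {
	return (
		modelId.includes("DeepSeek-V3.1") ||
		modelId.includes("GLM-4.5") ||
		modelId.includes("GLM-4.6")
	)
}

// Only attach chat_template_kwargs when the user enabled reasoning
// and the model actually supports the hybrid thinking mode.
function buildParams(modelId: string, enableReasoningEffort: boolean): ChutesParams {
	const params: ChutesParams = { model: modelId, messages: [] }
	if (enableReasoningEffort && isHybridReasoningModel(modelId)) {
		params.chat_template_kwargs = { thinking: true }
	}
	return params
}
```

Non-hybrid models (e.g. DeepSeek R1 variants) take a different path and never receive `chat_template_kwargs`.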

Review notes

  • The capitalization fix for "turbo" (lowercase 't') was necessary to match Chutes API requirements
  • Tests were updated to properly mock reasoning_content responses instead of XML tag parsing

Test procedure

  • Automated tests:
    • All 26 tests pass, including 2 new tests for reasoning mode with DeepSeek V3.1 and GLM models
    • Test coverage includes model configuration, reasoning parameter passing, and response parsing
  • Manual testing:
    • Tested all three new models (DeepSeek-V3.1-Terminus, DeepSeek-V3.1-turbo, GLM-4.6-FP8) with Chutes.ai API
    • Verified reasoning mode toggle correctly adds chat_template_kwargs parameter
    • Confirmed no errors with real API calls

Pre-submission checklist

Screenshots / videos

  • N/A

Documentation updates

  • No documentation updates are required.

Alignment with roadmap

  • ? (unclear to me)

Additional notes

This is my first contribution to Roo Code

Get in touch

  • Discord: paeperbag

Important

Add DeepSeek V3.1 variants and GLM-4.6 with reasoning support to Chutes provider, updating models and tests.

  • Models:
    • Add DeepSeek-V3.1-Terminus and DeepSeek-V3.1-turbo to ChutesModelId in chutes.ts.
    • Add GLM-4.6-FP8 with 200K context window to ChutesModelId in chutes.ts.
  • Reasoning:
    • Implement reasoning support using chat_template_kwargs: { thinking: true } in chutes.ts.
    • Parse reasoning_content for DeepSeek V3.1, GLM-4.5, and GLM-4.6 in createMessage() in chutes.ts.
  • Tests:
    • Add tests for GLM-4.6-FP8 and reasoning mode in chutes.spec.ts.
    • Update tests to mock reasoning_content responses in chutes.spec.ts.
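For illustration, the `reasoning_content` handling in `createMessage()` might look like the following sketch. The `Delta` and `Chunk` shapes here are simplified stand-ins, not the real streaming types:

```typescript
// Simplified stand-ins: hybrid models return the chain of thought in a
// separate reasoning_content field alongside the normal content field.
type Delta = { content?: string; reasoning_content?: string }
type Chunk = { type: "text" | "reasoning"; text: string }

// Map streaming deltas to tagged chunks so the caller can render
// reasoning and answer text separately.
function* toChunks(deltas: Delta[]): Generator<Chunk> {
	for (const d of deltas) {
		if (d.reasoning_content) yield { type: "reasoning", text: d.reasoning_content }
		if (d.content) yield { type: "text", text: d.content }
	}
}
```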

This description was created by Ellipsis for be2ad23. You can customize this summary. It will automatically update as commits are pushed.

roomote and others added 2 commits September 23, 2025 13:14
…for hybrid models

- Added deepseek-ai/DeepSeek-V3.1-Terminus and deepseek-ai/DeepSeek-V3.1-Turbo model variants to ChutesModelId type
- Enabled reasoning mode support for DeepSeek V3.1 and GLM-4.5 models when enableReasoningEffort is true
- Updated ChutesHandler to parse <think> tags for reasoning content in supported hybrid models
- Added tests for new model variants and reasoning mode functionality

Fixes RooCodeInc#8256
…ooCodeInc#8256)

- Add DeepSeek-V3.1-Terminus and DeepSeek-V3.1-turbo models
- Add GLM-4.6-FP8 model with 200K context window
- Fix reasoning implementation to use chat_template_kwargs with thinking parameter
- Parse reasoning_content field for hybrid reasoning models (DeepSeek V3.1, GLM-4.5, GLM-4.6)
- Update tests to verify reasoning mode functionality
- Fix capitalization: DeepSeek-V3.1-Turbo -> DeepSeek-V3.1-turbo

Fixes RooCodeInc#8256
@yieldsurfer yieldsurfer requested review from cte, jr and mrubens as code owners October 2, 2025 22:28
@dosubot dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. enhancement New feature or request labels Oct 2, 2025
@hannesrudolph hannesrudolph added the Issue/PR - Triage New issue. Needs quick review to confirm validity and assign labels. label Oct 2, 2025
@roomote roomote bot left a comment

I found some issues that need attention. See inline comments for details.

```diff
 	| "deepseek-ai/DeepSeek-V3.1"
 	| "deepseek-ai/DeepSeek-V3.1-Terminus"
-	| "deepseek-ai/DeepSeek-V3.1-Turbo"
+	| "deepseek-ai/DeepSeek-V3.1-turbo"
```

[P2] Potential breaking change: Renaming the model id from "DeepSeek-V3.1-Turbo" to "DeepSeek-V3.1-turbo" will break users who have existing configs referencing the old id. Consider adding a temporary alias/back-compat mapping (accept both ids) or a migration to remap the old value to the new one before lookup to avoid surprising failures.
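If back-compat were desired, a remap along the reviewer's suggestion could look like this hypothetical helper (not part of the PR; the author's reply below explains why it was ultimately unnecessary):

```typescript
// Hypothetical back-compat table: accept the old capitalized id and
// normalize it to the current id before model lookup.
const MODEL_ID_ALIASES: Record<string, string> = {
	"deepseek-ai/DeepSeek-V3.1-Turbo": "deepseek-ai/DeepSeek-V3.1-turbo",
}

function normalizeModelId(id: string): string {
	return MODEL_ID_ALIASES[id] ?? id
}
```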

Author's reply:

The correct model ID for this model on Chutes is "deepseek-ai/DeepSeek-V3.1-turbo". (Screenshot attached.)

```typescript
// Handle DeepSeek V3.1, GLM-4.5, and GLM-4.6 models with reasoning_content parsing
const isHybridReasoningModel =
	model.id.includes("DeepSeek-V3.1") || model.id.includes("GLM-4.5") || model.id.includes("GLM-4.6")
const reasoningEnabled = this.options.enableReasoningEffort === true
```
[P2] Consistency with reasoning toggle: This direct check (=== true) bypasses the shared helper and may diverge from global defaults or future logic. Prefer using the existing shouldUseReasoningEffort helper so provider behavior stays consistent across backends. Also remove the unused import if you decide to keep the direct check.
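A sketch of the suggested refactor — the real `shouldUseReasoningEffort` helper in the codebase may take different arguments; this stand-in only illustrates the idea of routing the toggle through one shared function:

```typescript
// Hypothetical stand-in for the shared helper the reviewer references.
// Falls back to a model-level default when the user has not set the option.
type ProviderOptions = { enableReasoningEffort?: boolean }

function shouldUseReasoningEffort(options: ProviderOptions, modelDefault = false): boolean {
	return options.enableReasoningEffort ?? modelDefault
}

// Provider code then asks the helper instead of checking the raw flag,
// so all backends resolve defaults the same way.
const reasoningEnabled = shouldUseReasoningEffort({ enableReasoningEffort: true })
```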

```diff
 const temperature = this.options.modelTemperature ?? this.getModel().info.temperature

-return {
+const params: any = {
```
[P3] Typing: Avoid any here; you can return the exact type to improve maintainability and catch mistakes earlier.

Suggested change:

```diff
-const params: any = {
+const params: OpenAI.Chat.Completions.ChatCompletionCreateParamsStreaming = {
```

@yieldsurfer yieldsurfer changed the base branch from feat/chutes-deepseek-v3-1-variants-reasoning to main October 2, 2025 22:43
@hannesrudolph hannesrudolph moved this from Triage to PR [Needs Prelim Review] in Roo Code Roadmap Oct 2, 2025
@hannesrudolph hannesrudolph added PR - Needs Preliminary Review and removed Issue/PR - Triage New issue. Needs quick review to confirm validity and assign labels. labels Oct 2, 2025

Labels

enhancement New feature or request PR - Needs Preliminary Review size:L This PR changes 100-499 lines, ignoring generated files.

Projects

Status: PR [Needs Prelim Review]

Development

Successfully merging this pull request may close these issues.

Support for DeepSeek V3.1 Terminus/Turbo Variants and Hybrid Model Reasoning via Chutes.ai Provider

3 participants