feat(models): support reasoning_effort config for DeepSeek V4

2026-07-19 12:47:25 +08:00 · 2026-05-15 18:17:35 +08:00
parent fe871aad77
commit f5479c56af
5 changed files with 73 additions and 19 deletions
--- a/docs/en/models/deepseek.mdx
+++ b/docs/en/models/deepseek.mdx
@@ -42,6 +42,24 @@ Controlled by the global `enable_thinking` setting:
 - `true`: thinking is on across all channels. The Web console renders the reasoning trace; IM channels (WeChat / WeCom / DingTalk / Feishu) don't render it but still benefit from higher answer quality.
 - `false`: thinking off, faster responses with lower first-token latency.

+### Reasoning Effort
+
+When thinking mode is on, `reasoning_effort` sets how much reasoning the model performs before answering:
+
+```json
+{
+  "enable_thinking": true,
+  "reasoning_effort": "high"
+}
+```
+
+| Value | Use Case |
+| --- | --- |
+| `high` (default) | Day-to-day agent tasks; balances thinking depth against latency |
+| `max` | Complex coding, long-horizon planning, strict-constraint tasks. Deeper reasoning at the cost of more output tokens and higher latency |
+
+`reasoning_effort` only takes effect when `enable_thinking` is `true`. It is silently ignored on models that do not support thinking mode.
+
 ### Notes

 - **Sampling parameters**: under thinking mode, `temperature`, `top_p`, `presence_penalty`, and `frequency_penalty` are silently ignored by the server (no error). CowAgent skips sending them automatically.
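
The gating rules described in this hunk can be sketched as follows. This is a minimal illustration, not CowAgent's actual code: `build_request_params`, `SAMPLING_KEYS`, `VALID_EFFORTS`, and the `supports_thinking` flag are hypothetical names, and rejecting unknown effort values is an assumption, since the documentation only lists `high` and `max`.

```python
from typing import Any

# Hypothetical constants for illustration; not taken from the CowAgent source.
SAMPLING_KEYS = ("temperature", "top_p", "presence_penalty", "frequency_penalty")
VALID_EFFORTS = ("high", "max")


def build_request_params(
    config: dict[str, Any], supports_thinking: bool = True
) -> dict[str, Any]:
    """Pick the optional request parameters implied by the config.

    - Thinking on: include reasoning_effort (default "high") and skip the
      sampling parameters, which the docs say the server ignores.
    - Thinking off, or model without thinking support: drop
      reasoning_effort and pass sampling parameters through.
    """
    thinking = bool(config.get("enable_thinking", False)) and supports_thinking
    params: dict[str, Any] = {}

    if thinking:
        effort = config.get("reasoning_effort", "high")
        if effort not in VALID_EFFORTS:
            # Assumption: fail loudly on values the docs do not list.
            raise ValueError(f"unsupported reasoning_effort: {effort!r}")
        params["reasoning_effort"] = effort
    else:
        for key in SAMPLING_KEYS:
            if key in config:
                params[key] = config[key]

    return params


if __name__ == "__main__":
    cfg = {"enable_thinking": True, "reasoning_effort": "max", "temperature": 0.7}
    print(build_request_params(cfg))
```

With thinking enabled, only `reasoning_effort` survives; with it disabled, the same config yields only the sampling parameters.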