mirror of
https://github.com/zhayujie/chatgpt-on-wechat.git
synced 2026-06-02 00:57:41 +08:00
73 lines
2.6 KiB
Plaintext
73 lines
2.6 KiB
Plaintext
---
|
|
title: DeepSeek
|
|
description: DeepSeek model configuration (Text Chat + Thinking Mode)
|
|
---
|
|
|
|
DeepSeek is one of the default recommended vendors in Agent mode, focused on cost-effective text chat and task planning.
|
|
|
|
## Text Chat
|
|
|
|
```json
|
|
{
|
|
"model": "deepseek-v4-flash",
|
|
"deepseek_api_key": "YOUR_API_KEY"
|
|
}
|
|
```
|
|
|
|
| Parameter | Description |
|
|
| --- | --- |
|
|
| `model` | Supports `deepseek-v4-flash` (Default), `deepseek-v4-pro` |
|
|
| `deepseek_api_key` | Create one on the [DeepSeek Platform](https://platform.deepseek.com/api_keys) |
|
|
| `deepseek_api_base` | Optional, defaults to `https://api.deepseek.com/v1`. Can be changed to a third-party proxy |
|
|
|
|
### Model Selection
|
|
|
|
| Model | Use Case |
|
|
| --- | --- |
|
|
| `deepseek-v4-flash` | Default recommended; fast and low cost |
|
|
| `deepseek-v4-pro` | Smarter; better for complex tasks |
|
|
|
|
## Thinking Mode
|
|
|
|
The V4 series (`deepseek-v4-flash` / `deepseek-v4-pro`) supports an explicit "thinking mode": before producing the final answer, the model emits a chain of thought (`reasoning_content`) to improve answer quality.
|
|
|
|
### Toggle
|
|
|
|
Controlled by the global `enable_thinking` config, and can also be toggled from the Web Console's configuration page:
|
|
|
|
```json
|
|
{
|
|
"enable_thinking": true
|
|
}
|
|
```
|
|
|
|
- `true`: the model thinks before answering across all channels. The Web Console displays the thinking process; IM channels (WeChat / WeCom / DingTalk / Feishu) do not show it but still get better answers.
|
|
- `false`: thinking is disabled, responses are faster, and time-to-first-token is lower.
|
|
|
|
### Reasoning Effort
|
|
|
|
Under thinking mode, `reasoning_effort` controls reasoning intensity:
|
|
|
|
```json
|
|
{
|
|
"enable_thinking": true,
|
|
"reasoning_effort": "high"
|
|
}
|
|
```
|
|
|
|
| Value | Use Case |
|
|
| --- | --- |
|
|
| `high` (Default) | Day-to-day Agent tasks; balanced reasoning and speed |
|
|
| `max` | Complex coding, long-horizon planning, strictly constrained tasks; deeper reasoning but more time and output tokens |
|
|
|
|
`reasoning_effort` only takes effect when `enable_thinking` is `true`; it is ignored automatically when the model does not support thinking mode.
|
|
|
|
### Behavior Notes
|
|
|
|
- **Sampling parameters**: in thinking mode, `temperature`, `top_p`, `presence_penalty`, and `frequency_penalty` are ignored by the server (without errors). CowAgent automatically skips them.
|
|
- **Multi-turn tool calls**: when the history contains tool calls, DeepSeek requires every assistant message to include `reasoning_content`. CowAgent handles this automatically, so toggling thinking mode across turns will not cause errors.
|
|
|
|
<Tip>
|
|
`deepseek-v4-flash` is used by default; switch to `deepseek-v4-pro` for complex tasks; enable `enable_thinking` when deep reasoning is needed.
|
|
</Tip>
|