docs: make English the default docs language and fix link paths

This commit is contained in:
zhayujie
2026-05-31 17:52:22 +08:00
parent 126649f70f
commit 7bf4ef3d05
231 changed files with 8999 additions and 8974 deletions

View File

@@ -1,15 +1,15 @@
---
title: Claude
description: Anthropic Claude 模型配置(文本对话 + 图像理解)
description: Anthropic Claude model configuration (Text Chat + Image Understanding)
---
Claude 由 Anthropic 提供,支持文本对话与图像理解,主流 Sonnet / Opus 模型均原生支持视觉,无需额外指定 Vision 模型。
Claude is provided by Anthropic and supports both text chat and image understanding. The mainstream Sonnet / Opus models natively support vision, so no separate Vision model needs to be specified.
<Tip>
通过 Web 控制台的「模型管理」页面可一站式配置以下全部能力,无需手动改配置文件。
All capabilities below can be configured in one place via the "Model Management" page in the Web Console, with no need to manually edit the configuration file.
</Tip>
## 文本对话
## Text Chat
```json
{
@@ -18,26 +18,26 @@ Claude 由 Anthropic 提供,支持文本对话与图像理解,主流 Sonnet
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `model` | 支持 `claude-opus-4-8``claude-opus-4-7``claude-sonnet-4-6``claude-opus-4-6``claude-sonnet-4-5``claude-sonnet-4-0``claude-3-5-sonnet-latest` 等,参考 [官方模型](https://docs.anthropic.com/en/docs/about-claude/models/overview) |
| `claude_api_key` | [Claude 控制台](https://console.anthropic.com/settings/keys) 创建 |
| `claude_api_base` | 可选,默认为 `https://api.anthropic.com/v1`,可改为第三方代理 |
| `model` | Supports `claude-opus-4-8`, `claude-opus-4-7`, `claude-sonnet-4-6`, `claude-opus-4-6`, `claude-sonnet-4-5`, `claude-sonnet-4-0`, `claude-3-5-sonnet-latest`, etc. See [official models](https://docs.anthropic.com/en/docs/about-claude/models/overview) |
| `claude_api_key` | Create one in the [Claude Console](https://console.anthropic.com/settings/keys) |
| `claude_api_base` | Optional, defaults to `https://api.anthropic.com/v1`. Can be changed to a third-party proxy |
### 模型选择
### Model Selection
| 模型 | 适用场景 |
| Model | Use Case |
| --- | --- |
| `claude-opus-4-8` | 默认推荐,最新旗舰,复杂推理与长链路任务效果最佳 |
| `claude-opus-4-7` | 上一代 Opus 旗舰 |
| `claude-sonnet-4-6` | 性价比与速度平衡,成本更低 |
| `claude-opus-4-6` / `claude-sonnet-4-5` / `claude-sonnet-4-0` | 更早的旗舰,价格更低 |
| `claude-opus-4-8` | Default recommended, latest flagship; best for complex reasoning and long-running tasks |
| `claude-opus-4-7` | Previous-generation Opus flagship |
| `claude-sonnet-4-6` | Balanced cost and speed, lower cost |
| `claude-opus-4-6` / `claude-sonnet-4-5` / `claude-sonnet-4-0` | Earlier flagships at a lower price |
## 图像理解
## Image Understanding
配置 `claude_api_key` Agent Vision 工具会自动使用 Claude 主模型识别图像,无需额外配置。
Once `claude_api_key` is configured, the Agent's Vision tool automatically uses the Claude main model to recognize images, with no extra setup required.
如需手动指定 Vision 模型,可在配置文件中显式配置:
To manually specify a Vision model, set it explicitly in the configuration file:
```json
{

View File

@@ -1,41 +1,41 @@
---
title: Coding Plan
description: Coding Plan 模式模型配置
description: Coding Plan model configuration
---
> Coding Plan 是各厂商推出的编程包月套餐,适合高频使用 Agent 的场景。CowAgent 支持通过 OpenAI 兼容方式接入各厂商的 Coding Plan 接口。
> Coding Plan is a monthly subscription package offered by various providers, ideal for high-frequency Agent usage. CowAgent supports all Coding Plan providers via OpenAI-compatible mode.
<Note>
Coding Plan API Base API Key 通常与普通按量计费接口不通用,请在各厂商平台单独获取。
Coding Plan API Base and API Key are usually separate from the standard pay-as-you-go ones. Please obtain them from each provider's platform.
</Note>
## 通用配置格式
## General Configuration
所有厂商均可使用 OpenAI 兼容协议接入可在web控制台快速配置。设置模型厂商为**OpenAI**,选择自定义模型并填入模型编码,最后填写对应厂商的API Base API Key
All providers can be accessed via the OpenAI-compatible protocol, and can be quickly configured through the web console. Set the model provider to **OpenAI**, select a custom model and enter the model code, then fill in the corresponding provider's API Base and API Key:
<img src="https://cdn.link-ai.tech/doc/20260318113134.png" width="800"/>
也可通过 `config.json` 配置文件直接修改:
You can also configure directly in `config.json`:
```json
{
"bot_type": "openai",
"model": "模型名称",
"open_ai_api_base": "厂商 Coding Plan API Base",
"model": "MODEL_NAME",
"open_ai_api_base": "PROVIDER_CODING_PLAN_API_BASE",
"open_ai_api_key": "YOUR_API_KEY"
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `bot_type` | 固定为 `openai`OpenAI 兼容方式) |
| `model` | 各厂商支持的模型名称 |
| `open_ai_api_base` | 各厂商 Coding Plan 专用 API Base |
| `open_ai_api_key` | 各厂商 Coding Plan 专用 API Key |
| `bot_type` | Must be `openai` (OpenAI-compatible mode) |
| `model` | Model name supported by the provider |
| `open_ai_api_base` | Provider's Coding Plan API Base URL |
| `open_ai_api_key` | Provider's Coding Plan API Key |
---
## 阿里云
## Alibaba Cloud
```json
{
@@ -46,13 +46,13 @@ description: Coding Plan 模式模型配置
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `model` | `qwen3.5-plus``qwen3-max-2026-01-23``qwen3-coder-next``qwen3-coder-plus``glm-5``glm-4.7``kimi-k2.5``MiniMax-M2.5` |
| `model` | `qwen3.5-plus`, `qwen3-max-2026-01-23`, `qwen3-coder-next`, `qwen3-coder-plus`, `glm-5`, `glm-4.7`, `kimi-k2.5`, `MiniMax-M2.5` |
| `open_ai_api_base` | `https://coding.dashscope.aliyuncs.com/v1` |
| `open_ai_api_key` | Coding Plan 专用 Key与按量计费接口不通用 |
| `open_ai_api_key` | Coding Plan specific key (not shared with pay-as-you-go) |
官方文档:[快速开始](https://help.aliyun.com/zh/model-studio/coding-plan-quickstart?spm=a2c4g.11186623.help-menu-2400256.d_0_2_1.70115203zi5Igc)、[模型列表](https://help.aliyun.com/zh/model-studio/coding-plan)
Reference: [Quick Start](https://help.aliyun.com/zh/model-studio/coding-plan-quickstart?spm=a2c4g.11186623.help-menu-2400256.d_0_2_1.70115203zi5Igc), [Model List](https://help.aliyun.com/zh/model-studio/coding-plan)
---
@@ -67,18 +67,17 @@ description: Coding Plan 模式模型配置
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `model` | `MiniMax-M2.5``MiniMax-M2.5-highspeed``MiniMax-M2.1``MiniMax-M2` |
| `open_ai_api_base` | 国内:`https://api.minimaxi.com/v1`;海外:`https://api.minimax.io/v1` |
| `open_ai_api_key` | Coding Plan 专用 Key与按量计费接口不通用 |
| `model` | `MiniMax-M2.5`, `MiniMax-M2.5-highspeed`, `MiniMax-M2.1`, `MiniMax-M2` |
| `open_ai_api_base` | China: `https://api.minimaxi.com/v1`; Global: `https://api.minimax.io/v1` |
| `open_ai_api_key` | Coding Plan specific key (not shared with pay-as-you-go) |
官方文档:[国内 Key 获取](https://platform.minimaxi.com/docs/coding-plan/quickstart)、[模型列表](https://platform.minimaxi.com/docs/guides/pricing-coding-plan)、[国际 Key 获取](https://platform.minimax.io/docs/coding-plan/quickstart)
Reference: [China Key](https://platform.minimaxi.com/docs/coding-plan/quickstart), [Model List](https://platform.minimaxi.com/docs/guides/pricing-coding-plan), [Global Key](https://platform.minimax.io/docs/coding-plan/quickstart)
---
## 智谱 GLM
## GLM
```json
{
@@ -89,34 +88,13 @@ description: Coding Plan 模式模型配置
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `model` | `glm-5``glm-4.7``glm-4.6``glm-4.5``glm-4.5-air` |
| `open_ai_api_base` | 中国区:`https://open.bigmodel.cn/api/coding/paas/v4`;全球区:`https://api.z.ai/api/coding/paas/v4` |
| `open_ai_api_key` | API Key 与普通接口通用 |
| `model` | `glm-5`, `glm-4.7`, `glm-4.6`, `glm-4.5`, `glm-4.5-air` |
| `open_ai_api_base` | China: `https://open.bigmodel.cn/api/coding/paas/v4`; Global: `https://api.z.ai/api/coding/paas/v4` |
| `open_ai_api_key` | Shared with standard API |
官方文档:[国内版快速开始](https://docs.bigmodel.cn/cn/coding-plan/quick-start)、[国际版快速开始](https://docs.z.ai/devpack/quick-start)
---
## 火山引擎
```json
{
"bot_type": "openai",
"model": "Doubao-Seed-2.0-Code",
"open_ai_api_base": "https://ark.cn-beijing.volces.com/api/coding/v3",
"open_ai_api_key": "YOUR_API_KEY"
}
```
| 参数 | 说明 |
| --- | --- |
| `model` | `Doubao-Seed-2.0-Code`、`Doubao-Seed-2.0-pro`、`Doubao-Seed-2.0-lite`、`Doubao-Seed-Code`、`MiniMax-M2.5`、`Kimi-K2.5`、`GLM-4.7`、`DeepSeek-V3.2` |
| `open_ai_api_base` | `https://ark.cn-beijing.volces.com/api/coding/v3` |
| `open_ai_api_key` | API Key 与普通接口通用 |
官方文档:[快速开始](https://www.volcengine.com/docs/82379/1928261?lang=zh)
Reference: [China Quick Start](https://docs.bigmodel.cn/cn/coding-plan/quick-start), [Global Quick Start](https://docs.z.ai/devpack/quick-start)
---
@@ -131,10 +109,31 @@ description: Coding Plan 模式模型配置
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `model` | 填写 `kimi-for-coding` 会自动更新模型,或指定模型例如 `kimi-k2.6` |
| `model` | Use `kimi-for-coding` for auto-updating model, or specify a model such as `kimi-k2.6` |
| `moonshot_base_url` | `https://api.kimi.com/coding/v1` |
| `moonshot_api_key` | Coding Plan 专用 Key与按量计费接口不通用 |
| `moonshot_api_key` | Coding Plan specific key (not shared with pay-as-you-go) |
官方文档:[Key 获取](https://www.kimi.com/code/docs/)
Reference: [Key & Docs](https://www.kimi.com/code/docs/)
---
## Volcengine
```json
{
"bot_type": "openai",
"model": "Doubao-Seed-2.0-Code",
"open_ai_api_base": "https://ark.cn-beijing.volces.com/api/coding/v3",
"open_ai_api_key": "YOUR_API_KEY"
}
```
| Parameter | Description |
| --- | --- |
| `model` | `Doubao-Seed-2.0-Code`, `Doubao-Seed-2.0-pro`, `Doubao-Seed-2.0-lite`, `Doubao-Seed-Code`, `MiniMax-M2.5`, `Kimi-K2.5`, `GLM-4.7`, `DeepSeek-V3.2` |
| `open_ai_api_base` | `https://ark.cn-beijing.volces.com/api/coding/v3` |
| `open_ai_api_key` | Shared with standard API |
Reference: [Quick Start](https://www.volcengine.com/docs/82379/1928261?lang=zh)

View File

@@ -1,21 +1,21 @@
---
title: 自定义
description: 自定义厂商配置,适用于第三方 API 代理和本地模型
title: Custom
description: Custom vendor configuration for third-party API proxies and local models
---
适用于通过 OpenAI 兼容协议接入的第三方模型服务或本地部署的模型,例如:
For model services accessed via the OpenAI-compatible protocol or locally deployed models, such as:
- **第三方 API 代理**:使用统一的 API Base 调用多种模型
- **本地模型**:通过 OllamavLLMLocalAI 等工具在本地部署的模型
- **私有化部署**:企业内部部署的模型服务
- **Third-party API proxies**: call multiple models through a unified API base
- **Local models**: models deployed locally with tools like Ollama, vLLM, LocalAI
- **Private deployments**: model services deployed inside an enterprise
<Note>
与 `openai` 厂商的区别:选择自定义厂商后,通过 `/config model` 切换模型时,不会自动切换厂商类型,始终使用自定义的 API 地址。
Difference from the `openai` vendor: when a custom vendor is selected, switching models via `/config model` does not automatically switch the vendor type — the custom API address is always used.
</Note>
## 文本对话
## Text Chat
### 第三方 API 代理
### Third-party API proxy
```json
{
@@ -26,16 +26,16 @@ description: 自定义厂商配置,适用于第三方 API 代理和本地模
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `bot_type` | 必须设为 `custom` |
| `model` | 模型名称,填写代理服务支持的任意模型名 |
| `custom_api_key` | API 密钥,由代理服务提供 |
| `custom_api_base` | API 地址,由代理服务提供,需兼容 OpenAI 协议 |
| `bot_type` | Must be set to `custom` |
| `model` | Model name; any model name supported by the proxy service |
| `custom_api_key` | API key provided by the proxy service |
| `custom_api_base` | API endpoint provided by the proxy service; must be OpenAI-compatible |
### 本地模型
### Local models
本地模型通常不需要 API Key,只需填写 API Base
Local models usually do not require an API key — only the API base needs to be filled in:
```json
{
@@ -45,17 +45,17 @@ description: 自定义厂商配置,适用于第三方 API 代理和本地模
}
```
常见的本地部署工具及默认地址:
Common local deployment tools and their default endpoints:
| 工具 | 默认 API Base |
| Tool | Default API Base |
| --- | --- |
| [Ollama](https://ollama.com) | `http://localhost:11434/v1` |
| [vLLM](https://docs.vllm.ai) | `http://localhost:8000/v1` |
| [LocalAI](https://localai.io) | `http://localhost:8080/v1` |
### 切换模型
### Switching Models
自定义厂商下切换模型时,只会修改 `model`,不会改变 `bot_type` 和 API 地址:
Switching models under a custom vendor only changes `model` `bot_type` and the API endpoint remain unchanged:
```
/config model qwen3.5:27b

View File

@@ -1,11 +1,11 @@
---
title: DeepSeek
description: DeepSeek 模型配置(文本对话 + 思考模式)
description: DeepSeek model configuration (Text Chat + Thinking Mode)
---
DeepSeek 是当前 Agent 模式默认推荐的厂商之一,主打高性价比的文本对话和任务规划能力。
DeepSeek is one of the default recommended vendors in Agent mode, focused on cost-effective text chat and task planning.
## 文本对话
## Text Chat
```json
{
@@ -14,26 +14,26 @@ DeepSeek 是当前 Agent 模式默认推荐的厂商之一,主打高性价比
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `model` | 支持 `deepseek-v4-flash`(默认)、`deepseek-v4-pro` |
| `deepseek_api_key` | [DeepSeek 平台](https://platform.deepseek.com/api_keys) 创建 |
| `deepseek_api_base` | 可选,默认为 `https://api.deepseek.com/v1`,可修改为第三方代理地址 |
| `model` | Supports `deepseek-v4-flash` (Default), `deepseek-v4-pro` |
| `deepseek_api_key` | Create one on the [DeepSeek Platform](https://platform.deepseek.com/api_keys) |
| `deepseek_api_base` | Optional, defaults to `https://api.deepseek.com/v1`. Can be changed to a third-party proxy |
### 模型选择
### Model Selection
| 模型 | 适用场景 |
| Model | Use Case |
| --- | --- |
| `deepseek-v4-flash` | 默认推荐,速度快、成本低 |
| `deepseek-v4-pro` | 更智能,复杂任务效果更强 |
| `deepseek-v4-flash` | Default recommended; fast and low cost |
| `deepseek-v4-pro` | Smarter; better for complex tasks |
## 思考模式
## Thinking Mode
V4 系列(`deepseek-v4-flash` / `deepseek-v4-pro`)支持显式的「思考模式」:模型在输出最终回答前,先输出一段思维链(`reasoning_content`),从而提升答案质量。
The V4 series (`deepseek-v4-flash` / `deepseek-v4-pro`) supports an explicit "thinking mode": before producing the final answer, the model emits a chain of thought (`reasoning_content`) to improve answer quality.
### 开关
### Toggle
通过全局配置 `enable_thinking` 控制,也可在 web控制台 - 配置页面中进行切换:
Controlled by the global `enable_thinking` config, and can also be toggled from the Web Console's configuration page:
```json
{
@@ -41,12 +41,12 @@ V4 系列(`deepseek-v4-flash` / `deepseek-v4-pro`)支持显式的「思考
}
```
- `true`所有渠道下模型都会先思考再作答。Web 控制台会展示思考过程IM 渠道(微信 / 企微 / 钉钉 / 飞书)虽不展示但同样获得更好答案。
- `false`:关闭思考,响应更快,首字延迟更低。
- `true`: the model thinks before answering across all channels. The Web Console displays the thinking process; IM channels (WeChat / WeCom / DingTalk / Feishu) do not show it but still get better answers.
- `false`: thinking is disabled, responses are faster, and time-to-first-token is lower.
### 推理强度
### Reasoning Effort
思考模式下可通过 `reasoning_effort` 控制推理强度:
Under thinking mode, `reasoning_effort` controls reasoning intensity:
```json
{
@@ -55,18 +55,18 @@ V4 系列(`deepseek-v4-flash` / `deepseek-v4-pro`)支持显式的「思考
}
```
| 取值 | 适用场景 |
| Value | Use Case |
| --- | --- |
| `high`(默认) | 日常 Agent 任务,思考与速度的平衡 |
| `max` | 复杂编码、长链路规划、严格约束的任务,推理更深但耗时与输出 token 更多 |
| `high` (Default) | Day-to-day Agent tasks; balanced reasoning and speed |
| `max` | Complex coding, long-horizon planning, strictly constrained tasks; deeper reasoning but more time and output tokens |
`reasoning_effort` 仅在 `enable_thinking` `true` 时生效;模型不支持思考模式时该字段自动忽略。
`reasoning_effort` only takes effect when `enable_thinking` is `true`; it is ignored automatically when the model does not support thinking mode.
### 行为说明
### Behavior Notes
- **采样参数**:思考模式下 `temperature``top_p``presence_penalty``frequency_penalty` 会被服务端忽略不会报错CowAgent 会自动跳过传入。
- **多轮工具调用**当历史中包含工具调用时DeepSeek 要求所有 assistant 消息必须回传 `reasoning_content`CowAgent 会自动处理回传逻辑,跨轮次切换思考开关也不会出错。
- **Sampling parameters**: in thinking mode, `temperature`, `top_p`, `presence_penalty`, and `frequency_penalty` are ignored by the server (without errors). CowAgent automatically skips them.
- **Multi-turn tool calls**: when the history contains tool calls, DeepSeek requires every assistant message to include `reasoning_content`. CowAgent handles this automatically, so toggling thinking mode across turns will not cause errors.
<Tip>
默认使用 `deepseek-v4-flash`;复杂任务可使用 `deepseek-v4-pro`;需要深度推理可开启 `enable_thinking`。
`deepseek-v4-flash` is used by default; switch to `deepseek-v4-pro` for complex tasks; enable `enable_thinking` when deep reasoning is needed.
</Tip>

View File

@@ -1,15 +1,15 @@
---
title: 豆包 Doubao
description: 豆包(火山方舟)模型配置(文本 / 图像理解 / 图像生成 / 向量)
title: Doubao
description: Doubao (Volcengine Ark) model configuration (Text / Image Understanding / Image Generation / Embedding)
---
豆包火山方舟支持文本对话、图像理解、图像生成Seedream和向量能力一份 `ark_api_key` 即可启用全部能力。
Doubao (Volcengine Ark) supports text chat, image understanding, image generation (Seedream), and embedding. A single `ark_api_key` enables all capabilities.
<Tip>
通过 Web 控制台的「模型管理」页面可一站式配置以下全部能力,无需手动改配置文件。
All capabilities below can be configured in one place via the "Model Management" page in the Web Console, with no need to manually edit the configuration file.
</Tip>
## 文本对话
## Text Chat
```json
{
@@ -18,17 +18,17 @@ description: 豆包(火山方舟)模型配置(文本 / 图像理解 / 图
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `model` | 可填 `doubao-seed-2-0-pro-260215``doubao-seed-2-0-code-preview-260215``doubao-seed-2-0-lite-260215` |
| `ark_api_key` | 在 [火山方舟控制台](https://console.volcengine.com/ark/region:ark+cn-beijing/apikey) 创建 |
| `ark_base_url` | 可选,默认为 `https://ark.cn-beijing.volces.com/api/v3` |
| `model` | Can be `doubao-seed-2-0-pro-260215`, `doubao-seed-2-0-code-preview-260215`, `doubao-seed-2-0-lite-260215`, etc. |
| `ark_api_key` | Create one in the [Volcengine Ark Console](https://console.volcengine.com/ark/region:ark+cn-beijing/apikey) |
| `ark_base_url` | Optional, defaults to `https://ark.cn-beijing.volces.com/api/v3` |
## 图像理解
## Image Understanding
配置 `ark_api_key` Agent Vision 工具会自动使用 `doubao-seed-2-0-pro-260215` 识别图像,无需额外配置。
Once `ark_api_key` is configured, the Agent's Vision tool automatically uses `doubao-seed-2-0-pro-260215` to recognize images, with no extra setup required.
如需手动指定 Vision 模型:
To manually specify a Vision model:
```json
{
@@ -40,7 +40,7 @@ description: 豆包(火山方舟)模型配置(文本 / 图像理解 / 图
}
```
## 图像生成
## Image Generation
```json
{
@@ -52,9 +52,9 @@ description: 豆包(火山方舟)模型配置(文本 / 图像理解 / 图
}
```
可选模型:`seedream-5.0-lite``seedream-4.5`
Available models: `seedream-5.0-lite`, `seedream-4.5`.
## 向量
## Embedding
```json
{
@@ -63,4 +63,4 @@ description: 豆包(火山方舟)模型配置(文本 / 图像理解 / 图
}
```
默认模型 `doubao-embedding-vision-251215`(多模态 embedding可在配置文件中通过 `embedding_dimensions` 指定 1024 或 2048 维。修改 embedding 后需执行 `/memory rebuild-index` 命令重建索引。
The default model is `doubao-embedding-vision-251215` (multimodal embedding); the dimension (1024 or 2048) can be set via `embedding_dimensions` in the configuration file. After changing the embedding, run `/memory rebuild-index` to rebuild the index.

View File

@@ -1,15 +1,15 @@
---
title: Gemini
description: Google Gemini 模型配置(文本对话 + 图像理解 + 图像生成)
description: Google Gemini model configuration (Text Chat + Image Understanding + Image Generation)
---
Google Gemini 支持文本对话、图像理解和图像生成Nano Banana 系列),一个 `gemini_api_key` 即可启用全部能力。
Google Gemini supports text chat, image understanding, and image generation (Nano Banana series). A single `gemini_api_key` enables all capabilities.
<Tip>
通过 Web 控制台的「模型管理」页面可一站式配置以下全部能力,无需手动改配置文件。
All capabilities below can be configured in one place via the "Model Management" page in the Web Console, with no need to manually edit the configuration file.
</Tip>
## 文本对话
## Text Chat
```json
{
@@ -18,17 +18,17 @@ Google Gemini 支持文本对话、图像理解和图像生成Nano Banana 系
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `model` | 推荐 `gemini-3.5-flash`,亦支持 `gemini-3.1-pro-preview``gemini-3.1-flash-lite-preview``gemini-3-flash-preview``gemini-3-pro-preview` 等,参考 [官方文档](https://ai.google.dev/gemini-api/docs/models) |
| `gemini_api_key` | [Google AI Studio](https://aistudio.google.com/app/apikey) 创建 |
| `gemini_api_base` | 可选,默认为 `https://generativelanguage.googleapis.com`,可改为第三方代理 |
| `model` | Recommended: `gemini-3.5-flash`; also supports `gemini-3.1-pro-preview`, `gemini-3.1-flash-lite-preview`, `gemini-3-flash-preview`, `gemini-3-pro-preview`, etc. See [official docs](https://ai.google.dev/gemini-api/docs/models) |
| `gemini_api_key` | Create one in [Google AI Studio](https://aistudio.google.com/app/apikey) |
| `gemini_api_base` | Optional, defaults to `https://generativelanguage.googleapis.com`. Can be changed to a third-party proxy |
## 图像理解
## Image Understanding
Gemini 全系列模型均原生支持视觉,配置 `gemini_api_key` 后 Agent 的 Vision 工具会自动使用主模型识别图像,无需额外配置。
All Gemini models natively support vision. Once `gemini_api_key` is configured, the Agent's Vision tool automatically uses the main model to recognize images, with no extra setup required.
如需手动指定 Vision 模型:
To manually specify a Vision model:
```json
{
@@ -40,7 +40,7 @@ Gemini 全系列模型均原生支持视觉,配置 `gemini_api_key` 后 Agent
}
```
## 图像生成
## Image Generation
```json
{
@@ -52,7 +52,7 @@ Gemini 全系列模型均原生支持视觉,配置 `gemini_api_key` 后 Agent
}
```
| 模型 ID | 别名 |
| Model ID | Alias |
| --- | --- |
| `gemini-3.1-flash-image-preview` | Nano Banana 2 |
| `gemini-3-pro-image-preview` | Nano Banana Pro |

View File

@@ -1,15 +1,15 @@
---
title: 智谱 GLM
description: 智谱 AI GLM 模型配置(文本 / 图像理解 / 语音识别 / 向量)
title: GLM
description: Zhipu AI GLM model configuration (Text / Image Understanding / Speech-to-Text / Embedding)
---
智谱 AI 支持文本对话、图像理解、语音识别ASR和向量Embedding一份 `zhipu_ai_api_key` 即可启用全部能力。
Zhipu AI supports text chat, image understanding, speech-to-text (ASR), and embedding. A single `zhipu_ai_api_key` enables all capabilities.
<Tip>
通过 Web 控制台的「模型管理」页面可一站式配置以下全部能力,无需手动改配置文件。
All capabilities below can be configured in one place via the "Model Management" page in the Web Console, with no need to manually edit the configuration file.
</Tip>
## 文本对话
## Text Chat
```json
{
@@ -18,17 +18,17 @@ description: 智谱 AI GLM 模型配置(文本 / 图像理解 / 语音识别 /
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `model` | 可填 `glm-5.1``glm-5-turbo``glm-5``glm-4.7``glm-4-plus``glm-4-flash``glm-4-air` 等,参考 [模型编码](https://bigmodel.cn/dev/api/normal-model/glm-4) |
| `zhipu_ai_api_key` | 在 [智谱 AI 控制台](https://www.bigmodel.cn/usercenter/proj-mgmt/apikeys) 创建 |
| `zhipu_ai_api_base` | 可选,默认为 `https://open.bigmodel.cn/api/paas/v4` |
| `model` | Can be `glm-5.1`, `glm-5-turbo`, `glm-5`, `glm-4.7`, `glm-4-plus`, `glm-4-flash`, `glm-4-air`, etc. See [model codes](https://bigmodel.cn/dev/api/normal-model/glm-4) |
| `zhipu_ai_api_key` | Create one in the [Zhipu AI Console](https://www.bigmodel.cn/usercenter/proj-mgmt/apikeys) |
| `zhipu_ai_api_base` | Optional, defaults to `https://open.bigmodel.cn/api/paas/v4` |
## 图像理解
## Image Understanding
智谱 chat 系列模型(`glm-5.1``glm-5-turbo` 等)不支持视觉,视觉调用统一路由到 `glm-5v-turbo`。配置 `zhipu_ai_api_key` Agent Vision 工具会自动使用该模型,无需在配置文件中显式指定。
Zhipu's chat models (`glm-5.1`, `glm-5-turbo`, etc.) do not support vision; vision calls are uniformly routed to `glm-5v-turbo`. Once `zhipu_ai_api_key` is configured, the Agent's Vision tool automatically uses this model, with no need to specify it explicitly in the configuration file.
## 语音识别
## Speech-to-Text (ASR)
```json
{
@@ -37,14 +37,14 @@ description: 智谱 AI GLM 模型配置(文本 / 图像理解 / 语音识别 /
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `voice_to_text` | 设为 `zhipu` 启用智谱 ASR |
| `voice_to_text_model` | 可选,默认 `glm-asr-2512` |
| `voice_to_text` | Set to `zhipu` to enable Zhipu ASR |
| `voice_to_text_model` | Optional, defaults to `glm-asr-2512` |
凭证自动复用 `zhipu_ai_api_key`。语音文件建议小于 25MB超大文件可能被服务端拒绝。
Credentials are automatically reused from `zhipu_ai_api_key`. Audio files should be smaller than 25MB; oversized files may be rejected by the server.
## 向量
## Embedding
```json
{
@@ -53,4 +53,4 @@ description: 智谱 AI GLM 模型配置(文本 / 图像理解 / 语音识别 /
}
```
可选模型:`embedding-3``embedding-2`。修改 embedding 后需执行 `/memory rebuild-index` 命令重建索引。
Available models: `embedding-3`, `embedding-2`. After changing the embedding, run `/memory rebuild-index` to rebuild the index.

View File

@@ -1,40 +1,38 @@
---
title: 模型概览
description: CowAgent 支持的模型厂商及能力矩阵
title: Models Overview
description: Model vendors supported by CowAgent and their capability matrix
---
CowAgent 支持国内外主流厂商的大语言模型,模型接口实现在项目的 `models/` 目录下。除文本对话外,部分厂商还提供视觉理解、图像生成、语音识别、语音合成、向量等能力,可在 Agent 流程中按需调用。
CowAgent supports a wide range of mainstream large language models. Model interfaces live under the project's `models/` directory. Beyond text chat, several vendors also provide vision understanding, image generation, speech-to-text, text-to-speech, and embeddings — all of which can be invoked on demand in the Agent flow.
## Capability Matrix
## 模型能力总览
A snapshot of each vendor's capabilities. "Text" refers to the main chat model; the remaining columns show which Agent capabilities the vendor can power.
各厂商提供的能力一览。「文本」指主对话模型,其余列表示该厂商可承担对应 Agent 能力。
| 厂商 | 代表模型 | 文本 | 图像理解 | 图像生成 | 语音识别 | 语音合成 | 向量 |
| Vendor | Representative Models | Text | Vision | Image Gen | STT | TTS | Embedding |
| --- | --- | :-: | :-: | :-: | :-: | :-: | :-: |
| [DeepSeek](/models/deepseek) | deepseek-v4-flash / pro | ✅ | | | | | |
| [MiniMax](/models/minimax) | MiniMax-M2.7 | ✅ | ✅ | ✅ | | ✅ | |
| [Claude](/models/claude) | claude-opus-4-8 | ✅ | ✅ | | | | |
| [Gemini](/models/gemini) | gemini-3.5-flash | ✅ | ✅ | ✅ | | | |
| [OpenAI](/models/openai) | gpt-5.5、o 系列 | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| [智谱 GLM](/models/glm) | glm-5.1glm-5v-turbo | ✅ | ✅ | | ✅ | | ✅ |
| [通义千问](/models/qwen) | qwen3.7-max | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| [豆包 Doubao](/models/doubao) | doubao-seed-2.0 系列 | ✅ | ✅ | ✅ | | | ✅ |
| [OpenAI](/models/openai) | gpt-5.5, o-series | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| [GLM](/models/glm) | glm-5.1, glm-5v-turbo | ✅ | ✅ | | ✅ | | ✅ |
| [Qwen](/models/qwen) | qwen3.7-max | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| [Doubao](/models/doubao) | doubao-seed-2.0 series | ✅ | ✅ | ✅ | | | ✅ |
| [Kimi](/models/kimi) | kimi-k2.6 | ✅ | ✅ | | | | |
| [百度千帆](/models/qianfan) | ernie-5.1 | ✅ | ✅ | | | | |
| [小米 MiMo](/models/mimo) | mimo-v2.5-pro / v2.5 | ✅ | ✅ | | | ✅ | |
| [LinkAI](/models/linkai) | 多厂商 100+ 模型统一接入 | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| [自定义](/models/custom) |本地模型 / 三方代理 | ✅ | | | | | |
| [ERNIE](/models/qianfan) | ernie-5.1 | ✅ | ✅ | | | | |
| [MiMo](/models/mimo) | mimo-v2.5-pro / v2.5 | ✅ | ✅ | | | ✅ | |
| [LinkAI](/models/linkai) | 100+ models from multiple vendors | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| [Custom](/models/custom) | Local models / third-party proxies | ✅ | | | | | |
<Tip>
Web 控制台中各项能力(视觉 / 图像 / 语音识别 / 语音合成 / 向量 / 网络搜索)均可独立配置厂商与模型,互相之间不强制绑定。
Every capability in the Web console (Vision / Image / STT / TTS / Embedding / Web Search) can be configured independently with its own vendor and model — there is no forced binding between them.
</Tip>
## How to Configure
## 配置方式
**Option 1 (recommended):** Manage models and capabilities online via the [Web console](/channels/web), with no need to edit the configuration file:
**方式一(推荐):** 通过 [Web 控制台](/channels/web) 在线管理模型与各项能力,无需手动编辑配置文件:
<img width="900" src="https://cdn.jsdelivr.net/gh/zhayujie/cowagent-assets@main/screenshots/en/web-console-models-config.png" />
<img width="900" src="https://cdn.link-ai.tech/doc/20260521212527.png" />
**方式二:** 手动编辑 `config.json`,根据所选模型填写对应的模型名称和 API Key。每个模型也支持 OpenAI 兼容方式接入,将 `bot_type` 设为 `openai`,配置 `open_ai_api_base` 和 `open_ai_api_key` 即可。
**Option 2:** Edit `config.json` manually and fill in the model name and API key for the selected vendor. Every model also supports OpenAI-compatible access — just set `bot_type` to `openai` and configure `open_ai_api_base` and `open_ai_api_key`.

View File

@@ -1,15 +1,15 @@
---
title: Kimi
description: KimiMoonshot)模型配置(文本对话 + 图像理解)
description: Kimi (Moonshot) model configuration (Text Chat + Image Understanding)
---
Kimi 由 Moonshot 提供,支持文本对话与图像理解,`kimi-k2.x` 系列原生支持视觉。
Kimi is provided by Moonshot and supports both text chat and image understanding. The `kimi-k2.x` series natively supports vision.
<Tip>
通过 Web 控制台的「模型管理」页面可一站式配置以下全部能力,无需手动改配置文件。
All capabilities below can be configured in one place via the "Model Management" page in the Web Console, with no need to manually edit the configuration file.
</Tip>
## 文本对话
## Text Chat
```json
{
@@ -18,17 +18,17 @@ Kimi 由 Moonshot 提供,支持文本对话与图像理解,`kimi-k2.x` 系
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `model` | 可填 `kimi-k2.6``kimi-k2.5``kimi-k2``moonshot-v1-8k``moonshot-v1-32k``moonshot-v1-128k` |
| `moonshot_api_key` | [Moonshot 控制台](https://platform.moonshot.cn/console/api-keys) 创建 |
| `moonshot_base_url` | 可选,默认为 `https://api.moonshot.cn/v1` |
| `model` | Can be `kimi-k2.6`, `kimi-k2.5`, `kimi-k2`, `moonshot-v1-8k`, `moonshot-v1-32k`, `moonshot-v1-128k` |
| `moonshot_api_key` | Create one in the [Moonshot Console](https://platform.moonshot.cn/console/api-keys) |
| `moonshot_base_url` | Optional, defaults to `https://api.moonshot.cn/v1` |
## 图像理解
## Image Understanding
配置 `moonshot_api_key` Agent Vision 工具会自动使用 `kimi-k2.6` 识别图像,无需额外配置。
Once `moonshot_api_key` is configured, the Agent's Vision tool automatically uses `kimi-k2.6` to recognize images, with no extra setup required.
如需手动指定 Vision 模型:
To manually specify a Vision model:
```json
{

View File

@@ -1,15 +1,15 @@
---
title: LinkAI
description: 通过 LinkAI 平台统一接入文本、视觉、图像、语音与向量能力
description: Access text, vision, image, speech, and embedding capabilities through the LinkAI platform
---
通过一份 `linkai_api_key` 即可访问 OpenAIClaudeGeminiDeepSeekMiniMaxQwenKimi、豆包 等主流厂商的全部能力。
A single `linkai_api_key` gives you access to all capabilities of mainstream vendors such as OpenAI, Claude, Gemini, DeepSeek, MiniMax, Qwen, Kimi, and Doubao.
<Tip>
通过 Web 控制台的「模型管理」页面可一站式配置以下全部能力,无需手动改配置文件。
All capabilities below can be configured in one place via the "Model Management" page in the Web Console, with no need to manually edit the configuration file.
</Tip>
## 文本对话
## Text Chat
```json
{
@@ -18,17 +18,17 @@ description: 通过 LinkAI 平台统一接入文本、视觉、图像、语音
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `use_linkai` | 设为 `true` 启用 |
| `linkai_api_key` | 在 [控制台](https://link-ai.tech/console/interface) 创建 |
| `model` | 可填写 [模型列表](https://link-ai.tech/console/models) 中任意编码 |
| `use_linkai` | Set to `true` to enable |
| `linkai_api_key` | Create one in the [Console](https://link-ai.tech/console/interface) |
| `model` | Can be any code from the [model list](https://link-ai.tech/console/models) |
前往 [模型服务](https://link-ai.tech/console/models) 了解更多。
See [Model Service](https://link-ai.tech/console/models) for more.
## 图像理解
## Image Understanding
配置完成后 Agent 的 Vision 工具会自动调用网关上的多模态模型,无需额外配置。如需手动指定 Vision 模型:
Once configured, the Agent's Vision tool automatically calls multimodal models via the gateway, with no extra setup required. To manually specify a Vision model:
```json
{
@@ -40,9 +40,9 @@ description: 通过 LinkAI 平台统一接入文本、视觉、图像、语音
}
```
可选模型:`gpt-4.1-mini``gpt-5.4-mini``qwen3.6-plus``doubao-seed-2-0-pro-260215``kimi-k2.6``claude-sonnet-4-6``gemini-3.1-flash-lite-preview` 等。
Available models: `gpt-4.1-mini`, `gpt-5.4-mini`, `qwen3.6-plus`, `doubao-seed-2-0-pro-260215`, `kimi-k2.6`, `claude-sonnet-4-6`, `gemini-3.1-flash-lite-preview`, etc.
## 图像生成
## Image Generation
```json
{
@@ -54,14 +54,14 @@ description: 通过 LinkAI 平台统一接入文本、视觉、图像、语音
}
```
| 模型 ID | 别名 |
| Model ID | Alias |
| --- | --- |
| `gpt-image-2` | OpenAI |
| `gemini-3.1-flash-image-preview` | Nano Banana 2 |
| `gemini-3-pro-image-preview` | Nano Banana Pro |
| `seedream-5.0-lite` | 字节豆包 Seedream |
| `seedream-5.0-lite` | ByteDance Doubao Seedream |
## 语音识别
## Speech-to-Text (ASR)
```json
{
@@ -69,11 +69,11 @@ description: 通过 LinkAI 平台统一接入文本、视觉、图像、语音
}
```
ASR 固定使用 Whisper,凭证自动复用 `linkai_api_key`
ASR uses Whisper by default; credentials are automatically reused from `linkai_api_key`.
## 语音合成
## Text-to-Speech (TTS)
语音合成网关下支持多个底层 TTS 引擎,按 `text_to_voice_model` 选择引擎,音色随引擎切换。
The TTS gateway supports multiple underlying engines. The engine is selected by `text_to_voice_model`, and the available voices change with the engine.
```json
{
@@ -83,15 +83,15 @@ ASR 固定使用 Whisper凭证自动复用 `linkai_api_key`。
}
```
| `text_to_voice_model` | 引擎说明 |
| `text_to_voice_model` | Engine |
| --- | --- |
| `tts-1` | OpenAI · 多语种通用(音色 `alloy` / `nova` / `echo` 等) |
| `doubao` | 字节豆包 · 中文音色丰富 |
| `baidu` | 百度 · 中文主播音色 |
| `tts-1` | OpenAI · Multi-language (voices like `alloy` / `nova` / `echo`, etc.) |
| `doubao` | ByteDance Doubao · Rich Chinese voices |
| `baidu` | Baidu · Chinese broadcaster voices |
不同引擎对应的音色不同,建议在 Web 控制台「模型管理 → 语音合成」中可视化选择。
Voices differ by engine; we recommend selecting them visually in the Web Console under "Model Management → Text-to-Speech".
## 向量
## Embedding
```json
{
@@ -100,4 +100,4 @@ ASR 固定使用 Whisper凭证自动复用 `linkai_api_key`。
}
```
默认模型 `text-embedding-3-small`OpenAI 兼容)。修改 embedding 后需执行 `/memory rebuild-index` 命令重建索引。
The default model is `text-embedding-3-small` (OpenAI-compatible). After changing the embedding, run `/memory rebuild-index` to rebuild the index.

View File

@@ -1,15 +1,15 @@
---
title: 小米 MiMo
description: 小米 MiMo 模型配置(文本对话 + 图像理解 + 语音合成)
title: MiMo
description: Xiaomi MiMo model configuration (Text Chat + Image Understanding + Text-to-Speech)
---
小米 MiMo 是原生全模态大模型,单 `mimo_api_key` 即可同时启用文本对话、图像理解与语音合成。
Xiaomi MiMo is a native omni-modal large model. A single `mimo_api_key` enables text chat, image understanding, and text-to-speech all at once.
<Tip>
通过 Web 控制台的「模型管理」页面可一站式配置以下全部能力,无需手动改配置文件。
All capabilities below can be configured in one place via the "Model Management" page in the Web Console — no need to manually edit the configuration file.
</Tip>
## 文本对话
## Text Chat
```json
{
@@ -19,24 +19,24 @@ description: 小米 MiMo 模型配置(文本对话 + 图像理解 + 语音合
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `model` | 默认推荐 `mimo-v2.5-pro`,也可使用 `mimo-v2.5` |
| `mimo_api_key` | 在 [MiMo 开放平台](https://platform.xiaomimimo.com/console/api-keys) 创建 |
| `mimo_api_base` | 可选,默认为 `https://api.xiaomimimo.com/v1` |
| `model` | Default recommendation: `mimo-v2.5-pro`; `mimo-v2.5` is also supported |
| `mimo_api_key` | Create one in the [MiMo Open Platform](https://platform.xiaomimimo.com/console/api-keys) |
| `mimo_api_base` | Optional, defaults to `https://api.xiaomimimo.com/v1` |
### 模型选择
### Model Selection
| 模型 | 适用场景 |
| Model | Use Case |
| --- | --- |
| `mimo-v2.5-pro` | 旗舰,原生全模态 + Agent 能力,最高 100 万 tokens 上下文 |
| `mimo-v2.5` | 综合版,原生全模态(文本 / 图像 / 视频 / 音频) |
| `mimo-v2.5-pro` | Flagship: native omni-modal + Agent capability, up to 1M tokens context |
| `mimo-v2.5` | General-purpose, native omni-modal (text / image / video / audio) |
## 思考模式
## Thinking Mode
MiMo V2.5 系列默认开启「思考模式」:模型在输出最终回答前会先输出 `reasoning_content`(思维链),提升复杂任务表现。
The MiMo V2.5 series enables "thinking mode" by default: the model emits `reasoning_content` (chain-of-thought) before the final answer, improving performance on complex tasks.
通过全局配置 `enable_thinking` 控制是否展示(也可在 Web 控制台 - 配置页面切换):
Use the global `enable_thinking` flag to toggle visibility (also switchable from the Web Console settings):
```json
{
@@ -44,14 +44,14 @@ MiMo V2.5 系列默认开启「思考模式」:模型在输出最终回答前
}
```
## 图像理解
## Image Understanding
配置 `mimo_api_key` 后,Agent Vision 工具可以自动使用 MiMo 视觉模型:
Once `mimo_api_key` is configured, the Agent's Vision tool can automatically use MiMo's vision models:
- 当主模型本身是多模态时(`mimo-v2.5-pro` / `mimo-v2.5`),直接由主模型识别图像,无需额外配置
- 当主模型是其他厂商时Vision 工具会根据顺序自动 fallback `mimo-v2.5-pro`
- When the main model itself is multimodal (`mimo-v2.5-pro` / `mimo-v2.5`), images are handled directly by the main model with no extra setup.
- When the main model belongs to another vendor, the Vision tool falls back to `mimo-v2.5-pro` in order.
如需手动指定 Vision 模型,可在配置文件中显式配置:
To force a specific Vision model, set it explicitly in the configuration:
```json
{
@@ -64,7 +64,7 @@ MiMo V2.5 系列默认开启「思考模式」:模型在输出最终回答前
}
```
## 语音合成
## Text-to-Speech (TTS)
```json
{
@@ -74,62 +74,63 @@ MiMo V2.5 系列默认开启「思考模式」:模型在输出最终回答前
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `text_to_voice_model` | 当前仅支持 `mimo-v2.5-tts`(预置音色 + 唱歌模式) |
| `tts_voice_id` | 预置音色名(中文音色直接使用中文名作为 ID |
| `text_to_voice_model` | Currently only `mimo-v2.5-tts` (preset voices + singing mode) |
| `tts_voice_id` | Preset voice name (Chinese voice IDs use the Chinese name directly) |
### 预置音色
### Preset Voices
| 音色 ID | 说明 |
| Voice ID | Description |
| --- | --- |
| `冰糖` | 中文 · 女声(默认) |
| `茉莉` | 中文 · 女声 |
| `苏打` | 中文 · 男声 |
| `白桦` | 中文 · 男声 |
| `Mia` | 英文 · 女声 |
| `Chloe` | 英文 · 女声 |
| `Milo` | 英文 · 男声 |
| `Dean` | 英文 · 男声 |
| `Mia` | English · Female |
| `Chloe` | English · Female |
| `Milo` | English · Male |
| `Dean` | English · Male |
| `冰糖` | Chinese · Female (default) |
| `茉莉` | Chinese · Female |
| `苏打` | Chinese · Male |
| `白桦` | Chinese · Male |
也可在 Web 控制台的「模型管理 → 语音合成」下拉框中可视化选择。
### 风格控制
You can also pick a voice visually from the Web Console under "Model Management → Text-to-Speech".
MiMo TTS 支持在合成文本中嵌入 **音频标签** 来控制情绪、语调、方言、角色甚至唱歌。标签需出现在 **最终被合成为语音的文本(即 Agent 回复内容)** 中,整体风格标签写在开头:
### Style Control
MiMo TTS supports embedding **audio tags** in the synthesis text to control emotion, tone, dialect, persona, and even singing. Tags must appear in the **text that will be synthesized to speech (i.e. the Agent's reply)**, with the overall style tag placed at the very beginning:
```
(风格)待合成内容
(style)content-to-synthesize
```
支持半角 `()`、全角 `` 或 `[]` 三种括号。常见风格示例:
Half-width `()`, full-width ``, and `[]` brackets are all accepted. Both Chinese and English style descriptors work — pick whichever language expresses the timbre most precisely. Common examples:
| 类型 | 示例标签 |
| Category | Example tags |
| --- | --- |
| 基础情绪 | `开心` `悲伤` `愤怒` `恐惧` `惊讶` `兴奋` `委屈` `平静` `冷漠` |
| 复合情绪 | `怅然` `欣慰` `无奈` `愧疚` `释然` `忐忑` `动情` |
| 整体语调 | `温柔` `高冷` `活泼` `严肃` `慵懒` `俏皮` `深沉` `干练` `凌厉` |
| 音色定位 | `磁性` `醇厚` `清亮` `空灵` `稚嫩` `苍老` `甜美` `沙哑` |
| 人设腔调 | `夹子音` `御姐音` `正太音` `大叔音` `台湾腔` |
| 方言 | `东北话` `四川话` `河南话` `粤语` |
| 角色扮演 | `孙悟空` `林黛玉` |
| 唱歌 | `唱歌`(等价于 `sing` / `singing` |
| Basic emotions | `happy` `sad` `angry` `fear` `surprised` `excited` `aggrieved` `calm` `indifferent` |
| Compound emotions | `wistful` `relieved` `helpless` `guilty` `at ease` `uneasy` `touched` |
| Overall tone | `gentle` `aloof` `lively` `serious` `languid` `playful` `deep` `sharp` `cutting` |
| Voice character | `magnetic` `mellow` `bright` `ethereal` `childlike` `aged` `sweet` `husky` |
| Persona | `squeaky` `mature lady` `young boy` `uncle` `Taiwanese accent` |
| Dialect | `Northeastern` `Sichuan` `Henan` `Cantonese` |
| Role-play | `Sun Wukong` `Lin Daiyu` |
| Singing | `sing` / `singing` |
示例:
Examples:
- (磁性)夜已经深了,城市还在呼吸。
- (东北话)哎呀妈呀,这天儿也忒冷了吧!
- (粤语)呢个真係好正啊!
- (唱歌)原谅我这一生不羁放纵爱自由…
- `(magnetic)The night is deep, and the city is still breathing.`
- `(gentle)Take a breath. You've got this.`
- `(serious)This is the final warning before the system reboots.`
- `(singing)Oh, when the saints go marching in…`
也可以在文本任意位置插入细粒度音频标签来控制呼吸、笑声、停顿等,例如:
You can also insert fine-grained audio tags at any position in the text to control breathing, laughter, pauses, etc. For example:
```
(紧张,深呼吸)呼……冷静,冷静。(语速加快)自我介绍我背了五十遍了,应该没问题。
(nervous, deep breath) Phew… stay calm, stay calm. (faster pace) I've rehearsed this intro fifty times, it'll be fine.
```
完整标签列表参见 [MiMo 语音合成文档](https://platform.xiaomimimo.com/docs/zh-CN/usage-guide/speech-synthesis-v2.5)
See the [MiMo speech synthesis documentation](https://platform.xiaomimimo.com/docs/zh-CN/usage-guide/speech-synthesis-v2.5) for the full tag list.
<Tip>
CowAgent 在调用 TTS 时会将 Agent 的回复原文(含 `(...)` 标签)直接送入 MiMo 合成。你可以在人设 / 系统提示词里要求模型「在回复开头用 `(风格)` 标签控制语气」,即可让 IM 渠道(微信 / 飞书 / 钉钉 / 企微)的语音回复带上情绪、方言、唱歌等效果。
When CowAgent calls TTS, the Agent's reply text (including any `(...)` tags) is forwarded directly to MiMo for synthesis. Tell the model in its persona / system prompt to "prefix replies with a `(style)` tag to control the tone", and IM channels (WeChat / Feishu / DingTalk / WeCom) will play voice replies with the corresponding emotion, dialect, or even singing.
</Tip>

View File

@@ -1,15 +1,15 @@
---
title: MiniMax
description: MiniMax 模型配置(文本 / 图像理解 / 图像生成 / 语音合成)
description: MiniMax model configuration (Text / Image Understanding / Image Generation / Text-to-Speech)
---
MiniMax 支持文本对话、图像理解、图像生成与语音合成,一份 `minimax_api_key` 即可启用全部能力。
MiniMax supports text chat, image understanding, image generation, and text-to-speech. A single `minimax_api_key` enables all capabilities.
<Tip>
通过 Web 控制台的「模型管理」页面可一站式配置以下全部能力,无需手动改配置文件。
All capabilities below can be configured in one place via the "Model Management" page in the Web Console, with no need to manually edit the configuration file.
</Tip>
## 文本对话
## Text Chat
```json
{
@@ -18,16 +18,16 @@ MiniMax 支持文本对话、图像理解、图像生成与语音合成,一份
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `model` | 可填 `MiniMax-M2.7``MiniMax-M2.7-highspeed``MiniMax-M2.5``MiniMax-M2.1``MiniMax-M2.1-lightning``MiniMax-M2` |
| `minimax_api_key` | [MiniMax 控制台](https://platform.minimaxi.com/user-center/basic-information/interface-key) 创建 |
| `model` | Can be `MiniMax-M2.7`, `MiniMax-M2.7-highspeed`, `MiniMax-M2.5`, `MiniMax-M2.1`, `MiniMax-M2.1-lightning`, `MiniMax-M2`, etc. |
| `minimax_api_key` | Create one in the [MiniMax Console](https://platform.minimaxi.com/user-center/basic-information/interface-key) |
## 图像理解
## Image Understanding
MiniMax M2.x 系列 chat 模型本身不支持视觉,视觉调用统一路由到 `MiniMax-Text-01`。配置 `minimax_api_key` Agent Vision 工具会自动使用该模型,无需在配置文件中显式指定。
MiniMax's M2.x chat models do not support vision natively; vision calls are uniformly routed to `MiniMax-Text-01`. Once `minimax_api_key` is configured, the Agent's Vision tool automatically uses this model, with no need to specify it explicitly in the configuration file.
## 图像生成
## Image Generation
```json
{
@@ -39,9 +39,9 @@ MiniMax 的 M2.x 系列 chat 模型本身不支持视觉,视觉调用统一路
}
```
可选模型:`image-01`
Available models: `image-01`.
## 语音合成
## Text-to-Speech (TTS)
```json
{
@@ -51,21 +51,21 @@ MiniMax 的 M2.x 系列 chat 模型本身不支持视觉,视觉调用统一路
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `text_to_voice_model` | `speech-2.8-hd`(情绪渲染、自然听感)、`speech-2.8-turbo`(极速)、`speech-2.6-hd``speech-2.6-turbo` |
| `tts_voice_id` | 音色 ID支持中文 / 粤语 / 英 / 日 / 韩,共 70+ 种 |
| `text_to_voice_model` | `speech-2.8-hd` (emotional rendering, natural sound), `speech-2.8-turbo` (ultra-fast), `speech-2.6-hd`, `speech-2.6-turbo` |
| `tts_voice_id` | Voice ID; supports Chinese / Cantonese / English / Japanese / Korean — 70+ voices in total |
常用音色示例:
Common voice examples:
| 音色 ID | 说明 |
| Voice ID | Description |
| --- | --- |
| `female-shaonv` | 中文 · 少女(女) |
| `female-yujie` | 中文 · 御姐(女) |
| `female-tianmei` | 中文 · 甜美女性(女) |
| `male-qn-jingying` | 中文 · 精英青年(男) |
| `male-qn-badao` | 中文 · 霸道青年(男) |
| `Cantonese_GentleLady` | 粤语 · 温柔女声 |
| `English_Graceful_Lady` | 英文 · Graceful Lady |
| `female-shaonv` | Chinese · Young Girl (Female) |
| `female-yujie` | Chinese · Mature Lady (Female) |
| `female-tianmei` | Chinese · Sweet Female (Female) |
| `male-qn-jingying` | Chinese · Elite Youth (Male) |
| `male-qn-badao` | Chinese · Dominant Youth (Male) |
| `Cantonese_GentleLady` | Cantonese · Gentle Female Voice |
| `English_Graceful_Lady` | English · Graceful Lady |
完整音色(中文 / 粤语 / 英 / 日 / 韩共 70+ 种)可参考 [系统音色列表](https://platform.minimaxi.com/docs/faq/system-voice-id),也可在 Web 控制台的「模型管理 → 语音合成」下拉框中可视化选择。
For the full voice list (70+ voices across Chinese / Cantonese / English / Japanese / Korean), see the [system voice list](https://platform.minimaxi.com/docs/faq/system-voice-id), or select visually in the Web Console under "Model Management → Text-to-Speech".

View File

@@ -1,16 +1,16 @@
---
title: OpenAI
description: OpenAI 模型配置(文本 / 视觉 / 图像 / 语音 / 向量)
description: OpenAI model configuration (Text / Vision / Image / Speech / Embedding)
---
OpenAI 是覆盖最完整的厂商可同时承担文本对话、视觉理解、图像生成、语音识别ASR、语音合成TTS和向量Embedding能力。一份 `open_ai_api_key` 即可让 Agent 用到全部能力。
OpenAI offers the most complete coverage and can simultaneously serve text chat, vision understanding, image generation, speech-to-text (ASR), text-to-speech (TTS), and embedding. A single `open_ai_api_key` lets the Agent use all of these capabilities.
<Tip>
通过 Web 控制台的「模型管理」页面可一站式配置以下全部能力,无需手动改配置文件。
All capabilities below can be configured in one place via the "Model Management" page in the Web Console, with no need to manually edit the configuration file.
</Tip>
## 文本对话
## Text Chat
```json
{
@@ -20,16 +20,16 @@ OpenAI 是覆盖最完整的厂商,可同时承担文本对话、视觉理解
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `model` | OpenAI 接口的 [model 参数](https://platform.openai.com/docs/models) 一致,支持 `gpt-5.5``gpt-5.4``gpt-5.4-mini``gpt-5.4-nano``gpt-5` 系列、`gpt-4.1`、o 系列等Agent 模式默认 `gpt-5.5`,追求性价比可改为 `gpt-5.4` |
| `open_ai_api_key` | [OpenAI 平台](https://platform.openai.com/api-keys) 创建 |
| `open_ai_api_base` | 可选,修改可接入第三方代理 |
| `bot_type` | 使用 OpenAI 官方模型时无需填写;通过兼容协议接入厂商模型时需设为 `openai` |
| `model` | Same as OpenAI's [model parameter](https://platform.openai.com/docs/models); supports `gpt-5.5`, `gpt-5.4`, `gpt-5.4-mini`, `gpt-5.4-nano`, the `gpt-5` series, `gpt-4.1`, the o-series, etc. Agent mode defaults to `gpt-5.5`; use `gpt-5.4` for better cost-efficiency |
| `open_ai_api_key` | Create one on the [OpenAI Platform](https://platform.openai.com/api-keys) |
| `open_ai_api_base` | Optional; change it to access a third-party proxy |
| `bot_type` | Not required when using OpenAI's official models; set to `openai` when accessing other vendors via the compatible protocol |
## 图像理解
## Image Understanding
`gpt-5.5``gpt-5.4``gpt-4o``gpt-4.1` 等 OpenAI 模型均原生支持视觉,配置 `open_ai_api_key` Agent Vision 工具会自动使用主模型识别图像。若主模型不支持视觉或希望显式指定,可在配置文件中配置:
OpenAI models like `gpt-5.5`, `gpt-5.4`, `gpt-4o`, and `gpt-4.1` natively support vision. Once `open_ai_api_key` is configured, the Agent's Vision tool automatically uses the main model to recognize images. If the main model does not support vision or you want to specify it explicitly, set it in the configuration file:
```json
{
@@ -41,11 +41,11 @@ OpenAI 是覆盖最完整的厂商,可同时承担文本对话、视觉理解
}
```
支持的 Vision 模型:`gpt-5.5``gpt-5.4``gpt-5.4-mini``gpt-5.4-nano``gpt-5``gpt-4.1``gpt-4.1-mini``gpt-4o`
Supported Vision models: `gpt-5.5`, `gpt-5.4`, `gpt-5.4-mini`, `gpt-5.4-nano`, `gpt-5`, `gpt-4.1`, `gpt-4.1-mini`, `gpt-4o`.
## 图像生成
## Image Generation
在配置文件中指定图像生成模型Agent 调用图像生成技能时会自动路由到 OpenAI
Specify the image generation model in the configuration file; the Agent automatically routes image generation skill calls to OpenAI:
```json
{
@@ -57,9 +57,9 @@ OpenAI 是覆盖最完整的厂商,可同时承担文本对话、视觉理解
}
```
支持的图像生成模型:`gpt-image-2``gpt-image-1`
Supported image generation models: `gpt-image-2`, `gpt-image-1`.
## 语音识别
## Speech-to-Text (ASR)
```json
{
@@ -68,14 +68,14 @@ OpenAI 是覆盖最完整的厂商,可同时承担文本对话、视觉理解
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `voice_to_text` | 设为 `openai` 启用 OpenAI 语音识别 |
| `voice_to_text_model` | 可选,默认 `gpt-4o-mini-transcribe`;也可填 `gpt-4o-transcribe``whisper-1` |
| `voice_to_text` | Set to `openai` to enable OpenAI speech-to-text |
| `voice_to_text_model` | Optional, defaults to `gpt-4o-mini-transcribe`; can also be `gpt-4o-transcribe`, `whisper-1` |
凭证自动复用 `open_ai_api_key`
Credentials are automatically reused from `open_ai_api_key`.
## 语音合成
## Text-to-Speech (TTS)
```json
{
@@ -85,12 +85,12 @@ OpenAI 是覆盖最完整的厂商,可同时承担文本对话、视觉理解
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `text_to_voice_model` | `tts-1``tts-1-hd``gpt-4o-mini-tts` |
| `tts_voice_id` | 音色:`alloy``echo``fable``onyx``nova``shimmer``ash``ballad``coral``sage``verse` |
| `text_to_voice_model` | `tts-1`, `tts-1-hd`, `gpt-4o-mini-tts` |
| `tts_voice_id` | Voices: `alloy`, `echo`, `fable`, `onyx`, `nova`, `shimmer`, `ash`, `ballad`, `coral`, `sage`, `verse` |
## 向量
## Embedding
```json
{
@@ -99,5 +99,5 @@ OpenAI 是覆盖最完整的厂商,可同时承担文本对话、视觉理解
}
```
可选模型:`text-embedding-3-small``text-embedding-3-large``text-embedding-ada-002`。修改 embedding 后需执行 `/memory rebuild-index` 命令重建索引。
Available models: `text-embedding-3-small`, `text-embedding-3-large`, `text-embedding-ada-002`. After changing the embedding, run `/memory rebuild-index` to rebuild the index.

View File

@@ -1,48 +1,42 @@
---
title: 百度千帆
description: 百度千帆 ERNIE 模型配置(文本对话 + 图像理解)
title: ERNIE
description: ERNIE model configuration (Baidu Qianfan)
---
百度千帆提供 ERNIE 系列模型,支持文本对话与图像理解。
<Tip>
通过 Web 控制台的「模型管理」页面可一站式配置以下全部能力,无需手动改配置文件。
</Tip>
## 文本对话
Option 1: Native integration (recommended):
```json
{
"model": "ernie-5.1",
"qianfan_api_key": "YOUR_API_KEY",
"qianfan_api_key": "",
"qianfan_api_base": "https://qianfan.baidubce.com/v2"
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `model` | 默认推荐使用 `ernie-5.1`;也可使用 `ernie-5.0``ernie-x1.1``ernie-4.5-turbo-128k``ernie-4.5-turbo-32k` |
| `qianfan_api_key` | 千帆 API Key,格式通常以 `bce-v3/` 开头 |
| `qianfan_api_base` | 可选,默认为 `https://qianfan.baidubce.com/v2` |
| `model` | Default recommendation: `ernie-5.1`; also supports `ernie-5.0`, `ernie-x1.1`, `ernie-4.5-turbo-128k`, `ernie-4.5-turbo-32k` |
| `qianfan_api_key` | Qianfan API key, usually starting with `bce-v3/` |
| `qianfan_api_base` | Optional, defaults to `https://qianfan.baidubce.com/v2` |
### 模型选择
## Model Selection
| 模型 | 适用场景 |
| Model | Use Case |
| --- | --- |
| `ernie-5.1` | 默认推荐,文心新一代旗舰模型,综合能力最强 |
| `ernie-5.0` | 上一代旗舰模型,综合能力优异 |
| `ernie-x1.1` | 深度思考推理模型,幻觉更低、指令遵循与工具调用更强 |
| `ernie-4.5-turbo-128k` | 长上下文和通用对话 |
| `ernie-4.5-turbo-32k` | 通用对话,成本和上下文更均衡 |
| `ernie-5.1` | Default recommendation; latest ERNIE flagship with the strongest overall capability |
| `ernie-5.0` | Previous-generation flagship with excellent overall capability |
| `ernie-x1.1` | Deep-thinking reasoning model with lower hallucination and stronger instruction following / tool calling |
| `ernie-4.5-turbo-128k` | Long-context and general chat |
| `ernie-4.5-turbo-32k` | General chat with a balanced context window and cost |
## 图像理解
## Vision tool
配置 `qianfan_api_key` Agent 的 Vision 工具可以自动使用千帆视觉模型:
Once `qianfan_api_key` is configured, Agent mode can auto-discover Qianfan for the Vision tool:
- 当主模型本身是多模态时(如 `ernie-5.1``ernie-5.0``ernie-x1.1``ernie-4.5-turbo-vl`),直接由主模型识别图像,无需额外配置
- 当主模型是纯文本时(如 `ernie-4.5-turbo-128k`Vision 工具会自动 fallback `ernie-4.5-turbo-vl`
- When the main model itself is multimodal (e.g. `ernie-5.1`, `ernie-5.0`, `ernie-x1.1`, `ernie-4.5-turbo-vl`), images are handled directly by the main model with no extra setup.
- When the main model is text-only (e.g. `ernie-4.5-turbo-128k`), the Vision tool automatically falls back to `ernie-4.5-turbo-vl`.
如需手动指定 Vision 模型,可在配置文件中显式配置:
To force a specific Vision model, set it explicitly in `config.json`:
```json
{
@@ -54,6 +48,17 @@ description: 百度千帆 ERNIE 模型配置(文本对话 + 图像理解)
}
```
Option 2: OpenAI-compatible configuration:
```json
{
"model": "ernie-5.1",
"bot_type": "openai",
"open_ai_api_key": "",
"open_ai_api_base": "https://qianfan.baidubce.com/v2"
}
```
<Tip>
新配置推荐使用 `qianfan_api_key`。旧的 `wenxin``wenxin-4``baidu_wenxin_api_key``baidu_wenxin_secret_key` 配置仍保持兼容。
Prefer `qianfan_api_key` for new configurations. Existing `wenxin`, `wenxin-4`, `baidu_wenxin_api_key`, and `baidu_wenxin_secret_key` configurations remain supported.
</Tip>

View File

@@ -1,15 +1,15 @@
---
title: 通义千问 Qwen
description: 通义千问模型配置(文本 / 图像理解 / 图像生成 / 语音识别 / 语音合成 / 向量)
title: Qwen
description: Qwen model configuration (Text / Image Understanding / Image Generation / Speech-to-Text / Text-to-Speech / Embedding)
---
通义千问DashScope / 百炼)是国内覆盖最完整的厂商之一,文本、图像理解、图像生成、语音识别、语音合成与向量能力均可用一份 `dashscope_api_key` 启用。
Qwen (Alibaba DashScope / Bailian) is one of the most fully-featured vendors. Text, image understanding, image generation, speech-to-text, text-to-speech, and embedding can all be enabled with a single `dashscope_api_key`.
<Tip>
通过 Web 控制台的「模型管理」页面可一站式配置以下全部能力,无需手动改配置文件。
All capabilities below can be configured in one place via the "Model Management" page in the Web Console, with no need to manually edit the configuration file.
</Tip>
## 文本对话
## Text Chat
```json
{
@@ -18,16 +18,16 @@ description: 通义千问模型配置(文本 / 图像理解 / 图像生成 /
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `model` | 可填 `qwen3.6-plus``qwen3.7-max``qwen3.5-plus``qwen3-max``qwen-max``qwen-plus``qwen-turbo``qwq-plus` |
| `dashscope_api_key` | 在 [百炼控制台](https://bailian.console.aliyun.com/?tab=model#/api-key) 创建,参考 [官方文档](https://bailian.console.aliyun.com/?tab=api#/api) |
| `model` | Can be `qwen3.6-plus`, `qwen3.7-max`, `qwen3.5-plus`, `qwen3-max`, `qwen-max`, `qwen-plus`, `qwen-turbo`, `qwq-plus`, etc. |
| `dashscope_api_key` | Create one in the [Bailian Console](https://bailian.console.aliyun.com/?tab=model#/api-key); see the [official docs](https://bailian.console.aliyun.com/?tab=api#/api) |
## 图像理解
## Image Understanding
配置 `dashscope_api_key` Agent Vision 工具会自动调用千问的视觉模型识别图像。`qwen3-max` / `qwen3.5-plus` / `qwen3.6-plus` 等模型本身就是多模态;若主模型是纯文本(如 `qwen-turbo`),会自动回落到 `qwen-vl-max`
Once `dashscope_api_key` is configured, the Agent's Vision tool automatically calls Qwen's vision models to recognize images. Models like `qwen3-max` / `qwen3.5-plus` / `qwen3.6-plus` are already multimodal; if the main model is text-only (e.g. `qwen-turbo`), it automatically falls back to `qwen-vl-max`.
如需手动指定 Vision 模型:
To manually specify a Vision model:
```json
{
@@ -39,9 +39,9 @@ description: 通义千问模型配置(文本 / 图像理解 / 图像生成 /
}
```
支持模型:`qwen3.6-plus``qwen3.5-plus``qwen3-max`
Supported models: `qwen3.6-plus`, `qwen3.5-plus`, `qwen3-max`.
## 图像生成
## Image Generation
```json
{
@@ -53,9 +53,9 @@ description: 通义千问模型配置(文本 / 图像理解 / 图像生成 /
}
```
可选模型:`qwen-image-2.0``qwen-image-2.0-pro`
Available models: `qwen-image-2.0`, `qwen-image-2.0-pro`.
## 语音识别
## Speech-to-Text (ASR)
```json
{
@@ -64,14 +64,14 @@ description: 通义千问模型配置(文本 / 图像理解 / 图像生成 /
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `voice_to_text` | 设为 `dashscope` 启用通义千问 ASR |
| `voice_to_text_model` | 可选,默认 `qwen3-asr-flash` |
| `voice_to_text` | Set to `dashscope` to enable Qwen ASR |
| `voice_to_text_model` | Optional, defaults to `qwen3-asr-flash` |
凭证自动复用 `dashscope_api_key`。单段音频建议小于 10MB、时长不超过 300 秒。
Credentials are automatically reused from `dashscope_api_key`. A single audio segment should be smaller than 10MB and no longer than 300 seconds.
## 语音合成
## Text-to-Speech (TTS)
```json
{
@@ -81,26 +81,26 @@ description: 通义千问模型配置(文本 / 图像理解 / 图像生成 /
}
```
| 参数 | 说明 |
| Parameter | Description |
| --- | --- |
| `text_to_voice_model` | 可选,默认 `qwen3-tts-flash`,覆盖普通话、方言与主流外语 |
| `tts_voice_id` | 音色 ID详见下方常用列表 |
| `text_to_voice_model` | Optional, defaults to `qwen3-tts-flash`; covers Mandarin, dialects, and major foreign languages |
| `tts_voice_id` | Voice ID; see the common list below |
常用音色示例:
Common voice examples:
| 音色 ID | 说明 |
| Voice ID | Description |
| --- | --- |
| `Cherry` | 芊悦 · 阳光女声 |
| `Serena` | 苏瑶 · 温柔女声 |
| `Ethan` | 晨煦 · 阳光男声 |
| `Chelsie` | 千雪 · 二次元少女 |
| `Dylan` | 北京话 · 晓东 |
| `Rocky` | 粤语 · 阿强 |
| `Sunny` | 四川话 · 晴儿 |
| `Cherry` | Qianyue · Sunny Female Voice |
| `Serena` | Suyao · Gentle Female Voice |
| `Ethan` | Chenxu · Sunny Male Voice |
| `Chelsie` | Qianxue · Anime Girl |
| `Dylan` | Beijing Dialect · Xiaodong |
| `Rocky` | Cantonese · Aqiang |
| `Sunny` | Sichuan Dialect · Qing'er |
完整音色(普通话 / 各地方言 / 双语等)可在 Web 控制台的「模型管理 → 语音合成」下拉框中可视化选择。
The full voice list (Mandarin / regional dialects / bilingual, etc.) can be selected visually in the Web Console under "Model Management → Text-to-Speech".
## 向量
## Embedding
```json
{
@@ -109,4 +109,4 @@ description: 通义千问模型配置(文本 / 图像理解 / 图像生成 /
}
```
默认模型 `text-embedding-v4`。修改 embedding 后需执行 `/memory rebuild-index` 命令重建索引。
The default model is `text-embedding-v4`. After changing the embedding, run `/memory rebuild-index` to rebuild the index.