mirror of
https://github.com/zhayujie/chatgpt-on-wechat.git
synced 2026-06-02 00:57:41 +08:00
104 lines
2.7 KiB
Plaintext
---
title: LinkAI
description: Access text, vision, image, speech, and embedding capabilities through the LinkAI platform
---

A single `linkai_api_key` gives you access to all capabilities of mainstream vendors such as OpenAI, Claude, Gemini, DeepSeek, MiniMax, Qwen, Kimi, and Doubao.

<Tip>
All capabilities below can be configured in one place via the "Model Management" page in the Web Console, with no need to manually edit the configuration file.
</Tip>

## Text Chat

```json
{
  "use_linkai": true,
  "linkai_api_key": "YOUR_API_KEY"
}
```

| Parameter | Description |
| --- | --- |
| `use_linkai` | Set to `true` to enable |
| `linkai_api_key` | Create one in the [Console](https://link-ai.tech/console/interface) |
| `model` | Can be any code from the [model list](https://link-ai.tech/console/models) |

See [Model Service](https://link-ai.tech/console/models) for more.

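As a quick sanity check before starting the bot, the two required settings above can be verified with a few lines of Python. This is only a minimal sketch and not part of the project: the helper name `linkai_problems` is hypothetical, and it assumes nothing beyond the configuration being a JSON object with the keys documented in the table.

```python
import json


def linkai_problems(cfg: dict) -> list[str]:
    """Return human-readable problems with the LinkAI text-chat settings."""
    problems = []
    # `use_linkai` should be the JSON boolean true, not the string "true".
    if cfg.get("use_linkai") is not True:
        problems.append("use_linkai must be set to true")
    key = cfg.get("linkai_api_key", "")
    # Treat the placeholder from the example above as "not configured".
    if not isinstance(key, str) or not key.strip() or key == "YOUR_API_KEY":
        problems.append("linkai_api_key is missing or still a placeholder")
    return problems


# The example fragment from this section still contains the placeholder key.
example = json.loads('{"use_linkai": true, "linkai_api_key": "YOUR_API_KEY"}')
print(linkai_problems(example))
```

An empty list means both settings look usable; the check does not contact LinkAI, so it cannot tell whether the key itself is valid.
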
## Image Understanding

Once LinkAI is enabled as shown under Text Chat, the Agent's Vision tool automatically calls multimodal models through the gateway, with no extra setup required. To manually specify a Vision model:

```json
{
  "tools": {
    "vision": {
      "model": "gpt-5.4-mini"
    }
  }
}
```

Available models: `gpt-4.1-mini`, `gpt-5.4-mini`, `qwen3.6-plus`, `doubao-seed-2-0-pro-260215`, `kimi-k2.6`, `claude-sonnet-4-6`, `gemini-3.1-flash-lite-preview`, etc.

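If the configuration already has other entries under `tools`, the Vision model should be set without overwriting them. The sketch below shows one way to do that in Python. `set_vision_model` is a hypothetical helper, and the `"search"` entry in the usage is invented purely to show that sibling entries survive; it is not a documented tool.

```python
def set_vision_model(cfg: dict, model: str) -> dict:
    """Return a copy of cfg with tools.vision.model set to `model`."""
    tools = dict(cfg.get("tools", {}))      # copy so the input is not mutated
    vision = dict(tools.get("vision", {}))  # keep any existing vision options
    vision["model"] = model
    tools["vision"] = vision
    return {**cfg, "tools": tools}


# "search" is a made-up sibling entry, used only for illustration.
cfg = {"use_linkai": True, "tools": {"search": {"enabled": True}}}
updated = set_vision_model(cfg, "gpt-5.4-mini")
print(updated["tools"])
```
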
## Image Generation

```json
{
  "skills": {
    "image-generation": {
      "model": "gpt-image-2"
    }
  }
}
```

| Model ID | Alias |
| --- | --- |
| `gpt-image-2` | OpenAI |
| `gemini-3.1-flash-image-preview` | Nano Banana 2 |
| `gemini-3-pro-image-preview` | Nano Banana Pro |
| `seedream-5.0-lite` | ByteDance Doubao Seedream |

## Speech-to-Text (ASR)

```json
{
  "voice_to_text": "linkai"
}
```

ASR uses Whisper by default; credentials are automatically reused from `linkai_api_key`.

## Text-to-Speech (TTS)

The TTS gateway supports multiple underlying engines. The engine is selected by `text_to_voice_model`, and the available voices change with the engine.

```json
{
  "text_to_voice": "linkai",
  "text_to_voice_model": "doubao",
  "tts_voice_id": "BV001_streaming"
}
```

| `text_to_voice_model` | Engine |
| --- | --- |
| `tts-1` | OpenAI · Multi-language (voices like `alloy` / `nova` / `echo`, etc.) |
| `doubao` | ByteDance Doubao · Rich Chinese voices |
| `baidu` | Baidu · Chinese broadcaster voices |

Voices differ by engine; we recommend selecting them visually in the Web Console under "Model Management → Text-to-Speech".

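Because the engine and the voice have to match, it can help to generate the TTS fragment from a small helper that rejects engine names outside the table above. This is a sketch under stated assumptions: `tts_fragment` and `DOCUMENTED_ENGINES` are hypothetical names, the gateway may accept engines not listed here, and passing an OpenAI voice such as `alloy` through `tts_voice_id` is an assumption inferred from the table rather than something this page states.

```python
# Engines listed in the table above; the gateway may support others.
DOCUMENTED_ENGINES = {"tts-1", "doubao", "baidu"}


def tts_fragment(engine: str, voice_id: str) -> dict:
    """Build the TTS settings for one engine/voice pair."""
    if engine not in DOCUMENTED_ENGINES:
        raise ValueError(f"unknown text_to_voice_model: {engine!r}")
    return {
        "text_to_voice": "linkai",
        "text_to_voice_model": engine,
        # Assumption: the same key carries the voice for every engine.
        "tts_voice_id": voice_id,
    }


print(tts_fragment("tts-1", "alloy"))
```

The helper does not check whether a voice exists for the chosen engine; the Web Console remains the reliable place to look that up.
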
## Embedding

```json
{
  "embedding_provider": "linkai",
  "embedding_model": "text-embedding-3-small"
}
```

The default model is `text-embedding-3-small` (OpenAI-compatible). After changing the embedding model, run `/memory rebuild-index` to rebuild the index.
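
Each section on this page shows its settings as a separate JSON object. Assuming they all belong at the top level of the same configuration file (this page does not say so explicitly), the fragments can be combined as in the following Python sketch. The values are the ones used in the examples above, and the variable names are illustrative only.

```python
import json

# One fragment per section of this page, copied from the examples above.
fragments = [
    {"use_linkai": True, "linkai_api_key": "YOUR_API_KEY"},
    {"tools": {"vision": {"model": "gpt-5.4-mini"}}},
    {"skills": {"image-generation": {"model": "gpt-image-2"}}},
    {"voice_to_text": "linkai"},
    {
        "text_to_voice": "linkai",
        "text_to_voice_model": "doubao",
        "tts_voice_id": "BV001_streaming",
    },
    {"embedding_provider": "linkai", "embedding_model": "text-embedding-3-small"},
]

config = {}
for fragment in fragments:
    # The top-level keys are distinct across fragments, so a shallow
    # update is enough here; nested merging would be needed otherwise.
    config.update(fragment)

print(json.dumps(config, indent=2))
```
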