--- title: LinkAI description: Access text, vision, image, speech, and embedding capabilities through the LinkAI platform --- A single `linkai_api_key` gives you access to all capabilities of mainstream vendors such as OpenAI, Claude, Gemini, DeepSeek, MiniMax, Qwen, Kimi, and Doubao. All capabilities below can be configured in one place via the "Model Management" page in the Web Console, with no need to manually edit the configuration file. ## Text Chat ```json { "use_linkai": true, "linkai_api_key": "YOUR_API_KEY" } ``` | Parameter | Description | | --- | --- | | `use_linkai` | Set to `true` to enable | | `linkai_api_key` | Create one in the [Console](https://link-ai.tech/console/interface) | | `model` | Can be any code from the [model list](https://link-ai.tech/console/models) | See [Model Service](https://link-ai.tech/console/models) for more. ## Image Understanding Once configured, the Agent's Vision tool automatically calls multimodal models via the gateway, with no extra setup required. To manually specify a Vision model: ```json { "tools": { "vision": { "model": "gpt-5.4-mini" } } } ``` Available models: `gpt-4.1-mini`, `gpt-5.4-mini`, `qwen3.6-plus`, `doubao-seed-2-0-pro-260215`, `kimi-k2.6`, `claude-sonnet-4-6`, `gemini-3.1-flash-lite-preview`, etc. ## Image Generation ```json { "skills": { "image-generation": { "model": "gpt-image-2" } } } ``` | Model ID | Alias | | --- | --- | | `gpt-image-2` | OpenAI | | `gemini-3.1-flash-image-preview` | Nano Banana 2 | | `gemini-3-pro-image-preview` | Nano Banana Pro | | `seedream-5.0-lite` | ByteDance Doubao Seedream | ## Speech-to-Text (ASR) ```json { "voice_to_text": "linkai" } ``` ASR uses Whisper by default; credentials are automatically reused from `linkai_api_key`. ## Text-to-Speech (TTS) The TTS gateway supports multiple underlying engines. The engine is selected by `text_to_voice_model`, and the available voices change with the engine. ```json { "text_to_voice": "linkai", "text_to_voice_model": "doubao", "tts_voice_id": "BV001_streaming" } ``` | `text_to_voice_model` | Engine | | --- | --- | | `tts-1` | OpenAI · Multi-language (voices like `alloy` / `nova` / `echo`, etc.) | | `doubao` | ByteDance Doubao · Rich Chinese voices | | `baidu` | Baidu · Chinese broadcaster voices | Voices differ by engine; we recommend selecting them visually in the Web Console under "Model Management → Text-to-Speech". ## Embedding ```json { "embedding_provider": "linkai", "embedding_model": "text-embedding-3-small" } ``` The default model is `text-embedding-3-small` (OpenAI-compatible). After changing the embedding, run `/memory rebuild-index` to rebuild the index.