chatgpt-on-wechat

mirror of https://github.com/zhayujie/chatgpt-on-wechat.git synced 2026-07-17 11:07:11 +08:00

Author	SHA1	Message	Date
zhayujie	5d55ec0f8c	feat(browser): reuse system Chrome/Edge, bundle playwright for desktop	2026-07-14 18:02:27 +08:00
zhayujie	8c7cda89dc	feat(mcp): support OAuth authorization for remote MCP servers	2026-07-13 12:00:14 +08:00
zhayujie	7047b30e27	feat(models): support doubao-seed-2.1 series	2026-06-25 11:53:24 +08:00
zhayujie	92ec9653e5	feat(models): support qwen3.7-plus multi-modal model	2026-06-02 16:38:17 +08:00
zhayujie	ad2db1a776	feat(mcp): support streamable-http mcp protocol	2026-05-26 12:11:59 +08:00
zhayujie	91d427c8f9	docs: update docs and readme	2026-05-24 18:29:57 +08:00
zhayujie	069bffa3e8	feat: release 2.0.9	2026-05-22 12:25:22 +08:00
zhayujie	b8333e351c	feat(voice): rework TTS/ASR stack and unify tool/skill config schema	2026-05-21 16:00:54 +08:00
zhayujie	a0dfdb79df	feat(browser): persistent login + CDP attach mode #2809	2026-05-19 11:52:11 +08:00
    Browser sessions now reuse a Chromium user profile across runs by default (`~/.cow/browser_profile`), so users only log in to a site once. Three launch modes are selectable via `tools.browser` in config.json:
    - persistent (default): Playwright Chromium with a persistent user_data_dir
    - cdp: attach to an externally launched real Chrome via `cdp_endpoint` (full fingerprints, ideal for sites with strict bot detection)
    - fresh: clean context every run, set `persistent: false`
    Also:
    - Self-heal when the user closes the browser window mid-session: detect closed page/context/browser via close listeners and exception scanning, then transparently relaunch on the next request.
    - Graceful CDP shutdown: disconnect only, never kill the user's Chrome.
    - Friendly errors when the CDP endpoint is unreachable or the persistent profile is locked, so the LLM can guide the user instead of looping.
    - Fix tool config being silently overwritten by workspace config in AgentInitializer; per-tool user settings (e.g. browser.cdp_endpoint) are now merged instead of replaced.
    - Update zh / en / ja docs with the new login-persistence section, including the Chrome 137+ requirement to pair --remote-debugging-port with a dedicated --user-data-dir.
zhayujie	907825601d	feat(models): add baidu ernie-5.1	2026-05-10 18:39:38 +08:00
zhayujie	fb341b869b	docs(mcp): add MCP tools guide	2026-05-08 16:14:48 +08:00
zhayujie	a5790d82f6	feat(qianfan): scope vision support to multimodal models	2026-05-06 16:11:10 +08:00
jimmyzhuu	fb7962c7f2	fix: use available qianfan vision model	2026-05-06 13:34:39 +08:00
jimmyzhuu	76e6b7b471	docs: document qianfan vision support	2026-05-06 13:28:46 +08:00
zhayujie	2c13e1b923	feat(models): support kimi-k2.6	2026-04-22 12:01:40 +08:00
zhayujie	5162da5654	Merge branch 'master' into feat-knowledge	2026-04-12 16:46:38 +08:00
zhayujie	a1d82f6193	feat(knowledge): add cli and update docs	2026-04-12 16:39:06 +08:00
zhayujie	26693acc3f	feat(vision): prioritize main model for image recognition with multi-provider fallback	2026-04-11 19:46:11 +08:00
    - Add call_vision method to all bot implementations (DashScope, Claude, Gemini, ZhipuAI, MiniMax, Doubao, Moonshot, OpenAICompatibleBot) using each vendor's native multimodal API format
    - Remove call_with_tools/call_vision from Bot base class to fix MRO shadowing issue with OpenAICompatibleBot mixin
    - Refactor vision tool provider resolution: MainModel → other configured models (auto-discovered) → OpenAI → LinkAI, with automatic fallback
    - Return actual model name used in call_vision responses
    - Sync config.json API keys to .env bidirectionally on startup
    - Fix bot instance cache to detect bot_type/use_linkai config changes
    - Add SSE reconnection support for web console
    - Preserve image path hints in Gemini text for correct vision tool calls
    - Update docs/tools/vision.mdx
Ikko Ashimine	5487c0befe	docs: add Japanese documents	2026-03-18 19:13:39 +09:00
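
Commit a0dfdb79df describes three browser launch modes (persistent, cdp, fresh) configured under `tools.browser`. The sketch below shows one way such a mode could be resolved from a config dict. The function name `resolve_browser_mode` and the precedence order are assumptions made for illustration; only the keys `cdp_endpoint` and `persistent` and the default profile path come from the commit message, and the project's actual logic may differ.

```python
from typing import Any, Dict

# Default profile location named in the commit message.
DEFAULT_PROFILE_DIR = "~/.cow/browser_profile"


def resolve_browser_mode(browser_cfg: Dict[str, Any]) -> str:
    """Pick a launch mode from a hypothetical `tools.browser` config dict.

    Assumed precedence (not confirmed by the source):
    a non-empty `cdp_endpoint` selects "cdp", `persistent: false`
    selects "fresh", and anything else uses the default "persistent".
    """
    if browser_cfg.get("cdp_endpoint"):
        return "cdp"
    if browser_cfg.get("persistent", True) is False:
        return "fresh"
    return "persistent"


if __name__ == "__main__":
    # The endpoint URL is a placeholder, not a value from the source.
    for cfg in ({}, {"persistent": False}, {"cdp_endpoint": "http://127.0.0.1:9222"}):
        print(cfg, "->", resolve_browser_mode(cfg))
```

Keeping the resolution in a pure function, separate from any Playwright calls, makes the mode selection easy to test without launching a browser.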

19 Commits
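
Commit 26693acc3f describes a vision provider chain (MainModel → other configured models → OpenAI → LinkAI) with automatic fallback, returning the name of the model actually used. A minimal sketch of that pattern follows. The names `call_vision_with_fallback` and `VisionCall`, the callable signature, and the response shape are hypothetical and are not the project's `call_vision` API.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical provider signature: (image_path, question) -> answer text.
VisionCall = Callable[[str, str], str]


def call_vision_with_fallback(
    providers: List[Tuple[str, VisionCall]],
    image_path: str,
    question: str,
) -> Dict[str, str]:
    """Try each provider in priority order and return the first success.

    The result includes the model name that actually answered, mirroring
    the commit's note about returning the real model name.
    """
    errors = []
    for model_name, call in providers:
        try:
            answer = call(image_path, question)
        except Exception as exc:  # any provider failure triggers fallback
            errors.append(f"{model_name}: {exc}")
            continue
        return {"model": model_name, "content": answer}
    raise RuntimeError("all vision providers failed: " + "; ".join(errors))


if __name__ == "__main__":
    def broken_main_model(image_path: str, question: str) -> str:
        raise ValueError("model has no vision support")

    def backup_model(image_path: str, question: str) -> str:
        return f"description of {image_path}"

    chain = [("main-model", broken_main_model), ("backup-model", backup_model)]
    print(call_vision_with_fallback(chain, "cat.png", "What is in the image?"))
```

Collecting the per-provider errors before raising gives the caller a single message that explains why every step of the chain failed.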