chatgpt-on-wechat

mirror of https://github.com/zhayujie/chatgpt-on-wechat.git synced 2026-07-17 11:07:11 +08:00

Author	SHA1	Message	Date
zhayujie	5d55ec0f8c	feat(browser): reuse system Chrome/Edge, bundle playwright for desktop	2026-07-14 18:02:27 +08:00
zhayujie	8c7cda89dc	feat(mcp): support OAuth authorization for remote MCP servers	2026-07-13 12:00:14 +08:00
zhayujie	7047b30e27	feat(models): support doubao-seed-2.1 series	2026-06-25 11:53:24 +08:00
zhayujie	92ec9653e5	feat(models): support qwen3.7-plus multi-modal model	2026-06-02 16:38:17 +08:00
zhayujie	7bf4ef3d05	docs: make English the default docs language and fix link paths	2026-05-31 17:52:22 +08:00
zhayujie	ad2db1a776	feat(mcp): support streamable-http mcp protocol	2026-05-26 12:11:59 +08:00
zhayujie	36b913124b	docs: update models and channels doc	2026-05-22 10:10:07 +08:00
zhayujie	b8333e351c	feat(voice): rework TTS/ASR stack and unify tool/skill config schema	2026-05-21 16:00:54 +08:00
zhayujie	a0dfdb79df	feat(browser): persistent login + CDP attach mode #2809	2026-05-19 11:52:11 +08:00
		Browser sessions now reuse a Chromium user profile across runs by default (`~/.cow/browser_profile`), so users only log in to a site once.
		Three launch modes are selectable via `tools.browser` in config.json:
		- persistent (default): Playwright Chromium with a persistent user_data_dir
		- cdp: attach to an externally launched real Chrome via `cdp_endpoint` (full fingerprints, ideal for sites with strict bot detection)
		- fresh: clean context every run, set `persistent: false`
		Also:
		- Self-heal when the user closes the browser window mid-session: detect closed page/context/browser via close listeners and exception scanning, then transparently relaunch on the next request.
		- Graceful CDP shutdown: disconnect only, never kill the user's Chrome.
		- Friendly errors when the CDP endpoint is unreachable or the persistent profile is locked, so the LLM can guide the user instead of looping.
		- Fix tool config being silently overwritten by workspace config in AgentInitializer; per-tool user settings (e.g. browser.cdp_endpoint) are now merged instead of replaced.
		- Update zh / en / ja docs with the new login-persistence section, including the Chrome 137+ requirement to pair --remote-debugging-port with a dedicated --user-data-dir.
zhayujie	907825601d	feat(models): add baidu ernie-5.1	2026-05-10 18:39:38 +08:00
zhayujie	fb341b869b	docs(mcp): add MCP tools guide	2026-05-08 16:14:48 +08:00
zhayujie	a5790d82f6	feat(qianfan): scope vision support to multimodal models	2026-05-06 16:11:10 +08:00
jimmyzhuu	fb7962c7f2	fix: use available qianfan vision model	2026-05-06 13:34:39 +08:00
jimmyzhuu	76e6b7b471	docs: document qianfan vision support	2026-05-06 13:28:46 +08:00
zhayujie	80e9062041	fix(vision): respect tool.vision.model and add automatic fallback #2792	2026-05-03 22:28:32 +08:00
zhayujie	67bd3420ed	perf(scheduler): bound isolated session context to agent_max_context_turns/5	2026-05-03 21:49:59 +08:00
zhayujie	aea081703f	fix(scheduler): inject delivered output into receiver session with sliding window	2026-05-03 21:27:24 +08:00
		Further refinements on top of #2795:
		- persist real session_id (notify_session_id) at task creation so group chats correctly map back to the user's actual conversation
		- mark scheduler turns with [SCHEDULED] (recognise legacy "Scheduled task" prefix too for backward-compatible pruning)
		- prune both DB and in-memory to scheduler_inject_max_per_session (default 3), only marker-tagged pairs are touched; regular user turns never deleted
		- send_message type gated by scheduler_inject_send_message (default false); fixed reminder text rarely benefits follow-up Q&A
		Co-authored-by: huangrichao2020 <grdomai43881@gmail.com>
zhayujie	68ce2e5232	feat(skill): multi-provider image generation with auto-fallback	2026-04-23 12:39:39 +08:00
		- Add Gemini, Seedream (Volcengine Ark), Qwen (DashScope), MiniMax providers to image-generation skill with universal sequential fallback: OpenAI → Gemini → Seedream → Qwen → MiniMax → LinkAI
		- Each provider filters unsupported size tiers to valid values (e.g. Seedream 1K→2K, Qwen 3K→2K, Gemini 3K→2K)
		- Pinned model only tries its native provider; auto-routing uses each provider's default model
		- Support skill-namespaced config (config.skill.image-generation.model → SKILL_IMAGE_GENERATION_MODEL env var)
		- Add image lightbox (click-to-enlarge) in web console
		- Add docs for built-in skills (skill-creator, knowledge-wiki, image-generation) under docs/skills/
zhayujie	2c13e1b923	feat(models): support kimi-k2.6	2026-04-22 12:01:40 +08:00
zhayujie	5162da5654	Merge branch 'master' into feat-knowledge	2026-04-12 16:46:38 +08:00
zhayujie	a1d82f6193	feat(knowledge): add cli and update docs	2026-04-12 16:39:06 +08:00
zhayujie	26693acc3f	feat(vision): prioritize main model for image recognition with multi-provider fallback	2026-04-11 19:46:11 +08:00
		- Add call_vision method to all bot implementations (DashScope, Claude, Gemini, ZhipuAI, MiniMax, Doubao, Moonshot, OpenAICompatibleBot) using each vendor's native multimodal API format
		- Remove call_with_tools/call_vision from Bot base class to fix MRO shadowing issue with OpenAICompatibleBot mixin
		- Refactor vision tool provider resolution: MainModel → other configured models (auto-discovered) → OpenAI → LinkAI, with automatic fallback
		- Return actual model name used in call_vision responses
		- Sync config.json API keys to .env bidirectionally on startup
		- Fix bot instance cache to detect bot_type/use_linkai config changes
		- Add SSE reconnection support for web console
		- Preserve image path hints in Gemini text for correct vision tool calls
		- Update docs/tools/vision.mdx
zhayujie	3cb5a0fbd6	docs: add CLI system docs	2026-03-29 17:57:12 +08:00
zhayujie	fccfa92d7e	docs: update channel docs	2026-02-28 14:50:55 +08:00
zhayujie	6db22827f2	feat: docs update	2026-02-27 16:03:47 +08:00

25 Commits
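
Commits 68ce2e5232 and 26693acc3f both describe a sequential provider fallback: try providers in a fixed order, return the first success, and report which one answered. The following is a minimal sketch of that general pattern, not the repository's code. The names `generate_with_fallback`, `AllProvidersFailed`, and the stub providers are hypothetical, and pinning is simplified here to selecting a provider by name rather than by model.

```python
from typing import Callable, List, Optional, Sequence, Tuple

# Hypothetical provider shape: (name, callable that returns a result or raises).
Provider = Tuple[str, Callable[[str], str]]


class AllProvidersFailed(RuntimeError):
    """Raised when no candidate provider produced a result."""


def generate_with_fallback(
    prompt: str,
    providers: Sequence[Provider],
    pinned: Optional[str] = None,
) -> Tuple[str, str]:
    """Try providers in order; return (provider_name, result) from the first success.

    If `pinned` is given, only the provider with that name is tried
    (an assumption standing in for "pinned model only tries its native provider").
    """
    candidates = [p for p in providers if pinned is None or p[0] == pinned]
    errors: List[str] = []
    for name, call in candidates:
        try:
            return name, call(prompt)
        except Exception as exc:  # any provider failure moves on to the next one
            errors.append(f"{name}: {exc}")
    raise AllProvidersFailed("; ".join(errors) or "no matching provider")


# Stub providers for illustration only; real ones would call vendor APIs.
def _fail(prompt: str) -> str:
    raise RuntimeError("quota exceeded")


def _ok(prompt: str) -> str:
    return f"image-for:{prompt}"


chain = [("openai", _fail), ("gemini", _ok), ("seedream", _ok)]
print(generate_with_fallback("a red fox", chain))
```

Returning the provider name alongside the result mirrors the "return actual model name used" note in 26693acc3f, so callers can tell which link in the chain answered.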