chatgpt-on-wechat

mirror of https://github.com/zhayujie/chatgpt-on-wechat.git synced 2026-07-17 11:07:11 +08:00

Author	SHA1	Message	Date
zhayujie	a0dfdb79df	feat(browser): persistent login + CDP attach mode #2809 Browser sessions now reuse a Chromium user profile across runs by default (`~/.cow/browser_profile`), so users only log in to a site once. Three launch modes are selectable via `tools.browser` in config.json: - persistent (default): Playwright Chromium with a persistent user_data_dir - cdp: attach to an externally launched real Chrome via `cdp_endpoint` (full fingerprints, ideal for sites with strict bot detection) - fresh: clean context every run, set `persistent: false` Also: - Self-heal when the user closes the browser window mid-session: detect closed page/context/browser via close listeners and exception scanning, then transparently relaunch on the next request. - Graceful CDP shutdown: disconnect only, never kill the user's Chrome. - Friendly errors when the CDP endpoint is unreachable or the persistent profile is locked, so the LLM can guide the user instead of looping. - Fix tool config being silently overwritten by workspace config in AgentInitializer; per-tool user settings (e.g. browser.cdp_endpoint) are now merged instead of replaced. - Update zh / en / ja docs with the new login-persistence section, including the Chrome 137+ requirement to pair --remote-debugging-port with a dedicated --user-data-dir.	2026-05-19 11:52:11 +08:00
zhayujie	a85c5f9d4e	fix(scheduler): make scheduler init idempotent to prevent duplicate task runs	2026-05-18 18:36:48 +08:00
zhayujie	fe871aad77	fix(tools): unify text file truncation thresholds in read tool	2026-05-13 16:15:06 +08:00
zhayujie	29e66cb186	fix(mcp): correct hot-reload sync on default Agent	2026-05-08 15:40:29 +08:00
zhayujie	307769b949	feat(mcp): load MCP servers asynchronously at startup Boot MCP servers (npx/uvx) on a background thread instead of blocking agent init. Built-in tools serve traffic immediately while MCP comes online; each new agent reads whatever is ready at creation time. Idempotent via _mcp_loaded flag — concurrent sessions never re-fork subprocesses. Per-server failures are isolated and warmup is triggered in app.py so loading overlaps with channel startup.	2026-05-08 15:22:42 +08:00
ooaaooaa123	b861eef26f	fix(mcp): address PR review feedback on stability and config Stability fixes in mcp_client.py: - Fix stderr buffer overflow: start daemon thread to continuously drain stderr pipe, preventing 64KB buffer fill that blocks child process - Fix notification interference: loop readline and skip JSON-RPC messages without 'id' field (notifications) instead of treating them as responses - Fix concurrent race condition: wrap send+receive in _call_lock so multiple sessions cannot interleave reads/writes on the same client - Fix missing timeout: use select.select() with 30s timeout in _readline_with_timeout() to prevent infinite block on dead MCP server Config improvements in tool_manager.py: - Add _normalize_mcp_configs() to support both list format (mcp_servers) and dict format (mcpServers used by Claude Desktop / Cursor) - Add _load_mcp_configs() to load from ~/cow/mcp.json first, falling back to config.json mcp_servers field for backward compatibility Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-08 09:58:40 +08:00
ooaaooaa123	caaf006a49	fix(mcp): wire MCP tools into agent and fix env var inheritance Two bugs found during end-to-end validation with Amap and Chrome DevTools MCP servers: 1. MCP tools were loaded into ToolManager._mcp_tool_instances but never added to the agent's tool list. AgentInitializer._load_tools() only iterated tool_classes (built-in tools). Added a second pass to append all MCP tool instances. 2. When a MCP server config contains an "env" dict, it was passed directly to subprocess.Popen, replacing the entire process environment. This caused npx to fail because PATH and other inherited vars were missing. Fixed by merging config env on top of os.environ. Validated with: - @amap/amap-maps-mcp-server (12 tools, stdio + API key env var) - chrome-devtools-mcp (29 tools, stdio + remote debugging port) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-06 20:40:56 +08:00
ooaaooaa123	b2429ec30c	feat(mcp): add MCP (Model Context Protocol) tool integration Allows CowAgent to dynamically load tools from any MCP server at startup, extending the agent from a fixed toolset to an open, extensible tool ecosystem. ## What's added - `agent/tools/mcp/mcp_client.py`: lightweight JSON-RPC client supporting both stdio (subprocess) and SSE (HTTP) transports — zero extra dependencies - `agent/tools/mcp/mcp_tool.py`: `McpTool` wraps a single MCP tool as a `BaseTool`, with dynamic name/description/params set at instance level - `agent/tools/tool_manager.py`: new `_load_mcp_tools()` loads MCP servers at startup via `McpClientRegistry`; falls back gracefully on any error; no-op when `mcp_servers` is not configured - `config.py`: registers `mcp_servers` in `available_setting` with inline docs ## Design - No new dependencies — JSON-RPC implemented from scratch using stdlib only - MCP clients are long-lived (initialized once, shared across tool calls) - `McpClientRegistry` holds all subprocess handles and shuts them down cleanly - Server init failures are non-fatal: logged as warnings, agent continues normally - Zero overhead when `mcp_servers` is absent from config ## Config example ```json "mcp_servers": [ { "name": "filesystem", "type": "stdio", "command": "npx", "args": ["-y", "@modelcontextprotocol/server-filesystem", "/tmp"] } ] ``` Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-06 20:16:04 +08:00
zhayujie	a5790d82f6	feat(qianfan): scope vision support to multimodal models	2026-05-06 16:11:10 +08:00
zhayujie	63f99af1e6	Merge pull request #2800 from jimmyzhuu/feat/qianfan-vision-provider Add Qianfan support to Vision tool	2026-05-06 15:39:12 +08:00
zhayujie	4eed2568aa	fix(bash): reduce safety check false positives	2026-05-06 15:36:44 +08:00
jimmyzhuu	fb7962c7f2	fix: use available qianfan vision model	2026-05-06 13:34:39 +08:00
jimmyzhuu	fccb7ff9ed	feat: route qianfan vision provider	2026-05-06 13:25:59 +08:00
zhayujie	8730f7fd27	fix(memory): exclude scheduler-injected pairs from daily memory flush	2026-05-05 16:53:01 +08:00
zhayujie	80e9062041	fix(vision): respect tool.vision.model and add automatic fallback #2792	2026-05-03 22:28:32 +08:00
zhayujie	aea081703f	fix(scheduler): inject delivered output into receiver session with sliding window Further refinements on top of #2795: - persist real session_id (notify_session_id) at task creation so group chats correctly map back to the user's actual conversation - mark scheduler turns with [SCHEDULED] (recognise legacy "Scheduled task" prefix too for backward-compatible pruning) - prune both DB and in-memory to scheduler_inject_max_per_session (default 3), only marker-tagged pairs are touched; regular user turns never deleted - send_message type gated by scheduler_inject_send_message (default false) — fixed reminder text rarely benefits follow-up Q&A Co-authored-by: huangrichao2020 <grdomai43881@gmail.com>	2026-05-03 21:27:24 +08:00
tingchim2pro	f150d7d83a	fix: remember scheduled task outputs in receiver session (v2) Address review feedback from #2794: 1. Use notify_session_id instead of receiver for correct group chat mapping - Task creation should store the real session_id in action.notify_session_id - Falls back to receiver for backward compatibility with old tasks 2. Add injection to all four execution branches: - _execute_agent_task - _execute_send_message - _execute_tool_call - _execute_skill_call (also fixed missing channel.send) 3. Add config switch and content truncation: - scheduler_inject_to_session (default: true) to toggle the feature - 2000 char limit prevents high-frequency tasks from bloating sessions Fixes #2793	2026-05-02 19:00:50 +08:00
zhayujie	02bfe30848	fix(memory): prevent duplicate Deep Dream runs	2026-04-28 15:30:51 +08:00
zhayujie	c9c99de3d9	fix(bash): scope safety confirm to destructive deletions outside workspace	2026-04-28 10:18:47 +08:00
zhayujie	ae11159918	feat(models): unify enable_thinking for deepseek-v4 and other thinking models	2026-04-24 15:22:45 +08:00
zhayujie	81e8bb62ae	feat(skill): support gpt-image-2 in image generation skill	2026-04-22 20:39:49 +08:00
zhayujie	2c13e1b923	feat(models): support kimi-k2.6	2026-04-22 12:01:40 +08:00
zhayujie	a0748c2e3b	fix(web): cap reasoning content to 4KB across stream/storage/display	2026-04-21 20:31:38 +08:00
zhayujie	40599bb751	fix(web): smart auto-scroll for chat #2775	2026-04-20 21:43:21 +08:00
zhayujie	6dd316547f	fix(web): fix session title generation fallback and reset Bridge on config change	2026-04-19 18:43:48 +08:00
zhayujie	d25b8966ce	fix(web): prevent duplicate image previews	2026-04-18 22:32:34 +08:00
zhayujie	14a119c48c	fix(gemini): resolve tool calls not returning	2026-04-18 21:18:27 +08:00
zhayujie	c82515a927	fix(agent): don't drop tool_calls from empty-response retry	2026-04-18 20:50:40 +08:00
zhayujie	13370d2056	fix: thinking display is disabled by default	2026-04-17 15:31:59 +08:00
zhayujie	426fb88ce7	fix(knowledge): exclude root-level files from knowledge stats to preserve empty state	2026-04-16 22:55:46 +08:00
zhayujie	ba3f66d3d1	feat: show root-level files (index.md, log.md) in knowledge tree	2026-04-16 21:47:44 +08:00
zhayujie	848430f062	feat(knowledge): support nested directories in knowledge base listing and display	2026-04-16 12:28:18 +08:00
zhayujie	d4e5ecd497	fix: restore Python 3.7 compatibility by deferring Literal import in truncate.py	2026-04-15 12:29:09 +08:00
zhayujie	83f778fec9	feat(dream): structured organization of dream memories	2026-04-15 11:27:46 +08:00
zhayujie	3a50b64977	feat: web multi session interface	2026-04-14 22:58:25 +08:00
zhayujie	83f6625e0c	feat: release 2.0.6	2026-04-14 12:08:57 +08:00
zhayujie	acc09543b7	feat(dream): add memory dream cli and docs - New memory/deep-dream.mdx (zh/en/ja): memory flow, distillation rules, dream diary, manual trigger, safety mechanisms - Simplify long-term memory page, link to deep-dream for details - New cli/memory-knowledge.mdx (zh/en/ja): memory and knowledge commands - Move knowledge commands from general.mdx to memory-knowledge.mdx - Register new pages in docs.json navigation for all languages - Add /memory dream to cli/index.mdx command tables	2026-04-14 11:03:53 +08:00
zhayujie	94d8c7e366	feat(dream): add Dream Diary tab to memory management page - Backend: MemoryService supports category param (memory/dream), lists memory/dreams/*.md - Backend: MemoryContentHandler resolves dream files from memory/dreams/ directory - Frontend: add tab switcher (Memory Files / Dream Diary) matching knowledge tab style - Frontend: dream entries show purple "Dream" badge, empty state with moon icon - Cloud dispatch passes category param for consistency	2026-04-13 22:08:15 +08:00
zhayujie	ea1a0c8b3d	feat(memory): add Deep Dream module for daily memory distillation - Add Deep Dream: nightly distill daily memories → refined MEMORY.md + dream diary - Simplify flush prompt to daily-only, defer MEMORY.md maintenance to Deep Dream - Remove dead code (_append_to_main_memory) and fix fallback summary logic - Add shrinkage protection and input dedup for dream process - Ensure flush threads complete before dream starts - Update docs (zh/en/ja) with dream diary and distillation mechanism	2026-04-13 21:32:52 +08:00
zhayujie	7bc88c17e4	Merge branch 'master' of github.com:zhayujie/chatgpt-on-wechat	2026-04-13 20:13:30 +08:00
zhayujie	33cf1bc4c3	feat(memory): async LLM context summary injection on trim - Unified flush + context injection into a single async LLM call (flush_from_messages accepts context_summary_callback) - Fixed response parsing bug: handle generator returns and Claude-format dicts from bot.call_with_tools, which previously caused all LLM summaries to silently fail (falling back to rule-based extraction) - Removed standalone context summary prompts and methods; reuse the existing [DAILY]/[MEMORY] summarization pipeline - Updated docs (zh/en/ja) to reflect the new injection behavior	2026-04-13 20:13:05 +08:00
zhayujie	9402e63fe1	Merge pull request #2766 from zhayujie/feat-mulit-session feat(web): add multi-session management for web console	2026-04-13 18:51:07 +08:00
zhayujie	90e4d494b2	feat(web): add multi-session management for web console	2026-04-13 18:50:31 +08:00
zhayujie	da97e948ca	feat: refine memory recall/write prompts for better precision and proactivity	2026-04-13 18:02:06 +08:00
zhayujie	89a07e8e74	feat: add enable_thinking config to control deep reasoning on web console	2026-04-13 16:06:28 +08:00
zhayujie	5162da5654	Merge branch 'master' into feat-knowledge	2026-04-12 16:46:38 +08:00
zhayujie	a1d82f6193	feat(knowledge): add cli and update docs	2026-04-12 16:39:06 +08:00
zhayujie	ea78e3d0c6	feat(knowledge): document link supports jumping to view	2026-04-11 20:16:43 +08:00
zhayujie	26693acc3f	feat(vision): prioritize main model for image recognition with multi-provider fallback - Add call_vision method to all bot implementations (DashScope, Claude, Gemini, ZhipuAI, MiniMax, Doubao, Moonshot, OpenAICompatibleBot) using each vendor's native multimodal API format - Remove call_with_tools/call_vision from Bot base class to fix MRO shadowing issue with OpenAICompatibleBot mixin - Refactor vision tool provider resolution: MainModel → other configured models (auto-discovered) → OpenAI → LinkAI, with automatic fallback - Return actual model name used in call_vision responses - Sync config.json API keys to .env bidirectionally on startup - Fix bot instance cache to detect bot_type/use_linkai config changes - Add SSE reconnection support for web console - Preserve image path hints in Gemini text for correct vision tool calls - Update docs/tools/vision.mdx	2026-04-11 19:46:11 +08:00
zhayujie	5a10476010	feat: add knowledge switch and cli	2026-04-11 16:44:25 +08:00
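
Commit caaf006a49 describes an environment bug: passing a server's "env" dict straight to subprocess.Popen replaced the whole process environment, so npx lost PATH. The sketch below illustrates the general fix of layering config values on top of the inherited environment. The function names `build_child_env` and `spawn_mcp_server` are hypothetical and are not taken from the repository.

```python
import os
import subprocess


def build_child_env(config_env=None, base_env=None):
    """Return the inherited environment with config values layered on top.

    Hypothetical helper: base_env defaults to os.environ so PATH and other
    inherited variables survive; config_env entries win on key conflicts.
    """
    env = dict(os.environ if base_env is None else base_env)
    env.update(config_env or {})
    return env


def spawn_mcp_server(command, args, config_env=None):
    """Start a stdio child process with the merged environment (illustrative)."""
    return subprocess.Popen(
        [command, *args],
        env=build_child_env(config_env),
        stdin=subprocess.PIPE,
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
    )
```

Copying into a new dict keeps os.environ itself unchanged, so one server's API key does not leak into the environment of the next server that is spawned.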


195 Commits
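
Commit b861eef26f mentions two stability fixes that are easy to picture in code: skipping JSON-RPC messages that have no 'id' field (notifications) while waiting for a response, and wrapping send plus receive in a lock so concurrent sessions cannot interleave on one client. The following is a minimal sketch of that idea, not the code in mcp_client.py. The names `read_response` and `call` are hypothetical, and matching on the request id is an added assumption beyond what the commit message states.

```python
import io
import json
import threading

# One lock per client; a module-level lock is used here only for brevity.
_call_lock = threading.Lock()


def read_response(stream, request_id):
    """Read newline-delimited JSON until the response for request_id arrives."""
    for line in stream:
        line = line.strip()
        if not line:
            continue
        message = json.loads(line)
        if "id" not in message:
            # A message without 'id' is a notification, not a response.
            continue
        if message["id"] == request_id:
            return message
    raise EOFError("stream closed before a response arrived")


def call(write, stream, request_id, method, params=None):
    """Send one request and wait for its response while holding the lock."""
    request = {"jsonrpc": "2.0", "id": request_id, "method": method,
               "params": params or {}}
    with _call_lock:
        write(json.dumps(request) + "\n")
        return read_response(stream, request_id)


if __name__ == "__main__":
    fake_server_output = io.StringIO(
        '{"jsonrpc": "2.0", "method": "notifications/progress"}\n'
        '{"jsonrpc": "2.0", "id": 1, "result": {"ok": true}}\n'
    )
    sent = []
    print(call(sent.append, fake_server_output, 1, "tools/list"))
```

The read timeout that the same commit adds via select.select() is left out of this sketch; without it, a dead server would still block the reader.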