chatgpt-on-wechat

mirror of https://github.com/zhayujie/chatgpt-on-wechat.git synced 2026-07-18 12:07:15 +08:00

Author	SHA1	Message	Date
zhayujie	c5a3f991c5	fix(scheduler): make cron pushes survive restart on weixin channel	2026-05-25 12:15:57 +08:00
zhayujie	840dabeccd	fix(weixin): cap thinking messages to avoid rate-limit drops	2026-05-22 17:42:50 +08:00
zhayujie	b7734c3926	feat(search): multi-provider web search + console integration Search tool now supports 4 backends with unified output (bocha, qianfan, zhipu, linkai) and a routing layer: - strategy 'auto' (default): pick first configured in canonical order bocha > qianfan > zhipu > linkai - strategy 'fixed': pin a specific provider - agent may pass `provider` to override per-call (only exposed when ≥2 providers configured + auto strategy)	2026-05-21 19:58:03 +08:00
zhayujie	2b90f377e6	feat(voice): add dashscope & zhipu ASR, in-page mic input	2026-05-20 22:36:37 +08:00
zhayujie	3ffb563a44	feat(memory): support multi-vendor embedding fallback Add embedding_provider config knob with native support for openai / dashscope / doubao / zhipu / linkai, plus an in-chat /memory status and /memory rebuild-index workflow for switching vendors safely.	2026-05-20 11:00:53 +08:00
zhayujie	a0dfdb79df	feat(browser): persistent login + CDP attach mode #2809 Browser sessions now reuse a Chromium user profile across runs by default (`~/.cow/browser_profile`), so users only log in to a site once. Three launch modes are selectable via `tools.browser` in config.json: - persistent (default): Playwright Chromium with a persistent user_data_dir - cdp: attach to an externally launched real Chrome via `cdp_endpoint` (full fingerprints, ideal for sites with strict bot detection) - fresh: clean context every run, set `persistent: false` Also: - Self-heal when the user closes the browser window mid-session: detect closed page/context/browser via close listeners and exception scanning, then transparently relaunch on the next request. - Graceful CDP shutdown: disconnect only, never kill the user's Chrome. - Friendly errors when the CDP endpoint is unreachable or the persistent profile is locked, so the LLM can guide the user instead of looping. - Fix tool config being silently overwritten by workspace config in AgentInitializer; per-tool user settings (e.g. browser.cdp_endpoint) are now merged instead of replaced. - Update zh / en / ja docs with the new login-persistence section, including the Chrome 137+ requirement to pair --remote-debugging-port with a dedicated --user-data-dir.	2026-05-19 11:52:11 +08:00
zhayujie	a85c5f9d4e	fix(scheduler): make scheduler init idempotent to prevent duplicate task runs	2026-05-18 18:36:48 +08:00
zhayujie	f5479c56af	feat(models): support reasoning_effort config for DeepSeek V4	2026-05-15 18:17:35 +08:00
zhayujie	29e66cb186	fix(mcp): correct hot-reload sync on default Agent	2026-05-08 15:40:29 +08:00
zhayujie	307769b949	feat(mcp): load MCP servers asynchronously at startup Boot MCP servers (npx/uvx) on a background thread instead of blocking agent init. Built-in tools serve traffic immediately while MCP comes online; each new agent reads whatever is ready at creation time. Idempotent via _mcp_loaded flag — concurrent sessions never re-fork subprocesses. Per-server failures are isolated and warmup is triggered in app.py so loading overlaps with channel startup.	2026-05-08 15:22:42 +08:00
ooaaooaa123	caaf006a49	fix(mcp): wire MCP tools into agent and fix env var inheritance Two bugs found during end-to-end validation with Amap and Chrome DevTools MCP servers: 1. MCP tools were loaded into ToolManager._mcp_tool_instances but never added to the agent's tool list. AgentInitializer._load_tools() only iterated tool_classes (built-in tools). Added a second pass to append all MCP tool instances. 2. When a MCP server config contains an "env" dict, it was passed directly to subprocess.Popen, replacing the entire process environment. This caused npx to fail because PATH and other inherited vars were missing. Fixed by merging config env on top of os.environ. Validated with: - @amap/amap-maps-mcp-server (12 tools, stdio + API key env var) - chrome-devtools-mcp (29 tools, stdio + remote debugging port) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-06 20:40:56 +08:00
zhayujie	530fc20596	Merge pull request #2790 from jimmyzhuu/feat/qianfan-provider Add first-class Baidu Qianfan / ERNIE provider	2026-05-06 11:43:32 +08:00
zhayujie	67bd3420ed	perf(scheduler): bound isolated session context to agent_max_context_turns/5	2026-05-03 21:49:59 +08:00
zhayujie	aea081703f	fix(scheduler): inject delivered output into receiver session with sliding window Further refinements on top of #2795: - persist real session_id (notify_session_id) at task creation so group chats correctly map back to the user's actual conversation - mark scheduler turns with [SCHEDULED] (recognise legacy "Scheduled task" prefix too for backward-compatible pruning) - prune both DB and in-memory to scheduler_inject_max_per_session (default 3), only marker-tagged pairs are touched; regular user turns never deleted - send_message type gated by scheduler_inject_send_message (default false) — fixed reminder text rarely benefits follow-up Q&A Co-authored-by: huangrichao2020 <grdomai43881@gmail.com>	2026-05-03 21:27:24 +08:00
tingchim2pro	f150d7d83a	fix: remember scheduled task outputs in receiver session (v2) Address review feedback from #2794: 1. Use notify_session_id instead of receiver for correct group chat mapping - Task creation should store the real session_id in action.notify_session_id - Falls back to receiver for backward compatibility with old tasks 2. Add injection to all four execution branches: - _execute_agent_task - _execute_send_message - _execute_tool_call - _execute_skill_call (also fixed missing channel.send) 3. Add config switch and content truncation: - scheduler_inject_to_session (default: true) to toggle the feature - 2000 char limit prevents high-frequency tasks from bloating sessions Fixes #2793	2026-05-02 19:00:50 +08:00
jimmyzhuu	9eeca70292	feat: register qianfan model provider	2026-04-29 15:52:32 +08:00
zhayujie	02bfe30848	fix(memory): prevent duplicate Deep Dream runs	2026-04-28 15:30:51 +08:00
zhayujie	ae11159918	feat(models): unify enable_thinking for deepseek-v4 and other thinking models	2026-04-24 15:22:45 +08:00
zhayujie	81e8bb62ae	feat(skill): support gpt-image-2 in image generation skill	2026-04-22 20:39:49 +08:00
zhayujie	6dd316547f	fix(web): fix session title generation fallback and reset Bridge on config change	2026-04-19 18:43:48 +08:00
zhayujie	13370d2056	fix: thinking display is disabled by default	2026-04-17 15:31:59 +08:00
zhayujie	cabd24605f	fix: add random jitter to daily dream schedule	2026-04-15 00:33:33 +08:00
zhayujie	ea1a0c8b3d	feat(memory): add Deep Dream module for daily memory distillation - Add Deep Dream: nightly distill daily memories → refined MEMORY.md + dream diary - Simplify flush prompt to daily-only, defer MEMORY.md maintenance to Deep Dream - Remove dead code (_append_to_main_memory) and fix fallback summary logic - Add shrinkage protection and input dedup for dream process - Ensure flush threads complete before dream starts - Update docs (zh/en/ja) with dream diary and distillation mechanism	2026-04-13 21:32:52 +08:00
zhayujie	89a07e8e74	feat: add enable_thinking config to control deep reasoning on web console	2026-04-13 16:06:28 +08:00
zhayujie	5162da5654	Merge branch 'master' into feat-knowledge	2026-04-12 16:46:38 +08:00
zhayujie	3497f00cb4	Merge pull request #2759 from zhayujie/feat-multimodel feat(vision): prioritize main model for image recognition	2026-04-11 19:55:15 +08:00
zhayujie	26693acc3f	feat(vision): prioritize main model for image recognition with multi-provider fallback - Add call_vision method to all bot implementations (DashScope, Claude, Gemini, ZhipuAI, MiniMax, Doubao, Moonshot, OpenAICompatibleBot) using each vendor's native multimodal API format - Remove call_with_tools/call_vision from Bot base class to fix MRO shadowing issue with OpenAICompatibleBot mixin - Refactor vision tool provider resolution: MainModel → other configured models (auto-discovered) → OpenAI → LinkAI, with automatic fallback - Return actual model name used in call_vision responses - Sync config.json API keys to .env bidirectionally on startup - Fix bot instance cache to detect bot_type/use_linkai config changes - Add SSE reconnection support for web console - Preserve image path hints in Gemini text for correct vision tool calls - Update docs/tools/vision.mdx	2026-04-11 19:46:11 +08:00
6vision	90d1835353	fix: send generic file types (tar.gz, zip, etc.) as FILE instead of TEXT Previously, files with extensions not in the known categories (image, document, video, audio) fell through to a fallback that returned ReplyType.TEXT, causing the file to never actually be sent to the user. Now the fallback uses ReplyType.FILE so all file types are delivered. Made-with: Cursor	2026-04-11 15:45:34 +08:00
zhayujie	6a737fb734	feat: display thinking content in web console	2026-04-10 15:07:23 +08:00
zhayujie	9cc173cc4d	fix: use dynamic model name in system prompt runtime info	2026-04-02 17:01:56 +08:00
zhayujie	b5f33e5ecd	feat: support qwen3.6-plus	2026-04-02 16:46:58 +08:00
zhayujie	61732aecfc	Merge pull request #2721 from yrk111222/feat/modelscope-update Feat/modelscope update	2026-03-30 11:39:50 +08:00
zhayujie	d09ae49287	feat(browser): auto-snapshot on navigate, screenshot prompt guidance Browser tool enhancements: - Navigate action now auto-includes snapshot result, saving one LLM round-trip - Wait for networkidle + 800ms after navigation for SPA/JS-rendered pages - Prompt guides agent to screenshot key results and ask user for login/CAPTCHA help - Fixed playwright version pinned to 1.52.0; mirror fallback to official CDN on failure Web console file/image support: - SSE real-time push for images and files via on_event (file_to_send) - Added /api/file endpoint to serve local files for web preview - Frontend renders images in media-content container (survives delta/done overwrites) - File attachment cards with download links; RFC 5987 encoding for non-ASCII filenames Tool workspace fix: - Inject workspace_dir as cwd into send and browser tools (previously only file tools) - Screenshots now save to ~/cow/tmp/ instead of project directory	2026-03-29 19:09:11 +08:00
yrk	4c1c42efac	feat: update modelscope bot	2026-03-24 10:43:45 +08:00
6vision	f512b55ec2	feat(deepseek): add independent DeepSeek bot module with dedicated config Separate DeepSeek from ChatGPTBot into its own module (models/deepseek/) with dedicated deepseek_api_key and deepseek_api_base config fields, avoiding config conflicts when switching between providers. Backward compatible with old users who configured DeepSeek via open_ai_api_key/open_ai_api_base through automatic fallback. Made-with: Cursor	2026-03-23 21:23:35 +08:00
zhayujie	b4e711f411	feat: add request header	2026-03-19 17:06:05 +08:00
zhayujie	4efae41048	feat: support coding plan	2026-03-18 11:59:22 +08:00
zhayujie	5e42996b36	fix: guide LLM to use matching skill when tool not found	2026-03-17 18:34:09 +08:00
zhayujie	4fec55cc01	feat: web_featch tool support remote file url	2026-03-11 17:16:39 +08:00
zhayujie	b21e945c76	feat: optimize bootstrap flow	2026-03-11 11:27:08 +08:00
6vision	d0a70d3339	update:Adjust bot_type resolution priority in Agent mode	2026-03-10 15:14:01 +08:00
Weikjssss	36d54cab52	fix: pass bot_type in agent mode	2026-03-10 14:28:39 +08:00
zhayujie	6be2034110	feat: add fallback embedding provider	2026-03-09 11:03:31 +08:00
zhayujie	022c13f3a4	feat: upgrade memory flush system - Use LLM to summarize discarded context into concise daily memory entries - Batch trim to half when exceeding max_turns/max_tokens, reducing flush frequency - Run summarization asynchronously in background thread, no blocking on replies - Add daily scheduled flush (23:55) as fallback for low-activity days - Sync trimmed messages back to agent to keep context state consistent	2026-03-08 21:56:12 +08:00
zhayujie	0f23b209ad	fix: adjust the context of restart loading	2026-03-03 11:38:14 +08:00
zhayujie	a773eb7893	fix: filter history to one user and one assistant per turn	2026-02-28 18:09:02 +08:00
zhayujie	e9c57ddf4d	fix: adjust default turns	2026-02-28 15:25:20 +08:00
zhayujie	a33ce97ed9	fix: restore only user/assistant text from history, strip tool calls Made-with: Cursor	2026-02-28 15:14:56 +08:00
zhayujie	b788a3dd4e	fix: incomplete historical session messages	2026-02-28 15:03:33 +08:00
zhayujie	7d258b5202	feat(channels): add multi-channel management UI with real-time connect/disconnect - Web console Channels page: display active channels as config cards, support save/connect/disconnect with real-time start/stop of channel processes - Custom dropdown for channel selection (consistent with model selector style), custom confirmation dialog for disconnect - Fix channel stop: use sys.modules['__main__'] to access live ChannelManager - Fix web request pending: move stop logic outside lock, set daemon_threads=True - Fix reconnect: new asyncio event loop per startup, ctypes thread interrupt, 5s grace period before re-establishing remote connection - Filter stale offline messages (>60s) pushed after reconnect	2026-02-27 14:39:40 +08:00

1 2 3 4

161 Commits