14 Commits

Author SHA1 Message Date
zhayujie
b8333e351c feat(voice): rework TTS/ASR stack and unify tool/skill config schema 2026-05-21 16:00:54 +08:00
zhayujie
b80c3fe5a8 feat(feishu): enhance #2791 with cardkit streaming + ASR fixes
- rewrite streaming reply to official cardkit v2.0 API (default on, auto-fallback)
- fix Whisper hallucination: bump ASR sample rate to 16k, pass language=zh
- fix lock-over-IO and tmp file cleanup from #2791
- drop deprecated feishu_bot_name; quiet unknown-key warnings
- docs: cardkit permission and feishu_stream_reply usage
2026-05-05 14:15:25 +08:00
zhayujie
8752f0cc60 refactor(openai): drop SDK dependency and switch to native HTTP client 2026-04-27 20:21:54 +08:00
Lecter
b494ee2f1c fix openai voice_to_text whisper 2024-04-14 14:33:17 +08:00
zhayujie
e5ba26febe fix: tts voice base url 2023-12-06 00:31:31 +08:00
zhayujie
0c8f292e12 feat: add tts speech model 2023-11-10 10:48:52 +08:00
zhayujie
f001ac6903 feat: add dalle3 gpt-4-turbo model change 2023-11-10 10:11:02 +08:00
lanvent
618c94edb8 formatting: run precommit on all files 2023-04-22 12:01:29 +08:00
lanvent
8f72e8c3e6 formatting code 2023-04-17 01:01:02 +08:00
lanvent
e78886fb35 feat: new voice class pytts 2023-03-28 03:14:26 +08:00
lanvent
ad6ae0b32a refactor: use enum to specify type 2023-03-13 19:44:24 +08:00
lanvent
8d2e81815c compatible for voice 2023-03-13 00:12:34 +08:00
wanggang
d7a8854fa1 [voice] add support for whisper-1 model 2023-03-08 11:32:27 +08:00
wanggang
882e6c3576 [voice] add support for whisper 2023-03-08 11:02:01 +08:00