zhayujie
b8333e351c
feat(voice): rework TTS/ASR stack and unify tool/skill config schema
2026-05-21 16:00:54 +08:00
zhayujie
b80c3fe5a8
feat(feishu): enhance #2791 with cardkit streaming + ASR fixes
...
- rewrite streaming reply to official cardkit v2.0 API (default on, auto-fallback)
- fix Whisper hallucination: bump ASR sample rate to 16k, pass language=zh
- fix lock-over-IO and tmp file cleanup from #2791
- drop deprecated feishu_bot_name; quiet unknown-key warnings
- docs: cardkit permission and feishu_stream_reply usage
2026-05-05 14:15:25 +08:00
zhayujie
8752f0cc60
refactor(openai): drop SDK dependency and switch to native HTTP client
2026-04-27 20:21:54 +08:00
Lecter
b494ee2f1c
fix openai voice_to_text whisper
2024-04-14 14:33:17 +08:00
zhayujie
e5ba26febe
fix: tts voice base url
2023-12-06 00:31:31 +08:00
zhayujie
0c8f292e12
feat: add tts speech model
2023-11-10 10:48:52 +08:00
zhayujie
f001ac6903
feat: add dalle3 gpt-4-turbo model change
2023-11-10 10:11:02 +08:00
lanvent
618c94edb8
formatting: run precommit on all files
2023-04-22 12:01:29 +08:00
lanvent
8f72e8c3e6
formatting code
2023-04-17 01:01:02 +08:00
lanvent
e78886fb35
feat: new voice class pytts
2023-03-28 03:14:26 +08:00
lanvent
ad6ae0b32a
refactor: use enum to specify type
2023-03-13 19:44:24 +08:00
lanvent
8d2e81815c
compatible for voice
2023-03-13 00:12:34 +08:00
wanggang
d7a8854fa1
[voice] add support for whisper-1 model
2023-03-08 11:32:27 +08:00
wanggang
882e6c3576
[voice] add support for wispper
2023-03-08 11:02:01 +08:00