Revise `ACTIVE_FRAME_THRESHOLD` from 0.32s to 0.1s
d662661
liumaolin committed on
Update MoYoYo voice model configuration: change the text-splitting method and add character models
46eb532
liumaolin committed on
Update MoYoYo voice model configuration and add a priority field
efefd3d
liumaolin committed on
Add Mixin class to enhance voice task handling
2ebe57f
liumaolin committed on
Add MoYoYo voice model "白岩松" (Bai Yansong) and related files
d74bcbf
liumaolin committed on
Refactor `is_task_interrupted` method to check the interruption status of voice tasks
02c321c
liumaolin committed on
Add `has_no_words` check to skip punctuation-only TTS tasks and enhance debug logs
b636027
liumaolin committed on
Refactor SpeechMonitor to use active audio frame duration instead of count
15891ec
liumaolin committed on
Add TTS generation error handling and `is_task_interrupted` helper function
5ecb408
liumaolin committed on
Optimize VAD logic by replacing `np.max(probs)` with `any(prob >= threshold)` for improved readability and efficiency.
9273b76
liumaolin committed on
Increase audio capture chunk size to 1024 in `capture.py` for smoother streaming
095cfb6
liumaolin committed on
Remove `speech_monitor` dependency from `asr_worker` service definition.
5cf0dbe
liumaolin committed on
Add echo cancellation and VAD toggle support in service factories and routes
2ecfa8f
liumaolin committed on
Fix help text for `--disable-echo-cancellation` to clarify that echo cancellation is enabled by default
d846f85
liumaolin committed on
Refactor threading in `launcher.py` to standardize worker initialization, enforce daemon mode, and improve naming consistency.
7d8046a
liumaolin committed on
Handle overflow errors in audio capture by adding `exception_on_overflow=False` and skip processing when no data is available.
5f9eaee
liumaolin committed on
Add `--disable-echo-cancellation` CLI option and update audio pipeline to support toggling echo cancellation and VAD
4e071d3
liumaolin committed on
Integrate `SileroVAD` into `SpeechMonitor` for optional voice activity detection. Add `_detect_speech()` method and update queue handling logic. Implement `SileroVAD` as a singleton for efficient model management.
4e2e3d8
liumaolin committed on
Increase queue timeout in audio and text processing services for smoother task handling
b446464
liumaolin committed on
Refactor audio processing pipeline to normalize data in `SpeechMonitor` and streamline queuing in `AudioCapture`
57b0084
liumaolin committed on
Update `AudioCapture` to support both PyAudio and macOS native AEC+VAD libraries
99e8988
liumaolin committed on
Refactor to replace `EchoCancellingAudioCapture` with `AudioCapture` across the codebase for improved clarity and flexibility
7437d6d
liumaolin committed on
Clean input text in MoYoYo TTS by removing punctuation for better processing
8587958
liumaolin committed on
Refactor LlamaCpp initialization to simplify parameter handling and remove unused callback manager
941bf07
liumaolin committed on
Enable debug mode with global configuration and detailed task logging when active
e0f42b2
liumaolin committed on
Reset task ID in speech recognizer for empty transcriptions to prevent errors
2291ed2
liumaolin committed on
Add new voice model "Doubao" to MoYoYo configuration
7b003c4
liumaolin committed on
Update performance logging format in TTS player for improved structure and readability
d0c1c61
liumaolin committed on
Enhance launcher startup log formatting for improved readability and visual appeal
fa296dd
liumaolin committed on
Remove commented-out performance logging code from TTS player
c3e85a2
liumaolin committed on
Add new voice model "Ellen" to MoYoYo configuration
b5b48f0
liumaolin committed on
Update MoYoYo TTS prompt text for improved relevance and clarity
8228973
liumaolin committed on
Improve speech recognizer to handle empty transcriptions
0cbda14
liumaolin committed on
Simplify system prompts for text generation in Chinese and English
c545fd9
liumaolin committed on
Add new voice model "Juniper" to MoYoYo configuration
469433f
liumaolin committed on
Standardize punctuation for system prompts in both Chinese and English text generation modules.