Add `has_no_words` check to skip punctuation-only TTS tasks and enhance debug logs
b636027
liumaolincommited on
Refactor SpeechMonitor to use active audio frame duration instead of count
15891ec
liumaolincommited on
Add TTS generation error handling and `is_task_interrupted` helper function
5ecb408
liumaolincommited on
Optimize VAD logic by replacing `np.max(probs)` with `any(prob >= threshold)` for improved readability and efficiency.
9273b76
liumaolincommited on
Increase audio capture chunk size to 1024 in `capture.py` for smoother streaming
095cfb6
liumaolincommited on
Remove `speech_monitor` dependency from `asr_worker` service definition.
5cf0dbe
liumaolincommited on
Add echo cancellation and VAD toggle support in service factories and routes
2ecfa8f
liumaolincommited on
Fix help text for `--disable-echo-cancellation` to clarify the default behavior is not disabled
d846f85
liumaolincommited on
Refactor threading in `launcher.py` to standardize worker initialization, enforce daemon mode, and improve naming consistency.
7d8046a
liumaolincommited on
Handle overflow errors in audio capture by adding `exception_on_overflow=False` and skip processing when no data is available.
5f9eaee
liumaolincommited on
Add `--disable-echo-cancellation` CLI option and update audio pipeline to support toggling echo cancellation and VAD
4e071d3
liumaolincommited on
Integrate `SileroVAD` into `SpeechMonitor` for optional voice activity detection. Add `_detect_speech()` method and update queue handling logic. Implement `SileroVAD` as a singleton for efficient model management.
4e2e3d8
liumaolincommited on
Increase queue timeout in audio and text processing services for smoother task handling
b446464
liumaolincommited on
Refactor audio processing pipeline to normalize data in `SpeechMonitor` and streamline queuing in `AudioCapture`
57b0084
liumaolincommited on
Update `AudioCapture` to support both PyAudio and macOS native AEC+VAD libraries
99e8988
liumaolincommited on
Refactor to replace `EchoCancellingAudioCapture` with `AudioCapture` across the codebase for improved clarity and flexibility
7437d6d
liumaolincommited on
Clean input text in MoYoYo TTS by removing punctuation for better processing
8587958
liumaolincommited on
Refactor LlamaCpp initialization to simplify parameter handling and remove unused callback manager
941bf07
liumaolincommited on
Enable debug mode with global configuration and detailed task logging when active
e0f42b2
liumaolincommited on
Reset task ID in speech recognizer for empty transcriptions to prevent errors
2291ed2
liumaolincommited on
Add new voice model "Doubao" to MoYoYo configuration
7b003c4
liumaolincommited on
Update performance logging format in TTS player for improved structure and readability
d0c1c61
liumaolincommited on
Enhance launcher startup log formatting for improved readability and visual appeal
fa296dd
liumaolincommited on
Remove commented-out performance logging code from TTS player
c3e85a2
liumaolincommited on
Add new voice model "Ellen" to MoYoYo configuration
b5b48f0
liumaolincommited on
Update MoYoYo TTS prompt text for improved relevance and clarity
8228973
liumaolincommited on
Improve speech recognizer to handle empty transcriptions
0cbda14
liumaolincommited on
Simplify system prompts for text generation in Chinese and English
c545fd9
liumaolincommited on
Add new voice model "Juniper" to MoYoYo configuration
469433f
liumaolincommited on
Standardize punctuation for system prompts in both Chinese and English text generation modules.
bedd7b8
liumaolincommited on
Enhance WebSocket handling for connection management and reliability
b115e26
liumaolincommited on
Add session validation checks to `player.py` and `generator.py`
29766c6
liumaolincommited on
Refactor WebSocket handling with connection manager
300d567
liumaolincommited on
Replace `logging` with centralized `loguru`-based logger across all modules.
851495c
liumaolincommited on
Refactor response generation logic in `generator.py`
ce3d9e5
liumaolincommited on
Remove unused conditional logic for second answer handling in `player.py` and `generator.py`
c1b24fd
liumaolincommited on
Adjust context window allocation logic based on memory tiers in `apple_silicon.py`
fd3c30a
liumaolincommited on
Comment out unused Kokoro TTS voice configurations
23c146f
liumaolincommited on
Add Maple and Cove voice models to MoYoYo TTS configuration
7e92ad3
liumaolincommited on
Introduce Apple Silicon hardware optimization and dynamic LLM configuration
bdc3b7b
liumaolincommited on
Update LLM response generator and system prompts
6f77a29
liumaolincommited on
Update static file routing and root endpoint for frontend integration
f7b034a
liumaolincommited on
Add robust lifecycle management for `audio_player` service in system routes
627c3e7
liumaolincommited on
Standardize service lifecycle management by replacing `stop` with `exit` and introducing `is_exited` check
f5226c0
liumaolincommited on
Remove `voice_schemas.py` and refactor schema imports for TTS and ASR modules in `__init__.py`
4895dc2
liumaolincommited on
Refactor speech recognizer, audio capture, and system routes for improved clarity and functionality
037e5ae
liumaolincommited on
Add pause and resume functionality to voice dialogue system
d701b8a
liumaolincommited on
Refactor project: split `main.py` functionality into modular components under `cli`, `core`, and `config`.