Remove `speech_monitor` dependency from `asr_worker` service definition.
5cf0dbe
liumaolincommited on
Add echo cancellation and VAD toggle support in service factories and routes
2ecfa8f
liumaolincommited on
Fix help text for `--disable-echo-cancellation` to clarify the default behavior is not disabled
d846f85
liumaolincommited on
Refactor threading in `launcher.py` to standardize worker initialization, enforce daemon mode, and improve naming consistency.
7d8046a
liumaolincommited on
Handle overflow errors in audio capture by adding `exception_on_overflow=False` and skip processing when no data is available.
5f9eaee
liumaolincommited on
Add `--disable-echo-cancellation` CLI option and update audio pipeline to support toggling echo cancellation and VAD
4e071d3
liumaolincommited on
Integrate `SileroVAD` into `SpeechMonitor` for optional voice activity detection. Add `_detect_speech()` method and update queue handling logic. Implement `SileroVAD` as a singleton for efficient model management.
4e2e3d8
liumaolincommited on
Increase queue timeout in audio and text processing services for smoother task handling
b446464
liumaolincommited on
Refactor audio processing pipeline to normalize data in `SpeechMonitor` and streamline queuing in `AudioCapture`
57b0084
liumaolincommited on
Update `AudioCapture` to support both PyAudio and macOS native AEC+VAD libraries
99e8988
liumaolincommited on
Refactor to replace `EchoCancellingAudioCapture` with `AudioCapture` across the codebase for improved clarity and flexibility
7437d6d
liumaolincommited on
Clean input text in MoYoYo TTS by removing punctuation for better processing
8587958
liumaolincommited on
Refactor LlamaCpp initialization to simplify parameter handling and remove unused callback manager
941bf07
liumaolincommited on
Enable debug mode with global configuration and detailed task logging when active
e0f42b2
liumaolincommited on
Reset task ID in speech recognizer for empty transcriptions to prevent errors
2291ed2
liumaolincommited on
Add new voice model "Doubao" to MoYoYo configuration
7b003c4
liumaolincommited on
Update performance logging format in TTS player for improved structure and readability
d0c1c61
liumaolincommited on
Enhance launcher startup log formatting for improved readability and visual appeal
fa296dd
liumaolincommited on
Remove commented-out performance logging code from TTS player
c3e85a2
liumaolincommited on
Add new voice model "Ellen" to MoYoYo configuration
b5b48f0
liumaolincommited on
Update MoYoYo TTS prompt text for improved relevance and clarity
8228973
liumaolincommited on
Improve speech recognizer to handle empty transcriptions
0cbda14
liumaolincommited on
Simplify system prompts for text generation in Chinese and English
c545fd9
liumaolincommited on
Add new voice model "Juniper" to MoYoYo configuration
469433f
liumaolincommited on
Standardize punctuation for system prompts in both Chinese and English text generation modules.
bedd7b8
liumaolincommited on
Enhance WebSocket handling for connection management and reliability
b115e26
liumaolincommited on
Add session validation checks to `player.py` and `generator.py`
29766c6
liumaolincommited on
Refactor WebSocket handling with connection manager
300d567
liumaolincommited on
Replace `logging` with centralized `loguru`-based logger across all modules.
851495c
liumaolincommited on
Refactor response generation logic in `generator.py`
ce3d9e5
liumaolincommited on
Remove unused conditional logic for second answer handling in `player.py` and `generator.py`
c1b24fd
liumaolincommited on
Adjust context window allocation logic based on memory tiers in `apple_silicon.py`
fd3c30a
liumaolincommited on
Comment out unused Kokoro TTS voice configurations
23c146f
liumaolincommited on
Add Maple and Cove voice models to MoYoYo TTS configuration
7e92ad3
liumaolincommited on
Introduce Apple Silicon hardware optimization and dynamic LLM configuration
bdc3b7b
liumaolincommited on
Update LLM response generator and system prompts
6f77a29
liumaolincommited on
Update static file routing and root endpoint for frontend integration
f7b034a
liumaolincommited on
Add robust lifecycle management for `audio_player` service in system routes
627c3e7
liumaolincommited on
Standardize service lifecycle management by replacing `stop` with `exit` and introducing `is_exited` check
f5226c0
liumaolincommited on
Remove `voice_schemas.py` and refactor schema imports for TTS and ASR modules in `__init__.py`
4895dc2
liumaolincommited on
Refactor speech recognizer, audio capture, and system routes for improved clarity and functionality
037e5ae
liumaolincommited on
Add pause and resume functionality to voice dialogue system
d701b8a
liumaolincommited on
Refactor project: split `main.py` functionality into modular components under `cli`, `core`, and `config`.
d08a15b
liumaolincommited on
Increase service startup timeouts and set daemon mode for services.
61524a8
liumaolincommited on
Refactor imports in `whisper.py` and `funasr.py` to use absolute paths for `ensure_minimum_audio_duration`.
d673573
liumaolincommited on
Update `moyoyo.py`: add fallback for `utils` to ensure `HParams` availability in runtime.
bd3673b
liumaolincommited on
Refactor imports for consistency in `kokoro.py` and `processor.py`. Use absolute paths for better readability and maintainability.
8630353
liumaolincommited on
Update `paths.py`: improve PROJECT_ROOT resolution with `_MEIPASS` support and enhance third-party path handling.
664d767
liumaolincommited on
Rename 'src/VoiceDialogue' to 'src/voice_dialogue'.
511ff0c
liumaolincommited on
Revamp API core description: expand feature details for ASR, LLMs, TTS, system control, and real-time communication; improve clarity and structure of documentation.