Commit History

Remove `speech_monitor` dependency from `asr_worker` service definition.
5cf0dbe

liumaolin commited on

Add echo cancellation and VAD toggle support in service factories and routes
2ecfa8f

liumaolin commited on

Fix help text for `--disable-echo-cancellation` to clarify the default behavior is not disabled
d846f85

liumaolin commited on

Refactor threading in `launcher.py` to standardize worker initialization, enforce daemon mode, and improve naming consistency.
7d8046a

liumaolin commited on

Handle overflow errors in audio capture by adding `exception_on_overflow=False` and skip processing when no data is available.
5f9eaee

liumaolin commited on

Add `--disable-echo-cancellation` CLI option and update audio pipeline to support toggling echo cancellation and VAD
4e071d3

liumaolin commited on

Integrate `SileroVAD` into `SpeechMonitor` for optional voice activity detection. Add `_detect_speech()` method and update queue handling logic. Implement `SileroVAD` as a singleton for efficient model management.
4e2e3d8

liumaolin commited on

Increase queue timeout in audio and text processing services for smoother task handling
b446464

liumaolin commited on

Refactor audio processing pipeline to normalize data in `SpeechMonitor` and streamline queuing in `AudioCapture`
57b0084

liumaolin commited on

Update `AudioCapture` to support both PyAudio and macOS native AEC+VAD libraries
99e8988

liumaolin commited on

Refactor to replace `EchoCancellingAudioCapture` with `AudioCapture` across the codebase for improved clarity and flexibility
7437d6d

liumaolin commited on

Clean input text in MoYoYo TTS by removing punctuation for better processing
8587958

liumaolin commited on

Refactor LlamaCpp initialization to simplify parameter handling and remove unused callback manager
941bf07

liumaolin commited on

Enable debug mode with global configuration and detailed task logging when active
e0f42b2

liumaolin commited on

Reset task ID in speech recognizer for empty transcriptions to prevent errors
2291ed2

liumaolin commited on

Add new voice model "Doubao" to MoYoYo configuration
7b003c4

liumaolin commited on

Update performance logging format in TTS player for improved structure and readability
d0c1c61

liumaolin commited on

Enhance launcher startup log formatting for improved readability and visual appeal
fa296dd

liumaolin commited on

Remove commented-out performance logging code from TTS player
c3e85a2

liumaolin commited on

Add new voice model "Ellen" to MoYoYo configuration
b5b48f0

liumaolin commited on

Update MoYoYo TTS prompt text for improved relevance and clarity
8228973

liumaolin commited on

Improve speech recognizer to handle empty transcriptions
0cbda14

liumaolin commited on

Simplify system prompts for text generation in Chinese and English
c545fd9

liumaolin commited on

Add new voice model "Juniper" to MoYoYo configuration
469433f

liumaolin commited on

Standardize punctuation for system prompts in both Chinese and English text generation modules.
bedd7b8

liumaolin commited on

Enhance WebSocket handling for connection management and reliability
b115e26

liumaolin commited on

Add session validation checks to `player.py` and `generator.py`
29766c6

liumaolin commited on

Refactor WebSocket handling with connection manager
300d567

liumaolin commited on

Replace `logging` with centralized `loguru`-based logger across all modules.
851495c

liumaolin commited on

Refactor response generation logic in `generator.py`
ce3d9e5

liumaolin commited on

Remove unused conditional logic for second answer handling in `player.py` and `generator.py`
c1b24fd

liumaolin commited on

Adjust context window allocation logic based on memory tiers in `apple_silicon.py`
fd3c30a

liumaolin commited on

Comment out unused Kokoro TTS voice configurations
23c146f

liumaolin commited on

Add Maple and Cove voice models to MoYoYo TTS configuration
7e92ad3

liumaolin commited on

Introduce Apple Silicon hardware optimization and dynamic LLM configuration
bdc3b7b

liumaolin commited on

Update LLM response generator and system prompts
6f77a29

liumaolin commited on

Update static file routing and root endpoint for frontend integration
f7b034a

liumaolin commited on

Add robust lifecycle management for `audio_player` service in system routes
627c3e7

liumaolin commited on

Standardize service lifecycle management by replacing `stop` with `exit` and introducing `is_exited` check
f5226c0

liumaolin commited on

Remove `voice_schemas.py` and refactor schema imports for TTS and ASR modules in `__init__.py`
4895dc2

liumaolin commited on

Refactor speech recognizer, audio capture, and system routes for improved clarity and functionality
037e5ae

liumaolin commited on

Add pause and resume functionality to voice dialogue system
d701b8a

liumaolin commited on

Refactor project: split `main.py` functionality into modular components under `cli`, `core`, and `config`.
d08a15b

liumaolin commited on

Increase service startup timeouts and set daemon mode for services.
61524a8

liumaolin commited on

Refactor imports in `whisper.py` and `funasr.py` to use absolute paths for `ensure_minimum_audio_duration`.
d673573

liumaolin commited on

Update `moyoyo.py`: add fallback for `utils` to ensure `HParams` availability in runtime.
bd3673b

liumaolin commited on

Refactor imports for consistency in `kokoro.py` and `processor.py`. Use absolute paths for better readability and maintainability.
8630353

liumaolin commited on

Update `paths.py`: improve PROJECT_ROOT resolution with `_MEIPASS` support and enhance third-party path handling.
664d767

liumaolin commited on

Rename 'src/VoiceDialogue' to 'src/voice_dialogue'.
511ff0c

liumaolin commited on

Revamp API core description: expand feature details for ASR, LLMs, TTS, system control, and real-time communication; improve clarity and structure of documentation.
c57de2a

liumaolin commited on