Add pause and resume functionality to voice dialogue system
d701b8a
liumaolincommited on
Refactor project: split `main.py` functionality into modular components under `cli`, `core`, and `config`.
d08a15b
liumaolincommited on
Increase service startup timeouts and set daemon mode for services.
61524a8
liumaolincommited on
Refactor imports in `whisper.py` and `funasr.py` to use absolute paths for `ensure_minimum_audio_duration`.
d673573
liumaolincommited on
Update `moyoyo.py`: add fallback for `utils` to ensure `HParams` availability in runtime.
bd3673b
liumaolincommited on
Refactor imports for consistency in `kokoro.py` and `processor.py`. Use absolute paths for better readability and maintainability.
8630353
liumaolincommited on
Update `paths.py`: improve PROJECT_ROOT resolution with `_MEIPASS` support and enhance third-party path handling.
664d767
liumaolincommited on
Rename 'src/VoiceDialogue' to 'src/voice_dialogue'.
511ff0c
liumaolincommited on
Revamp API core description: expand feature details for ASR, LLMs, TTS, system control, and real-time communication; improve clarity and structure of documentation.
c57de2a
liumaolincommited on
Update project requirements.
ccdd95f
liumaolincommited on
Integrate WebSocket support: add `/api/v1/ws` endpoint, enable real-time message handling via `websocket_message_queue`, and refactor services and models to support WebSocket-based question and answer updates.
2534744
liumaolincommited on
Refactor `SessionIdManager` module: rename `session_id_manager.py` to `session_manager.py` and update imports accordingly.
83ef092
liumaolincommited on
Refactor session ID handling: replace `current_session_id` with `SessionIdManager` for thread-safe management, update related imports and references.
92bb56d
liumaolincommited on
Refactor `__init__.py` in TTS runtime: streamline `__all__` handling, improve logging for import failures, and enhance maintainability of module exports.
6f036c6
liumaolincommited on
Serve static frontend assets through FastAPI: mount static files and replace root endpoint response with `index.html`.
ad7bf8d
liumaolincommited on
Handle `UnboundLocalError` in punctuation model lookup: add exception handling to ensure stability during transcription.
933e84c
liumaolincommited on
Remove `VoiceDialogue` API, models, and settings: clean up unused modules and dependencies for the decommissioned service.
3eb6daa
liumaolincommited on
Refactor assets files
1d3b1b4
liumaolincommited on
Refactor directories across services: rename `audio_generator` to `generators`, `asr` to `recognizers`, and update all import paths for consistency and improved module organization.
919ff3f
liumaolincommited on
Refactor imports across services: replace `services.core` module references with `core` for consistency and maintainability; remove unused `Queue` imports.
619c761
liumaolincommited on
Refactor imports in TTS and ASR modules: switch to absolute imports for improved clarity and maintainability.
d7d0d96
liumaolincommited on
Remove trailing whitespace in `audio_generator/manager.py` and `asr/manager.py` for improved code cleanliness and consistency.
f08ef5f
liumaolincommited on
Refactor ASR manager: remove `_get_asr_supported_languages`, replace static language mapping with `supported_langs` attribute, and update dynamic module import to use `importlib.util` for improved maintainability.
8acaad0
liumaolincommited on
Refactor `audio_generator/manager.py`: streamline imports, remove redundant modules in `register_all_tts`, and adjust dynamic import spec for improved maintainability.
ee4663a
liumaolincommited on
Adjust sorting priority in `audio_generator/models/base.py`: prioritize `language_bonus` over `priority` for config selection.
3b73d5e
liumaolincommited on
Refactor TTS module loading in `audio_generator/manager.py`: replace `importlib.import_module` with `importlib.util` for improved dynamic import handling and error logging.
9bfcd24
liumaolincommited on
Update TTS speaker configuration: replace static mapping with dynamic retrieval, add available speaker listing, and update CLI argument parsing for improved flexibility and maintainability.
cf355e6
liumaolincommited on
Refactor sentence processing in `text_generator.py`: centralize sentence end mark sets, streamline `_should_end_sentence` logic, and eliminate redundant parameter passing for improved clarity and maintainability.
1ae18a4
liumaolincommited on
Refactor imports in `main.py`: reorder modules for better readability and remove redundant `load_third_party` call.
8d91cc1
liumaolincommited on
Add Kokoro TTS support: integrate new TTS model, configuration, and runtime components for enhanced multilingual voice synthesis.
1cbd55c
liumaolincommited on
Refactor ASR routes: replace `fastapi_request.state` with `fastapi_request.app.state` for consistent application-level state management.
d231de5
liumaolincommited on
Enhance TTS model handling: add dynamic status tracking, model downloading, and default system configuration initialization with API updates to manage active and default TTS models effectively.
fb6d02a
liumaolincommited on
Comment out logging statements in `audio_player.py` to disable performance logs and streamline runtime output.
87a7384
liumaolincommited on
Refactor ASR routes and services: implement instance creation tracking with background task support, enhance `get_supported_languages` with current ASR language, and clean up unused schemas and routes for simplified management.
757f3be
liumaolincommited on
Remove `SystemConfig` and `SystemStartRequest` imports and clean up `__all__` in `schemas/__init__.py` for simplified schema management.
51a672c
liumaolincommited on
Remove `SystemConfig` and `SystemStartRequest` models and clean up related API routes and background tasks for simplified system startup and management.
e7ebdb0
liumaolincommited on
Enhance system management and audio capture services: implement `SystemStatusResponse` updates with detailed state tracking, add `audio_capture` service creation and lifecycle management, and refactor API `/system` routes for improved status and control handling.
94c7b78
liumaolincommited on
Extend TTS registry functionality and integrate default system configurations: implement prioritization logic, language preference handling, and fallback mechanisms in `TTSConfigRegistry`; refactor service factory and lifespan management to support dynamic TTS selection and initialization.
a28f7e3
liumaolincommited on
Introduce core module for API lifecycle management: add configuration, service factories, service manager, and lifespan handlers to streamline application startup, shutdown, and service orchestration.
a16e0e5
liumaolincommited on
Add system utilities and initialize core modules: implement `get_system_language` and `get_system_info`, update API startup with system defaults, and integrate ASR, LLM, and speech modules for enhanced functionality.
5c0e715
liumaolincommited on
Refactor core queue initialization: move queue definitions to `constants.py` and clean up redundant imports in `main.py` for better modularity.
bfefeb3
liumaolincommited on
Introduce initial API structure for VoiceDialogue: add dependencies, middleware, and routes for ASR, TTS, system, and voice modules.
8f823b0
liumaolincommited on
Refactor ASR module: introduce modular structure with ASR interface, implement FunASR and Whisper clients, add registry, and consolidate utility functions for enhanced maintainability and extensibility.
59603db
liumaolincommited on
Refactor TTS module: rename `tts_manager` to `manager` for consistency across imports and structure.
89f7f05
liumaolincommited on
Refactor speech processing: add type hint for `_process_active_voice_frame` and replace `max()` with `np.max()` for consistency.
5284873
liumaolincommited on
Refactor TTS audio generation: rename queues for clarity, update `TTSAudioGenerator` initialization, and enhance docstrings for better maintainability.
bba0d84
liumaolincommited on
Refactor TTS architecture: implement runtime interface, TTS manager, universal registry, and factory pattern to support multiple engines.
ef0d09e
liumaolincommited on
Refactor voice model structure: extract MoYoYo-specific configurations and introduce universal TTS registry.
025ca3f
liumaolincommited on
Remove unused `prompt_semantic` and `reference_spec` configuration parameters from voice model definitions.
2a5dcf2
liumaolincommited on
Add thread readiness checks and is_ready property across services