Refactor directories across services: rename `audio_generator` to `generators`, `asr` to `recognizers`, and update all import paths for consistency and improved module organization.
919ff3f
liumaolincommited on
Refactor imports across services: replace `services.core` module references with `core` for consistency and maintainability; remove unused `Queue` imports.
619c761
liumaolincommited on
Refactor imports in TTS and ASR modules: switch to absolute imports for improved clarity and maintainability.
d7d0d96
liumaolincommited on
Remove trailing whitespace in `audio_generator/manager.py` and `asr/manager.py` for improved code cleanliness and consistency.
f08ef5f
liumaolincommited on
Refactor ASR manager: remove `_get_asr_supported_languages`, replace static language mapping with `supported_langs` attribute, and update dynamic module import to use `importlib.util` for improved maintainability.
8acaad0
liumaolincommited on
Refactor `audio_generator/manager.py`: streamline imports, remove redundant modules in `register_all_tts`, and adjust dynamic import spec for improved maintainability.
ee4663a
liumaolincommited on
Adjust sorting priority in `audio_generator/models/base.py`: prioritize `language_bonus` over `priority` for config selection.
3b73d5e
liumaolincommited on
Refactor TTS module loading in `audio_generator/manager.py`: replace `importlib.import_module` with `importlib.util` for improved dynamic import handling and error logging.
9bfcd24
liumaolincommited on
Update TTS speaker configuration: replace static mapping with dynamic retrieval, add available speaker listing, and update CLI argument parsing for improved flexibility and maintainability.
cf355e6
liumaolincommited on
Refactor sentence processing in `text_generator.py`: centralize sentence end mark sets, streamline `_should_end_sentence` logic, and eliminate redundant parameter passing for improved clarity and maintainability.
1ae18a4
liumaolincommited on
Refactor imports in `main.py`: reorder modules for better readability and remove redundant `load_third_party` call.
8d91cc1
liumaolincommited on
Add Kokoro TTS support: integrate new TTS model, configuration, and runtime components for enhanced multilingual voice synthesis.
1cbd55c
liumaolincommited on
Refactor ASR routes: replace `fastapi_request.state` with `fastapi_request.app.state` for consistent application-level state management.
d231de5
liumaolincommited on
Enhance TTS model handling: add dynamic status tracking, model downloading, and default system configuration initialization with API updates to manage active and default TTS models effectively.
fb6d02a
liumaolincommited on
Comment out logging statements in `audio_player.py` to disable performance logs and streamline runtime output.
87a7384
liumaolincommited on
Refactor ASR routes and services: implement instance creation tracking with background task support, enhance `get_supported_languages` with current ASR language, and clean up unused schemas and routes for simplified management.
757f3be
liumaolincommited on
Remove `SystemConfig` and `SystemStartRequest` imports and clean up `__all__` in `schemas/__init__.py` for simplified schema management.
51a672c
liumaolincommited on
Remove `SystemConfig` and `SystemStartRequest` models and clean up related API routes and background tasks for simplified system startup and management.
e7ebdb0
liumaolincommited on
Enhance system management and audio capture services: implement `SystemStatusResponse` updates with detailed state tracking, add `audio_capture` service creation and lifecycle management, and refactor API `/system` routes for improved status and control handling.
94c7b78
liumaolincommited on
Extend TTS registry functionality and integrate default system configurations: implement prioritization logic, language preference handling, and fallback mechanisms in `TTSConfigRegistry`; refactor service factory and lifespan management to support dynamic TTS selection and initialization.
a28f7e3
liumaolincommited on
Introduce core module for API lifecycle management: add configuration, service factories, service manager, and lifespan handlers to streamline application startup, shutdown, and service orchestration.
a16e0e5
liumaolincommited on
Add system utilities and initialize core modules: implement `get_system_language` and `get_system_info`, update API startup with system defaults, and integrate ASR, LLM, and speech modules for enhanced functionality.
5c0e715
liumaolincommited on
Refactor core queue initialization: move queue definitions to `constants.py` and clean up redundant imports in `main.py` for better modularity.
bfefeb3
liumaolincommited on
Introduce initial API structure for VoiceDialogue: add dependencies, middleware, and routes for ASR, TTS, system, and voice modules.
8f823b0
liumaolincommited on
Refactor ASR module: introduce modular structure with ASR interface, implement FunASR and Whisper clients, add registry, and consolidate utility functions for enhanced maintainability and extensibility.
59603db
liumaolincommited on
Refactor TTS module: rename `tts_manager` to `manager` for consistency across imports and structure.
89f7f05
liumaolincommited on
Refactor speech processing: add type hint for `_process_active_voice_frame` and replace `max()` with `np.max()` for consistency.
5284873
liumaolincommited on
Refactor TTS audio generation: rename queues for clarity, update `TTSAudioGenerator` initialization, and enhance docstrings for better maintainability.
bba0d84
liumaolincommited on
Refactor TTS architecture: implement runtime interface, TTS manager, universal registry, and factory pattern to support multiple engines.
ef0d09e
liumaolincommited on
Refactor voice model structure: extract MoYoYo-specific configurations and introduce universal TTS registry.
025ca3f
liumaolincommited on
Remove unused `prompt_semantic` and `reference_spec` configuration parameters from voice model definitions.
2a5dcf2
liumaolincommited on
Add thread readiness checks and is_ready property across services
e80f558
liumaolincommited on
Using FunASR quantized model.
ac62229
liumaolincommited on
Add multilingual support and optimize LLM pipeline configuration.
2988b10
liumaolincommited on
Add descriptions for Chinese voice models.
4643bb2
liumaolincommited on
Remove unused configuration parameters and conversation templates.