Commit History

Enhance system management and audio capture services: implement `SystemStatusResponse` updates with detailed state tracking, add `audio_capture` service creation and lifecycle management, and refactor API `/system` routes for improved status and control handling.
94c7b78

liumaolin committed on

Extend TTS registry functionality and integrate default system configurations: implement prioritization logic, language preference handling, and fallback mechanisms in `TTSConfigRegistry`; refactor service factory and lifespan management to support dynamic TTS selection and initialization.
a28f7e3

liumaolin committed on

Introduce core module for API lifecycle management: add configuration, service factories, service manager, and lifespan handlers to streamline application startup, shutdown, and service orchestration.
a16e0e5

liumaolin committed on

Add system utilities and initialize core modules: implement `get_system_language` and `get_system_info`, update API startup with system defaults, and integrate ASR, LLM, and speech modules for enhanced functionality.
5c0e715

liumaolin committed on

Refactor core queue initialization: move queue definitions to `constants.py` and clean up redundant imports in `main.py` for better modularity.
bfefeb3

liumaolin committed on

Introduce initial API structure for VoiceDialogue: add dependencies, middleware, and routes for ASR, TTS, system, and voice modules.
8f823b0

liumaolin committed on

Update dependencies: add FunASR, FunASR-ONNX, FastAPI, and Uvicorn to requirements.txt
d91a26b

liumaolin committed on

Refactor ASR module: introduce modular structure with ASR interface, implement FunASR and Whisper clients, add registry, and consolidate utility functions for enhanced maintainability and extensibility.
59603db

liumaolin committed on

Refactor TTS module: rename `tts_manager` to `manager` for consistency across imports and structure.
89f7f05

liumaolin committed on

Refactor speech processing: add type hint for `_process_active_voice_frame` and replace `max()` with `np.max()` for consistency.
5284873

liumaolin committed on

Refactor TTS audio generation: rename queues for clarity, update `TTSAudioGenerator` initialization, and enhance docstrings for better maintainability.
bba0d84

liumaolin committed on

Refactor TTS architecture: implement runtime interface, TTS manager, universal registry, and factory pattern to support multiple engines.
ef0d09e

liumaolin committed on

Refactor voice model structure: extract MoYoYo-specific configurations and introduce universal TTS registry.
025ca3f

liumaolin committed on

Update TTS inference to validate audio duration using `soundfile`.
7b86866

liumaolin committed on

Remove unused `prompt_semantic` and `reference_spec` configuration parameters from voice model definitions.
2a5dcf2

liumaolin committed on

Add thread readiness checks and `is_ready` property across services.
e80f558

liumaolin committed on

Update FunASR punc quantized model.
d691bbc

liumaolin committed on

Use FunASR quantized model.
ac62229

liumaolin committed on

Add multilingual support and optimize LLM pipeline configuration.
2988b10

liumaolin committed on

Update .gitattributes to track GGUF files with git-lfs
ecc005d

liumaolin committed on

Add descriptions for Chinese voice models.
4643bb2

liumaolin committed on

Remove unused configuration parameters and conversation templates.
3d953ae

liumaolin committed on

Integrate FunASR service.
516d7b8

liumaolin committed on

Add FunASR models.
76e7fcd

liumaolin committed on

Update .gitattributes to track jieba dictionary and PNG files with git-lfs.
566a4da

liumaolin committed on

Update README for enhanced installation and usage guidance.
6a33f71

liumaolin committed on

Add requirements.txt.
0df78b6

liumaolin committed on

First commit.
7b64dcd

liumaolin committed on

Update git-lfs file tracking.
892407b

liumaolin committed on