Commit History

Update static file routing and root endpoint for frontend integration
f7b034a

liumaolin committed

Add robust lifecycle management for `audio_player` service in system routes
627c3e7

liumaolin committed

Standardize service lifecycle management by replacing `stop` with `exit` and introducing `is_exited` check
f5226c0

liumaolin committed

Remove `voice_schemas.py` and refactor schema imports for TTS and ASR modules in `__init__.py`
4895dc2

liumaolin committed

Refactor speech recognizer, audio capture, and system routes for improved clarity and functionality
037e5ae

liumaolin committed

Add pause and resume functionality to voice dialogue system
d701b8a

liumaolin committed

Refactor project: split `main.py` functionality into modular components under `cli`, `core`, and `config`.
d08a15b

liumaolin committed

Increase service startup timeouts and set daemon mode for services.
61524a8

liumaolin committed

Refactor imports in `whisper.py` and `funasr.py` to use absolute paths for `ensure_minimum_audio_duration`.
d673573

liumaolin committed

Update `moyoyo.py`: add fallback for `utils` to ensure `HParams` availability in runtime.
bd3673b

liumaolin committed

Refactor imports for consistency in `kokoro.py` and `processor.py`. Use absolute paths for better readability and maintainability.
8630353

liumaolin committed

Update `paths.py`: improve PROJECT_ROOT resolution with `_MEIPASS` support and enhance third-party path handling.
664d767

liumaolin committed

Rename `src/VoiceDialogue` to `src/voice_dialogue`.
511ff0c

liumaolin committed

Revamp API core description: expand feature details for ASR, LLMs, TTS, system control, and real-time communication; improve clarity and structure of documentation.
c57de2a

liumaolin committed

Update project requirements.
ccdd95f

liumaolin committed

Integrate WebSocket support: add `/api/v1/ws` endpoint, enable real-time message handling via `websocket_message_queue`, and refactor services and models to support WebSocket-based question and answer updates.
2534744

liumaolin committed

Refactor `SessionIdManager` module: rename `session_id_manager.py` to `session_manager.py` and update imports accordingly.
83ef092

liumaolin committed

Refactor session ID handling: replace `current_session_id` with `SessionIdManager` for thread-safe management, update related imports and references.
92bb56d

liumaolin committed

Refactor `__init__.py` in TTS runtime: streamline `__all__` handling, improve logging for import failures, and enhance maintainability of module exports.
6f036c6

liumaolin committed

Serve static frontend assets through FastAPI: mount static files and replace root endpoint response with `index.html`.
ad7bf8d

liumaolin committed

Handle `UnboundLocalError` in punctuation model lookup: add exception handling to ensure stability during transcription.
933e84c

liumaolin committed

Remove `VoiceDialogue` API, models, and settings: clean up unused modules and dependencies for the decommissioned service.
3eb6daa

liumaolin committed

Refactor asset files
1d3b1b4

liumaolin committed

Refactor directories across services: rename `audio_generator` to `generators`, `asr` to `recognizers`, and update all import paths for consistency and improved module organization.
919ff3f

liumaolin committed

Refactor imports across services: replace `services.core` module references with `core` for consistency and maintainability; remove unused `Queue` imports.
619c761

liumaolin committed

Refactor imports in TTS and ASR modules: switch to absolute imports for improved clarity and maintainability.
d7d0d96

liumaolin committed

Remove trailing whitespace in `audio_generator/manager.py` and `asr/manager.py` for improved code cleanliness and consistency.
f08ef5f

liumaolin committed

Refactor ASR manager: remove `_get_asr_supported_languages`, replace static language mapping with `supported_langs` attribute, and update dynamic module import to use `importlib.util` for improved maintainability.
8acaad0

liumaolin committed

Refactor `audio_generator/manager.py`: streamline imports, remove redundant modules in `register_all_tts`, and adjust dynamic import spec for improved maintainability.
ee4663a

liumaolin committed

Adjust sorting priority in `audio_generator/models/base.py`: prioritize `language_bonus` over `priority` for config selection.
3b73d5e

liumaolin committed

Refactor TTS module loading in `audio_generator/manager.py`: replace `importlib.import_module` with `importlib.util` for improved dynamic import handling and error logging.
9bfcd24

liumaolin committed

Update TTS speaker configuration: replace static mapping with dynamic retrieval, add available speaker listing, and update CLI argument parsing for improved flexibility and maintainability.
cf355e6

liumaolin committed

Refactor sentence processing in `text_generator.py`: centralize sentence end mark sets, streamline `_should_end_sentence` logic, and eliminate redundant parameter passing for improved clarity and maintainability.
1ae18a4

liumaolin committed

Refactor imports in `main.py`: reorder modules for better readability and remove redundant `load_third_party` call.
8d91cc1

liumaolin committed

Add Kokoro TTS support: integrate new TTS model, configuration, and runtime components for enhanced multilingual voice synthesis.
1cbd55c

liumaolin committed

Refactor ASR routes: replace `fastapi_request.state` with `fastapi_request.app.state` for consistent application-level state management.
d231de5

liumaolin committed

Enhance TTS model handling: add dynamic status tracking, model downloading, and default system configuration initialization with API updates to manage active and default TTS models effectively.
fb6d02a

liumaolin committed

Comment out logging statements in `audio_player.py` to disable performance logs and streamline runtime output.
87a7384

liumaolin committed

Refactor ASR routes and services: implement instance creation tracking with background task support, enhance `get_supported_languages` with current ASR language, and clean up unused schemas and routes for simplified management.
757f3be

liumaolin committed

Remove `SystemConfig` and `SystemStartRequest` imports and clean up `__all__` in `schemas/__init__.py` for simplified schema management.
51a672c

liumaolin committed

Remove `SystemConfig` and `SystemStartRequest` models and clean up related API routes and background tasks for simplified system startup and management.
e7ebdb0

liumaolin committed

Enhance system management and audio capture services: implement `SystemStatusResponse` updates with detailed state tracking, add `audio_capture` service creation and lifecycle management, and refactor API `/system` routes for improved status and control handling.
94c7b78

liumaolin committed

Extend TTS registry functionality and integrate default system configurations: implement prioritization logic, language preference handling, and fallback mechanisms in `TTSConfigRegistry`; refactor service factory and lifespan management to support dynamic TTS selection and initialization.
a28f7e3

liumaolin committed

Introduce core module for API lifecycle management: add configuration, service factories, service manager, and lifespan handlers to streamline application startup, shutdown, and service orchestration.
a16e0e5

liumaolin committed

Add system utilities and initialize core modules: implement `get_system_language` and `get_system_info`, update API startup with system defaults, and integrate ASR, LLM, and speech modules for enhanced functionality.
5c0e715

liumaolin committed

Refactor core queue initialization: move queue definitions to `constants.py` and clean up redundant imports in `main.py` for better modularity.
bfefeb3

liumaolin committed

Introduce initial API structure for VoiceDialogue: add dependencies, middleware, and routes for ASR, TTS, system, and voice modules.
8f823b0

liumaolin committed

Refactor ASR module: introduce modular structure with ASR interface, implement FunASR and Whisper clients, add registry, and consolidate utility functions for enhanced maintainability and extensibility.
59603db

liumaolin committed

Refactor TTS module: rename `tts_manager` to `manager` for consistency across imports and structure.
89f7f05

liumaolin committed

Refactor speech processing: add type hint for `_process_active_voice_frame` and replace `max()` with `np.max()` for consistency.
5284873

liumaolin committed