Add `has_no_words` check to skip punctuation-only TTS tasks and enhance debug logs
b636027
liumaolincommited on
Bump app version to 1.0.2 in `package.json`.
f00898d
liumaolincommited on
Add `silero-vad` and `pyaudio` to `requirements.txt` for VAD and audio processing dependencies
22a99cd
liumaolincommited on
Update frontend files.
611aa86
liumaolincommited on
Refactor SpeechMonitor to use active audio frame duration instead of count
15891ec
liumaolincommited on
Add TTS generation error handling and `is_task_interrupted` helper function
5ecb408
liumaolincommited on
Optimize VAD logic by replacing `np.max(probs)` with `any(prob >= threshold)` for improved readability and efficiency.
9273b76
liumaolincommited on
[fix]: update config modal layout.
20896dc
Xin Zhangcommited on
Update `.gitignore` to exclude `.DS_Store` and `package-lock.json`.
74f1d81
liumaolincommited on
Rename application title to `VoiceDialogue` in `index.html`.
ec5c62b
liumaolincommited on
Increase audio capture chunk size to 1024 in `capture.py` for smoother streaming
095cfb6
liumaolincommited on
Add `pyaudio` and `silero_vad` as dependencies for PyInstaller hooks
08de414
liumaolincommited on
Update frontend assets.
5a7f766
liumaolincommited on
Update frontend assets.
2cfbc4c
liumaolincommited on
Add echo cancellation toggle in UI and update project dependencies
5bc3ef8
liumaolincommited on
Remove `speech_monitor` dependency from `asr_worker` service definition.
5cf0dbe
liumaolincommited on
Add echo cancellation and VAD toggle support in service factories and routes
2ecfa8f
liumaolincommited on
Fix help text for `--disable-echo-cancellation` to clarify the default behavior is not disabled
d846f85
liumaolincommited on
Refactor threading in `launcher.py` to standardize worker initialization, enforce daemon mode, and improve naming consistency.
7d8046a
liumaolincommited on
Handle overflow errors in audio capture by adding `exception_on_overflow=False` and skip processing when no data is available.
5f9eaee
liumaolincommited on
Add `--disable-echo-cancellation` CLI option and update audio pipeline to support toggling echo cancellation and VAD
4e071d3
liumaolincommited on
Integrate `SileroVAD` into `SpeechMonitor` for optional voice activity detection. Add `_detect_speech()` method and update queue handling logic. Implement `SileroVAD` as a singleton for efficient model management.
4e2e3d8
liumaolincommited on
Add `silero-vad` as a dependency in `pyproject.toml` and update `uv.lock`
d41c6db
liumaolincommited on
Increase queue timeout in audio and text processing services for smoother task handling
b446464
liumaolincommited on
Refactor audio processing pipeline to normalize data in `SpeechMonitor` and streamline queuing in `AudioCapture`
57b0084
liumaolincommited on
Update `AudioCapture` to support both PyAudio and macOS native AEC+VAD libraries
99e8988
liumaolincommited on
Add PyAudio as a dependency in `pyproject.toml` and update `uv.lock`
a453c72
liumaolincommited on
Refactor to replace `EchoCancellingAudioCapture` with `AudioCapture` across the codebase for improved clarity and flexibility
7437d6d
liumaolincommited on
frontend
7bff52e
Xin Zhangcommited on
Clean input text in MoYoYo TTS by removing punctuation for better processing
8587958
liumaolincommited on
Refactor LlamaCpp initialization to simplify parameter handling and remove unused callback manager
941bf07
liumaolincommited on
Enable debug mode with global configuration and detailed task logging when active
e0f42b2
liumaolincommited on
Reset task ID in speech recognizer for empty transcriptions to prevent errors
2291ed2
liumaolincommited on
Add new voice model "Doubao" to MoYoYo configuration
7b003c4
liumaolincommited on
Update performance logging format in TTS player for improved structure and readability
d0c1c61
liumaolincommited on
Enhance launcher startup log formatting for improved readability and visual appeal
fa296dd
liumaolincommited on
Remove commented-out performance logging code from TTS player
c3e85a2
liumaolincommited on
Add new voice model "Ellen" to MoYoYo configuration
b5b48f0
liumaolincommited on
Update MoYoYo TTS prompt text for improved relevance and clarity
8228973
liumaolincommited on
Improve speech recognizer to handle empty transcriptions
0cbda14
liumaolincommited on
Add `tests/tts_test_output` to `.gitignore` to exclude test artifacts
952b71a
liumaolincommited on
Add unit tests for TTS audio generator
c6519c7
liumaolincommited on
Update reference audio for MoYoYo TTS model
68663d9
liumaolincommited on
Simplify system prompts for test dialogue in Chinese and English
ece34f8
liumaolincommited on
Simplify system prompts for text generation in Chinese and English
c545fd9
liumaolincommited on
Refactor `test_llm_dialogue.py` for improved pipeline creation and streamlined testing
20d87ca
liumaolincommited on
Refactor `test_llm_dialogue.py` to enhance multi-dataset testing for LLM dialogue
6556ced
liumaolincommited on
Add new voice model "Juniper" to MoYoYo configuration