Commit History

添加MoYoYo语音模型“白岩松”及相关文件
d74bcbf

liumaolin commited on

重构`is_task_interrupted`方法以检查语音任务中断状态
02c321c

liumaolin commited on

Add `has_no_words` check to skip punctuation-only TTS tasks and enhance debug logs
b636027

liumaolin commited on

Bump app version to 1.0.2 in `package.json`.
f00898d

liumaolin commited on

Add `silero-vad` and `pyaudio` to `requirements.txt` for VAD and audio processing dependencies
22a99cd

liumaolin commited on

Update frontend files.
611aa86

liumaolin commited on

Refactor SpeechMonitor to use active audio frame duration instead of count
15891ec

liumaolin commited on

Add TTS generation error handling and `is_task_interrupted` helper function
5ecb408

liumaolin commited on

Optimize VAD logic by replacing `np.max(probs)` with `any(prob >= threshold)` for improved readability and efficiency.
9273b76

liumaolin commited on

[fix]: update config modal layout.
20896dc

Xin Zhang commited on

Update `.gitignore` to exclude `.DS_Store` and `package-lock.json`.
74f1d81

liumaolin commited on

Rename application title to `VoiceDialogue` in `index.html`.
ec5c62b

liumaolin commited on

Increase audio capture chunk size to 1024 in `capture.py` for smoother streaming
095cfb6

liumaolin commited on

Add `pyaudio` and `silero_vad` as dependencies for PyInstaller hooks
08de414

liumaolin commited on

Update frontend assets.
5a7f766

liumaolin commited on

Update frontend assets.
2cfbc4c

liumaolin commited on

Add echo cancellation toggle in UI and update project dependencies
5bc3ef8

liumaolin commited on

Remove `speech_monitor` dependency from `asr_worker` service definition.
5cf0dbe

liumaolin commited on

Add echo cancellation and VAD toggle support in service factories and routes
2ecfa8f

liumaolin commited on

Fix help text for `--disable-echo-cancellation` to clarify the default behavior is not disabled
d846f85

liumaolin commited on

Refactor threading in `launcher.py` to standardize worker initialization, enforce daemon mode, and improve naming consistency.
7d8046a

liumaolin commited on

Handle overflow errors in audio capture by adding `exception_on_overflow=False` and skip processing when no data is available.
5f9eaee

liumaolin commited on

Add `--disable-echo-cancellation` CLI option and update audio pipeline to support toggling echo cancellation and VAD
4e071d3

liumaolin commited on

Integrate `SileroVAD` into `SpeechMonitor` for optional voice activity detection. Add `_detect_speech()` method and update queue handling logic. Implement `SileroVAD` as a singleton for efficient model management.
4e2e3d8

liumaolin commited on

Add `silero-vad` as a dependency in `pyproject.toml` and update `uv.lock`
d41c6db

liumaolin commited on

Increase queue timeout in audio and text processing services for smoother task handling
b446464

liumaolin commited on

Refactor audio processing pipeline to normalize data in `SpeechMonitor` and streamline queuing in `AudioCapture`
57b0084

liumaolin commited on

Update `AudioCapture` to support both PyAudio and macOS native AEC+VAD libraries
99e8988

liumaolin commited on

Add PyAudio as a dependency in `pyproject.toml` and update `uv.lock`
a453c72

liumaolin commited on

Refactor to replace `EchoCancellingAudioCapture` with `AudioCapture` across the codebase for improved clarity and flexibility
7437d6d

liumaolin commited on

frontend
7bff52e

Xin Zhang commited on

Clean input text in MoYoYo TTS by removing punctuation for better processing
8587958

liumaolin commited on

Refactor LlamaCpp initialization to simplify parameter handling and remove unused callback manager
941bf07

liumaolin commited on

Enable debug mode with global configuration and detailed task logging when active
e0f42b2

liumaolin commited on

Reset task ID in speech recognizer for empty transcriptions to prevent errors
2291ed2

liumaolin commited on

Add new voice model "Doubao" to MoYoYo configuration
7b003c4

liumaolin commited on

Update performance logging format in TTS player for improved structure and readability
d0c1c61

liumaolin commited on

Enhance launcher startup log formatting for improved readability and visual appeal
fa296dd

liumaolin commited on

Remove commented-out performance logging code from TTS player
c3e85a2

liumaolin commited on

Add new voice model "Ellen" to MoYoYo configuration
b5b48f0

liumaolin commited on

Update MoYoYo TTS prompt text for improved relevance and clarity
8228973

liumaolin commited on

Improve speech recognizer to handle empty transcriptions
0cbda14

liumaolin commited on

Add `tests/tts_test_output` to `.gitignore` to exclude test artifacts
952b71a

liumaolin commited on

Add unit tests for TTS audio generator
c6519c7

liumaolin commited on

Update reference audio for MoYoYo TTS model
68663d9

liumaolin commited on

Simplify system prompts for test dialogue in Chinese and English
ece34f8

liumaolin commited on

Simplify system prompts for text generation in Chinese and English
c545fd9

liumaolin commited on

Refactor `test_llm_dialogue.py` for improved pipeline creation and streamlined testing
20d87ca

liumaolin commited on

Refactor `test_llm_dialogue.py` to enhance multi-dataset testing for LLM dialogue
6556ced

liumaolin commited on

Add new voice model "Juniper" to MoYoYo configuration
469433f

liumaolin commited on