Commit History

添加MoYoYo语音模型“白岩松”及相关文件

d74bcbf

liumaolin commited on Jul 7, 2025

重构`is_task_interrupted`方法以检查语音任务中断状态

02c321c

liumaolin commited on Jul 6, 2025

Add `has_no_words` check to skip punctuation-only TTS tasks and enhance debug logs

b636027

liumaolin commited on Jul 4, 2025

Bump app version to 1.0.2 in `package.json`.

f00898d

liumaolin commited on Jul 4, 2025

Add `silero-vad` and `pyaudio` to `requirements.txt` for VAD and audio processing dependencies

22a99cd

liumaolin commited on Jul 4, 2025

Update frontend files.

611aa86

liumaolin commited on Jul 4, 2025

Refactor SpeechMonitor to use active audio frame duration instead of count

15891ec

liumaolin commited on Jul 4, 2025

Add TTS generation error handling and `is_task_interrupted` helper function

5ecb408

liumaolin commited on Jul 4, 2025

Optimize VAD logic by replacing `np.max(probs)` with `any(prob >= threshold)` for improved readability and efficiency.

9273b76

liumaolin commited on Jul 4, 2025

[fix]: update config modal layout.

20896dc

Xin Zhang commited on Jul 4, 2025

Update `.gitignore` to exclude `.DS_Store` and `package-lock.json`.

74f1d81

liumaolin commited on Jul 4, 2025

Rename application title to `VoiceDialogue` in `index.html`.

ec5c62b

liumaolin commited on Jul 4, 2025

Increase audio capture chunk size to 1024 in `capture.py` for smoother streaming

095cfb6

liumaolin commited on Jul 3, 2025

Add `pyaudio` and `silero_vad` as dependencies for PyInstaller hooks

08de414

liumaolin commited on Jul 3, 2025

Update frontend assets.

5a7f766

liumaolin commited on Jul 3, 2025

Update frontend assets.

2cfbc4c

liumaolin commited on Jul 3, 2025

Add echo cancellation toggle in UI and update project dependencies

5bc3ef8

liumaolin commited on Jul 3, 2025

Remove `speech_monitor` dependency from `asr_worker` service definition.

5cf0dbe

liumaolin commited on Jul 3, 2025

Add echo cancellation and VAD toggle support in service factories and routes

2ecfa8f

liumaolin commited on Jul 3, 2025

Fix help text for `--disable-echo-cancellation` to clarify the default behavior is not disabled

d846f85

liumaolin commited on Jul 2, 2025

Refactor threading in `launcher.py` to standardize worker initialization, enforce daemon mode, and improve naming consistency.

7d8046a

liumaolin commited on Jul 2, 2025

Handle overflow errors in audio capture by adding `exception_on_overflow=False` and skip processing when no data is available.

5f9eaee

liumaolin commited on Jul 2, 2025

Add `--disable-echo-cancellation` CLI option and update audio pipeline to support toggling echo cancellation and VAD

4e071d3

liumaolin commited on Jul 2, 2025

Integrate `SileroVAD` into `SpeechMonitor` for optional voice activity detection. Add `_detect_speech()` method and update queue handling logic. Implement `SileroVAD` as a singleton for efficient model management.

4e2e3d8

liumaolin commited on Jul 2, 2025

Add `silero-vad` as a dependency in `pyproject.toml` and update `uv.lock`

d41c6db

liumaolin commited on Jul 2, 2025

Increase queue timeout in audio and text processing services for smoother task handling

b446464

liumaolin commited on Jul 2, 2025

Refactor audio processing pipeline to normalize data in `SpeechMonitor` and streamline queuing in `AudioCapture`

57b0084

liumaolin commited on Jul 2, 2025

Update `AudioCapture` to support both PyAudio and macOS native AEC＋VAD libraries

99e8988

liumaolin commited on Jul 2, 2025

Add PyAudio as a dependency in `pyproject.toml` and update `uv.lock`

a453c72

liumaolin commited on Jul 2, 2025

Refactor to replace `EchoCancellingAudioCapture` with `AudioCapture` across the codebase for improved clarity and flexibility

7437d6d

liumaolin commited on Jul 2, 2025

frontend

7bff52e

Xin Zhang commited on Jul 2, 2025

Clean input text in MoYoYo TTS by removing punctuation for better processing

8587958

liumaolin commited on Jul 1, 2025

Refactor LlamaCpp initialization to simplify parameter handling and remove unused callback manager

941bf07

liumaolin commited on Jul 1, 2025

Enable debug mode with global configuration and detailed task logging when active

e0f42b2

liumaolin commited on Jul 1, 2025

Reset task ID in speech recognizer for empty transcriptions to prevent errors

2291ed2

liumaolin commited on Jun 30, 2025

Add new voice model "Doubao" to MoYoYo configuration

7b003c4

liumaolin commited on Jun 30, 2025

Update performance logging format in TTS player for improved structure and readability

d0c1c61

liumaolin commited on Jun 28, 2025

Enhance launcher startup log formatting for improved readability and visual appeal

fa296dd

liumaolin commited on Jun 28, 2025

Remove commented-out performance logging code from TTS player

c3e85a2

liumaolin commited on Jun 28, 2025

Add new voice model "Ellen" to MoYoYo configuration

b5b48f0

liumaolin commited on Jun 27, 2025

Update MoYoYo TTS prompt text for improved relevance and clarity

8228973

liumaolin commited on Jun 26, 2025

Improve speech recognizer to handle empty transcriptions

0cbda14

liumaolin commited on Jun 26, 2025

Add `tests/tts_test_output` to `.gitignore` to exclude test artifacts

952b71a

liumaolin commited on Jun 26, 2025

Add unit tests for TTS audio generator

c6519c7

liumaolin commited on Jun 26, 2025

Update reference audio for MoYoYo TTS model

68663d9

liumaolin commited on Jun 26, 2025

Simplify system prompts for test dialogue in Chinese and English

ece34f8

liumaolin commited on Jun 26, 2025

Simplify system prompts for text generation in Chinese and English

c545fd9

liumaolin commited on Jun 26, 2025

Refactor `test_llm_dialogue.py` for improved pipeline creation and streamlined testing

20d87ca

liumaolin commited on Jun 26, 2025

Refactor `test_llm_dialogue.py` to enhance multi-dataset testing for LLM dialogue

6556ced

liumaolin commited on Jun 26, 2025

Add new voice model "Juniper" to MoYoYo configuration

469433f

liumaolin commited on Jun 26, 2025