Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
MoYoYoTech
/
VoiceDialogue
like
2
Follow
MoYoYoTech
23
Text-to-Speech
Transformers
ONNX
GGUF
Chinese
English
voice-dialogue
speech-recognition
large-language-model
asr
tts
llm
chinese
english
real-time
conversational
License:
mit
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
22a99cd
VoiceDialogue
/
src
/
voice_dialogue
246 kB
3 contributors
History:
53 commits
liumaolin
Refactor SpeechMonitor to use active audio frame duration instead of count
15891ec
5 months ago
api
Remove `speech_monitor` dependency from `asr_worker` service definition.
6 months ago
cli
Fix help text for `--disable-echo-cancellation` to clarify the default behavior is not disabled
6 months ago
config
Refactor LlamaCpp initialization to simplify parameter handling and remove unused callback manager
6 months ago
core
Refactor threading in `launcher.py` to standardize worker initialization, enforce daemon mode, and improve naming consistency.
6 months ago
models
Rename 'src/VoiceDialogue' to 'src/voice_dialogue'.
6 months ago
services
Refactor SpeechMonitor to use active audio frame duration instead of count
5 months ago
utils
Replace `logging` with centralized `loguru`-based logger across all modules.
6 months ago
__init__.py
539 Bytes
Refactor to replace `EchoCancellingAudioCapture` with `AudioCapture` across the codebase for improved clarity and flexibility
6 months ago