Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

MoYoYoTech
/
VoiceDialogue

Text-to-Speech
Transformers
ONNX
GGUF
Chinese
English
voice-dialogue
speech-recognition
large-language-model
asr
tts
llm
chinese
english
real-time
conversational
Model card Files Files and versions
xet
Community
1
VoiceDialogue / src /voice_dialogue
246 kB
  • 3 contributors
History: 53 commits
liumaolin
Refactor SpeechMonitor to use active audio frame duration instead of count
15891ec 5 months ago
  • api
    Remove `speech_monitor` dependency from `asr_worker` service definition. 6 months ago
  • cli
    Fix help text for `--disable-echo-cancellation` to clarify the default behavior is not disabled 6 months ago
  • config
    Refactor LlamaCpp initialization to simplify parameter handling and remove unused callback manager 6 months ago
  • core
    Refactor threading in `launcher.py` to standardize worker initialization, enforce daemon mode, and improve naming consistency. 6 months ago
  • models
    Rename 'src/VoiceDialogue' to 'src/voice_dialogue'. 6 months ago
  • services
    Refactor SpeechMonitor to use active audio frame duration instead of count 5 months ago
  • utils
    Replace `logging` with centralized `loguru`-based logger across all modules. 6 months ago
  • __init__.py
    539 Bytes
    Refactor to replace `EchoCancellingAudioCapture` with `AudioCapture` across the codebase for improved clarity and flexibility 6 months ago