Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

MoYoYoTech
/
VoiceDialogue

Text-to-Speech
Transformers
ONNX
GGUF
Chinese
English
voice-dialogue
speech-recognition
large-language-model
asr
tts
llm
chinese
english
real-time
conversational
Model card Files Files and versions
xet
Community
1
VoiceDialogue
13.3 GB
  • 3 contributors
History: 21 commits
liumaolin
Refactor speech processing: add type hint for `_process_active_voice_frame` and replace `max()` with `np.max()` for consistency.
5284873 7 months ago
  • models
    Update FunASR punc quantized model. 7 months ago
  • resources
    First commit. 7 months ago
  • src
    Refactor speech processing: add type hint for `_process_active_voice_frame` and replace `max()` with `np.max()` for consistency. 7 months ago
  • third_party
    Update TTS inference to validate audio duration using soundfile. 7 months ago
  • .gitattributes
    1.88 kB
    Update .gitattributes to track GGUF files with git-lfs 7 months ago
  • .gitignore
    5.03 kB
    First commit. 7 months ago
  • README.md
    13.4 kB
    Integrate FunASR service. 7 months ago
  • requirements.txt
    391 Bytes
    Update README for enhanced installation and usage guidance. 7 months ago