Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
MoYoYoTech
/
VoiceDialogue
like
2
Follow
MoYoYoTech
24
Text-to-Speech
Transformers
ONNX
GGUF
Chinese
English
voice-dialogue
speech-recognition
large-language-model
asr
tts
llm
chinese
english
real-time
conversational
License:
mit
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
f7b034a
VoiceDialogue
/
assets
13.1 GB
3 contributors
History:
7 commits
liumaolin
Replace `ffmpeg`-based audio loading with `soundfile` and `librosa`
b6d76bc
7 months ago
audio
Refactor assets files
7 months ago
libraries
Refactor assets files
7 months ago
models
Remove deprecated `whisper/ggml-large-v3-turbo-encoder` model files and metadata
7 months ago
www
Replace `ffmpeg`-based audio loading with `soundfile` and `librosa`
7 months ago