MoYoYoTech/VoiceDialogue
Text-to-Speech
Transformers
ONNX
GGUF
Chinese
English
voice-dialogue
speech-recognition
large-language-model
asr
tts
llm
chinese
english
real-time
conversational
License: MIT
Repository size: 13.3 GB
Contributors: 3
History: 19 commits
Latest commit: ef0d09e by liumaolin, 7 months ago
Refactor TTS architecture: implement runtime interface, TTS manager, universal registry, and factory pattern to support multiple engines.
Files:
models: Update FunASR punc quantized model. (7 months ago)
resources: First commit. (7 months ago)
src: Refactor TTS architecture: implement runtime interface, TTS manager, universal registry, and factory pattern to support multiple engines. (7 months ago)
third_party: Update TTS inference to validate audio duration using soundfile. (7 months ago)
.gitattributes (1.88 kB): Update .gitattributes to track GGUF files with git-lfs (7 months ago)
.gitignore (5.03 kB): First commit. (7 months ago)
README.md (13.4 kB): Integrate FunASR service. (7 months ago)
requirements.txt (391 Bytes): Update README for enhanced installation and usage guidance. (7 months ago)