MoYoYoTech/VoiceDialogue
Text-to-Speech
Transformers
ONNX
GGUF
Chinese
English
voice-dialogue
speech-recognition
large-language-model
asr
tts
llm
chinese
english
real-time
conversational
License: mit
VoiceDialogue at revision a3adfd5
13.4 GB
3 contributors
History: 44 commits
Latest commit a3adfd5 by liumaolin, 8 months ago: Update README.md: revise feature descriptions, add new Web API service section, update supported TTS models and characters, and include enhanced installation and usage instructions.
| Name | Size | Last commit message | Updated |
|---|---|---|---|
| models | - | Add Kokoro TTS support: integrate new TTS model, configuration, and runtime components for enhanced multilingual voice synthesis. | 8 months ago |
| resources | - | First commit. | 8 months ago |
| src | - | Refactor `audio_generator/manager.py`: streamline imports, remove redundant modules in `register_all_tts`, and adjust dynamic import spec for improved maintainability. | 8 months ago |
| third_party | - | Update TTS inference to validate audio duration using soundfile. | 8 months ago |
| .gitattributes | 1.88 kB | Update .gitattributes to track GGUF files with git-lfs | 8 months ago |
| .gitignore | 5.03 kB | First commit. | 8 months ago |
| README.md | 12.7 kB | Update README.md: revise feature descriptions, add new Web API service section, update supported TTS models and characters, and include enhanced installation and usage instructions. | 8 months ago |
| requirements.txt | 458 Bytes | Update dependencies: add FunASR, FunASR-ONNX, FastAPI, and Uvicorn to requirements.txt | 8 months ago |