MoYoYoTech/VoiceDialogue
Tags: Text-to-Speech, Transformers, ONNX, GGUF, Chinese, English, voice-dialogue, speech-recognition, large-language-model, asr, tts, llm, chinese, english, real-time, conversational
License: mit
d29b312
VoiceDialogue
15.9 GB
3 contributors
History: 57 commits
Latest commit: Xin Zhang, "www", d29b312, 11 months ago
Name | Size | Last commit message | Updated
assets | - | www | 11 months ago
src | - | Handle `UnboundLocalError` in punctuation model lookup: add exception handling to ensure stability during transcription. | 11 months ago
third_party | - | Refactor imports in `TextPreprocessor.py` and `inference_webui.py`: switch to explicit relative imports for `LangSegment` to improve clarity and maintainability. | 11 months ago
.gitattributes | 2.17 kB | www | 11 months ago
.gitignore | 5.03 kB | First commit. | 11 months ago
README.md | 13.2 kB | Update README.md: clarify usage details, add dynamic speaker management, and refine documentation for consistency and completeness. | 11 months ago
requirements.txt | 458 Bytes | Update dependencies: add FunASR, FunASR-ONNX, FastAPI, and Uvicorn to requirements.txt | 11 months ago