MoYoYoTech/VoiceDialogue
Text-to-Speech
Transformers
ONNX
GGUF
Chinese
English
voice-dialogue
speech-recognition
large-language-model
asr
tts
llm
chinese
english
real-time
conversational
License: MIT
VoiceDialogue (revision 87a7384, 13.3 GB)
3 contributors
History: 34 commits
Latest commit 87a7384 by liumaolin, 11 months ago: Comment out logging statements in `audio_player.py` to disable performance logs and streamline runtime output.
| Name | Size | Last commit | Updated |
| --- | --- | --- | --- |
| models | - | Update FunASR punc quantized model. | 11 months ago |
| resources | - | First commit. | 11 months ago |
| src | - | Comment out logging statements in `audio_player.py` to disable performance logs and streamline runtime output. | 11 months ago |
| third_party | - | Update TTS inference to validate audio duration using soundfile. | 11 months ago |
| .gitattributes | 1.88 kB | Update .gitattributes to track GGUF files with git-lfs | 11 months ago |
| .gitignore | 5.03 kB | First commit. | 11 months ago |
| README.md | 13.4 kB | Integrate FunASR service. | 11 months ago |
| requirements.txt | 458 Bytes | Update dependencies: add FunASR, FunASR-ONNX, FastAPI, and Uvicorn to requirements.txt | 11 months ago |