Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
MoYoYoTech
/
VoiceDialogue
like
2
Follow
MoYoYoTech
23
Text-to-Speech
Transformers
ONNX
GGUF
Chinese
English
voice-dialogue
speech-recognition
large-language-model
asr
tts
llm
chinese
english
real-time
conversational
License:
mit
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
5284873
VoiceDialogue
13.3 GB
3 contributors
History:
21 commits
liumaolin
Refactor speech processing: add type hint for `_process_active_voice_frame` and replace `max()` with `np.max()` for consistency.
5284873
7 months ago
models
Update FunASR punc quantized model.
7 months ago
resources
First commit.
7 months ago
src
Refactor speech processing: add type hint for `_process_active_voice_frame` and replace `max()` with `np.max()` for consistency.
7 months ago
third_party
Update TTS inference to validate audio duration using soundfile.
7 months ago
.gitattributes
1.88 kB
Update .gitattributes to track GGUF files with git-lfs
7 months ago
.gitignore
5.03 kB
First commit.
7 months ago
README.md
13.4 kB
Integrate FunASR service.
7 months ago
requirements.txt
391 Bytes
Update README for enhanced installation and usage guidance.
7 months ago