Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ZKong
's Collections
flux2
LTX2
qwen-image-edit
PyWheels
pose
dataset
Segment
hunyuan-video
Z-Image
tts
ocr
VL
qwen image
upscale
vae
wan2.2
qwen
sound
flux-kontext
image-process
prompt
面部AI
encoder
video
translate翻译
motionCapture
flux
3D
image
audio
audio
updated
Jul 16, 2025
Upvote
-
google-t5/t5-base
Translation
•
0.2B
•
Updated
Feb 14, 2024
•
2.46M
•
•
761
stabilityai/stable-audio-open-1.0
Text-to-Audio
•
Updated
Jun 19, 2025
•
24k
•
1.4k
Kijai/MMAudio_safetensors
Updated
Dec 11, 2024
•
67
nvidia/bigvgan_v2_44khz_128band_512x
Audio-to-Audio
•
Updated
Sep 5, 2024
•
528k
•
67
hexgrad/Kokoro-82M
Text-to-Speech
•
Updated
Apr 10, 2025
•
2.24M
•
•
5.63k
mistralai/Voxtral-Mini-3B-2507
5B
•
Updated
Jul 28, 2025
•
458k
•
609
mistralai/Voxtral-Small-24B-2507
Audio-Text-to-Text
•
24B
•
Updated
Dec 20, 2025
•
67.8k
•
444
Upvote
-
Share collection
View history
Collection guide
Browse collections