Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
paoloski97
's Collections
3D
Image-Generation
Embedding
Med
Mesh
Point tracking
Materials
Multimodal+ImageGen
Dataset
Traduttori
Detection
Depth
Video
Stable Cascade Models
Wuerstchen
Image Captioning
Stable DIffusion 3
HunyuanDiT
Audio
LLM
AuraFlow
Flux
Text_to_Audio
Repository
Multimodal
UltraPixel
Forecasting
OCR
llamafile
Audio
updated
Nov 19, 2025
Upvote
-
CAMB-AI/MARS5-TTS
Text-to-Speech
•
Updated
Jul 5, 2024
•
55
•
481
suno/bark
Text-to-Speech
•
Updated
Oct 4, 2023
•
18.5k
•
1.52k
fishaudio/fish-speech-1.4
Text-to-Speech
•
Updated
Nov 5, 2024
•
1k
•
457
nyrahealth/CrisperWhisper
Automatic Speech Recognition
•
2B
•
Updated
Apr 7
•
88.7k
•
331
fishaudio/fish-speech-1.5
Text-to-Speech
•
Updated
Mar 25, 2025
•
6.43k
•
743
hexgrad/Kokoro-82M
Text-to-Speech
•
Updated
Apr 10, 2025
•
9.7M
•
•
6.16k
NexaAI/OmniAudio-2.6B
Audio-Text-to-Text
•
3B
•
Updated
Dec 13, 2024
•
1.07k
•
289
m-a-p/YuE-s1-7B-anneal-en-cot
Text Generation
•
6B
•
Updated
Mar 12, 2025
•
8.48k
•
448
maya-research/maya1
Text-to-Speech
•
Updated
Nov 12, 2025
•
5.8k
•
882
Upvote
-
Share collection
View history
Collection guide
Browse collections