bezzam's Collections
Multimodel audio
Updated 24 days ago
facebook/seamless-m4t-v2-large • Automatic Speech Recognition • 2B • Updated Jan 4, 2024 • 187k • 935
stepfun-ai/Step-Audio-2-mini • Any-to-Any • 8B • Updated Sep 5, 2025 • 1.22k • 241
bosonai/higgs-audio-v2-generation-3B-base • Text-to-Speech • 6B • Updated Jul 28, 2025 • 291k • 647