Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
seraph9999
's Collections
XXX
Image-to-Video
Text-to-Image
LLM
VLM
Text-to-Video
ASR
Embedding
Multi-Modal
Forecasting
Multi-Modal
updated
Aug 11, 2024
Upvote
-
Qwen/Qwen2-Audio-7B-Instruct
Audio-Text-to-Text
•
Updated
Jan 12, 2025
•
395k
•
530
Upvote
-
Share collection
View history
Collection guide
Browse collections