Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
seraph9999
's Collections
XXX
Image-to-Video
Text-to-Image
LLM
VLM
Text-to-Video
ASR
Embedding
Multi-Modal
Forecasting
Multi-Modal
updated
Aug 11, 2024
Upvote
-
Qwen/Qwen2-Audio-7B-Instruct
Audio-Text-to-Text
•
8B
•
Updated
Jan 12, 2025
•
550k
•
515
Upvote
-
Share collection
View history
Collection guide
Browse collections