Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
AudioVisual-Caption
/
ASID-Captioner-3B
like
1
Follow
ASID-Caption
2
Image-Text-to-Text
Transformers
Safetensors
English
qwen2_5_omni
video-captioning
audiovisual
qwen2.5-omni
instruction-tuning
attribute-structured
quality-verified
conversational
arxiv:
2602.13013
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
ASID-Captioner-3B
/
merges.txt
lyhisme
Upload folder using huggingface_hub
393feb7
verified
8 days ago
raw
Copy download link
history
contribute
delete
Safe
1.67 MB
File too large to display, you can
check the raw version
instead.