Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
AudioVisual-Caption
/
ASID-Captioner-3B
like
1
Follow
ASID-Caption
2
Image-Text-to-Text
Transformers
Safetensors
English
qwen2_5_omni
video-captioning
audiovisual
qwen2.5-omni
instruction-tuning
attribute-structured
quality-verified
conversational
arxiv:
2602.13013
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
lyhisme
commited on
7 days ago
Commit
a5824a3
·
verified
·
1 Parent(s):
0b1dd39
Update README.md
Browse files
Files changed (1)
hide
show
README.md
+0
-1
README.md
CHANGED
Viewed
@@ -1,4 +1,3 @@
1
-
```markdown
2
---
3
language:
4
- en
1
---
2
language:
3
- en