Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
prithivMLmods
/
DeepAttriCap-VLA-3B
like
3
Image-Text-to-Text
Transformers
Safetensors
4 datasets
English
doi:10.57967/hf/6400
qwen2_5_vl
trl
VisualUnderstanding
text-generation-inference
VisionLanguageAttribution
AttributeCaptioning
VLA
conversational
Model card
Files
Files and versions
xet
Community
3
Deploy
Use this model
main
DeepAttriCap-VLA-3B
/
README.md
Commit History
Update README.md
08bf25d
verified
prithivMLmods
commited on
Aug 28, 2025
Update README.md
ab2917c
verified
prithivMLmods
commited on
Aug 28, 2025
Update README.md
a3ccc6a
verified
prithivMLmods
commited on
Aug 28, 2025
Update README.md
545e5b7
verified
prithivMLmods
commited on
Aug 28, 2025
Update README.md
3fa92fc
verified
prithivMLmods
commited on
Aug 28, 2025
Update README.md
d163959
verified
prithivMLmods
commited on
Aug 28, 2025
Update README.md
fe6c2dc
verified
prithivMLmods
commited on
Aug 28, 2025
Update README.md
e09a141
verified
prithivMLmods
commited on
Aug 28, 2025
initial commit
056f41e
verified
prithivMLmods
commited on
Aug 28, 2025