Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
amete7
/
qvla
like
0
Image-Text-to-Text
Transformers
Safetensors
English
molmo
text-generation
multimodal
olmo
pixmo
conversational
custom_code
arxiv:
2409.17146
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
qvla
Commit History
fixed initialization
dd4b88e
Atharva Mete
commited on
Jan 11, 2025
vla added but giving nans in loss
57b4d23
Atharva Mete
commited on
Jan 10, 2025
original molmo
303e3cf
Atharva Mete
commited on
Jan 7, 2025
initial commit
848f2b3
verified
amete7
commited on
Jan 7, 2025