Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Salesforce
/
GTA1-32B
like
6
Follow
Salesforce
2.08k
Image-Text-to-Text
Transformers
Safetensors
English
qwen2_5_vl
image-to-text
VLM
Computer-Use-Agent
OS-Agent
GUI
Grounding
conversational
text-generation-inference
arxiv:
2507.05791
License:
mit
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
GTA1-32B
Commit History
Update README.md
00b702f
verified
HelloKKMe
commited on
Oct 3, 2025
Update README.md
4593370
verified
HelloKKMe
commited on
Oct 3, 2025
Update README.md
d6a31b2
verified
HelloKKMe
commited on
Oct 3, 2025
Update README.md
e7b4192
verified
HelloKKMe
commited on
Oct 2, 2025
Update README.md
757ea39
verified
HelloKKMe
commited on
Oct 1, 2025
Update README.md
24ef646
verified
HelloKKMe
commited on
Oct 1, 2025
Update README.md
8c83a4c
verified
HelloKKMe
commited on
Oct 1, 2025
Update README.md
975aa67
verified
HelloKKMe
commited on
Sep 25, 2025
Update README.md
b4e6da3
verified
HelloKKMe
commited on
Sep 25, 2025
Update README.md
610cd6f
verified
HelloKKMe
commited on
Sep 25, 2025
Update README.md
665ae8c
verified
HelloKKMe
commited on
Sep 25, 2025
Upload folder using huggingface_hub
8e4436e
verified
HelloKKMe
commited on
Sep 25, 2025
initial commit
e23e1dc
verified
HelloKKMe
commited on
Sep 25, 2025