microsoft/OmniParser
Tags: Image-Text-to-Text, Transformers, Safetensors, blip-2, visual-question-answering
arXiv: 2408.00203
License: mit
Revision: refs/pr/9 (8.58 GB, 3 contributors, history: 13 commits)
Latest commit: f60022a (verified) by securemy, over 1 year ago
Commit message: # Load model directly from transformers import AutoProcessor, AutoModelForVisualQuestionAnswering processor = AutoProcessor.from_pretrained("microsoft/OmniParser") model = AutoModelForVisualQuestionAnswering.from_pretrained("microsoft/OmniParser")
Name | Scan | Size | Last commit message | Updated
icon_caption_blip2/ | | | remove wrong safetensor ckpt | over 1 year ago
icon_caption_florence/ | | | update readme, add safetensor | over 1 year ago
icon_detect/ | | | update readme, add safetensor | over 1 year ago
.gitattributes | Safe | 1.52 kB | initial commit | over 1 year ago
README.md | | 2.97 kB | # Load model directly from transformers import AutoProcessor, AutoModelForVisualQuestionAnswering processor = AutoProcessor.from_pretrained("microsoft/OmniParser") model = AutoModelForVisualQuestionAnswering.from_pretrained("microsoft/OmniParser") | over 1 year ago
config.json | Safe | 985 Bytes | update | over 1 year ago