Commit d8985a8 (verified) · committed by chrisc36 · 1 parent: df1e3b6

Update README.md

Files changed (1)
  1. README.md +3 -4
README.md CHANGED
@@ -3,8 +3,7 @@ license: apache-2.0
 language:
 - en
 base_model:
-- Qwen/Qwen3-8B
-- google/siglip-so400m-patch14-384
+- allenai/MolmoPoint-8B
 pipeline_tag: image-text-to-text
 tags:
 - multimodal
@@ -14,8 +13,8 @@ tags:
 - molmo_point
 ---
 
-# MolmoPoint-Img-8B
-MolmoPoint-Img-8B is a fully-open VLM developed by the Allen Institute for AI (Ai2) that is specialized for GUI pointing.
+# MolmoPoint-GUI-8B
+MolmoPoint-GUI-8B is a fully-open VLM developed by the Allen Institute for AI (Ai2) that is specialized for GUI pointing.
 As specialized model, it only supports single image input with instruction-like queries, and will output a single point.
 See MolmoPoint-8B for a generalist model.
 MolmoPoint-Img-8B points using grounding-tokens instead of text coordinates, see our paper for details.