Add metadata, project page link and improve description

#1
by nielsr HF Staff - opened
Files changed (1)
  1. README.md +21 -3
README.md CHANGED
@@ -1,12 +1,30 @@
+ ---
+ license: apache-2.0
+ datasets:
+ - COCO
+ - ReasonSeg
+ - CountBench
+ language:
+ - en
+ metrics:
+ - accuracy
+ base_model:
+ - Qwen2.5-VL
+ pipeline_tag: image-text-to-text
+ library_name: transformers
+ ---
 
  # VisionReasoner-7B
 
+ [Paper](https://huggingface.co/papers/2505.12081)
 
- Code: https://github.com/dvlab-research/VisionReasoner
+ Code: [https://github.com/dvlab-research/VisionReasoner](https://github.com/dvlab-research/VisionReasoner)
+
+ Project page: [https://github.com/dvlab-research/VisionReasoner](https://github.com/dvlab-research/VisionReasoner)
 
  ## Description
 
- This is a VisionReasoner-7B model. It introduces a decoupled architecture consisting of a reasoning model and a segmentation model. The reasoning model interprets user intentions, generates explicit reasoning chains, and produces positional prompts, which are subsequently used by the segmentation model to generate pixel-level masks.
+ This is a VisionReasoner-7B model. It introduces a decoupled architecture consisting of a reasoning model and a segmentation model. The reasoning model interprets user intentions, generates explicit reasoning chains, and produces positional prompts, which are subsequently used by the segmentation model to generate pixel-level masks.
 
  ## Usage
 
@@ -17,4 +35,4 @@ import torch
  # load model
  model = AutoModelForCausalLM.from_pretrained("Ricky06662/VisionReasoner-7B")
  tokenizer = AutoTokenizer.from_pretrained("Ricky06662/VisionReasoner-7B")
- ```
+ ```
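
The Description in this README outlines a two-stage flow: the reasoning model emits a reasoning chain plus positional prompts, and a segmentation model turns those prompts into masks. A minimal sketch of the hand-off between the two stages is shown here. It assumes the reasoning model replies in a `<think>…</think><answer>[…]</answer>` layout with `bbox_2d` and `point_2d` fields; that layout, the `PositionalPrompt` type, and the `parse_reasoning_output` helper are illustrative assumptions, not something this README or PR confirms.

```python
import json
import re
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class PositionalPrompt:
    """Hypothetical container for one object's prompt to a segmentation model."""
    box: Tuple[float, ...]    # (x1, y1, x2, y2) in pixels
    point: Tuple[float, ...]  # (x, y) in pixels


def parse_reasoning_output(text: str) -> Tuple[str, List[PositionalPrompt]]:
    """Split a reply into its reasoning chain and a list of positional prompts.

    ASSUMPTION: the reply has the form
    <think>...</think><answer>[{"bbox_2d": [...], "point_2d": [...]}]</answer>.
    """
    think = re.search(r"<think>(.*?)</think>", text, re.DOTALL)
    answer = re.search(r"<answer>(.*?)</answer>", text, re.DOTALL)
    reasoning = think.group(1).strip() if think else ""
    if answer is None:
        return reasoning, []

    try:
        items = json.loads(answer.group(1))
    except json.JSONDecodeError:
        # Malformed answer: return the reasoning chain with no prompts.
        return reasoning, []
    if not isinstance(items, list):
        return reasoning, []

    prompts: List[PositionalPrompt] = []
    for item in items:
        if not isinstance(item, dict):
            continue
        box = item.get("bbox_2d")
        if not isinstance(box, list) or len(box) != 4:
            continue  # skip entries without a usable box
        point = item.get("point_2d")
        if not isinstance(point, list) or len(point) != 2:
            # Fall back to the box centre when no point is given.
            point = [(box[0] + box[2]) / 2, (box[1] + box[3]) / 2]
        prompts.append(PositionalPrompt(tuple(box), tuple(point)))
    return reasoning, prompts
```

Each returned `PositionalPrompt` would then be passed to whatever promptable segmentation model is in use to obtain one pixel-level mask per object; that second stage is left out because the README does not name the segmentation model.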