DruryXu committed on
Commit ada3a92 · verified · 1 Parent(s): 4ca5e3c

update README.md

Files changed (1)
  1. README.md +4 -0
README.md CHANGED
@@ -8,6 +8,10 @@ pipeline_tag: image-text-to-text
 ---
 # TBAC-VLR1-3B-preview
 
+## Overview
+This is a multimodal language model fine-tuned by Tencent PCG Basic Algorithm Center. Based on Qwen2.5-VL-3B-Instruct, TBAC-VLR1-3B-preview uses Group Relative Policy Optimization
+(GRPO) to enhance multimodal reasoning ability, achieving state-of-the-art results on several multimodal reasoning benchmarks among models of the same size.
+
 ## Performance
 | Model | **Average** | **MathVista**| **MathVision** | **MathVerse** | **DynaMath** | **WeMath**| **LogicVista** |
 | :-------------------: | :---------: | :-----------:| :------------: | :-----------: | :-----------: | :-------: | :----------: |