File size: 671 Bytes
a2a7f3e
5645b8a
a2a7f3e
a45462c
a2a7f3e
 
5645b8a
a2a7f3e
5645b8a
bee6e5a
5645b8a
a2a7f3e
5645b8a
a2a7f3e
 
a45462c
a2a7f3e
a45462c
a2a7f3e
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
# Curious-VLA-dev

  This repository hosts the development release assets for Curious-VLA, including:

  - SFT training data
  - SFT-stage model weights

  For full documentation, environment setup, evaluation, SFT training, and GRPO training, please refer to the main project:

  - GitHub: [github.com/Mashiroln/Curious-VLA](https://github.com/Mashiroln/curious_vla)

  ## Recommended Usage

  - baseline data + Qwen2.5-VL
  - golden data + Qwen3-VL

  ## Notes

  - The SFT-stage model can be used as the starting point for GRPO training in the main Curious-VLA project.
  - For RL training data, released token filters, and end-to-end scripts, see the main repository.