nielsr HF Staff commited on
Commit
f0afbc9
·
verified ·
1 Parent(s): 4dec8af

Add metadata and project links

Browse files

Hi! I'm Niels from the Hugging Face community science team.

I've opened this PR to improve the discoverability and documentation of this model:
- Added `pipeline_tag: robotics` to categorize the model.
- Added `library_name: transformers` as the model architecture is supported by the library.
- Included a link to the official project website.

Please review and merge if this looks good!

Files changed (1) hide show
  1. README.md +7 -6
README.md CHANGED
@@ -1,19 +1,20 @@
1
  ---
2
- license: cc-by-4.0
 
3
  datasets:
4
  - teetone/RoboReward
5
  language:
6
  - en
7
- base_model:
8
- - Qwen/Qwen3-VL-4B-Instruct
 
9
  ---
10
 
11
  # RoboReward 4B
12
 
13
- **Paper:** [https://arxiv.org/abs/2601.00675](https://arxiv.org/abs/2601.00675)
14
-
15
- RoboReward provides **general-purpose vision-language reward model for robotics**, trained on the [RoboReward dataset](https://huggingface.co/datasets/teetone/RoboReward) with **Qwen-3 VL** to predict **discrete end-of-episode progress rewards** from real-robot rollout videos.
16
 
 
17
 
18
  ## Usage
19
 
 
1
  ---
2
+ base_model:
3
+ - Qwen/Qwen3-VL-4B-Instruct
4
  datasets:
5
  - teetone/RoboReward
6
  language:
7
  - en
8
+ license: cc-by-4.0
9
+ library_name: transformers
10
+ pipeline_tag: robotics
11
  ---
12
 
13
  # RoboReward 4B
14
 
15
+ [**Project Website**](https://crfm.stanford.edu/helm/robo-reward-bench) | **Paper:** [https://arxiv.org/abs/2601.00675](https://arxiv.org/abs/2601.00675)
 
 
16
 
17
+ RoboReward provides **general-purpose vision-language reward models for robotics**, trained on the [RoboReward dataset](https://huggingface.co/datasets/teetone/RoboReward) with **Qwen-3 VL** to predict **discrete end-of-episode progress rewards** from real-robot rollout videos.
18
 
19
  ## Usage
20