lakomchik committed on
Commit dc88e82 · verified · 1 Parent(s): 1f89dd8

Update README.md

  1. README.md +7 -7
README.md CHANGED
@@ -4,7 +4,7 @@ license: apache-2.0
 language:
 - en
 base_model:
-- SberRoboticsCenter/GreenVLA-5b-base-stride-1
+- SberRoboticsCenter/GreenVLA-5b-base-stride-4
 pipeline_tag: robotics
 tags:
 - robotics
@@ -17,7 +17,7 @@ tags:
 - calvin
 - reinforcement-learning
 model-index:
-- name: GreenVLA-5b-stride-1-R2-calvin
+- name: GreenVLA-5b-stride-4-R2-calvin
   results:
   - task:
       type: robotics
@@ -30,7 +30,7 @@ model-index:
 
 <div align="center">
 
-# GreenVLA-5b-stride-1-R2-calvin
+# GreenVLA-5b-stride-4-R2-calvin
 
 ### RL-Aligned VLA for CALVIN
 
@@ -46,9 +46,9 @@ model-index:
 
 ## Overview
 
-**GreenVLA-5b-stride-1-R2-calvin** is the R2 (RL-aligned) checkpoint of the [Green-VLA](https://arxiv.org/abs/2602.00919) family, fine-tuned for the CALVIN benchmark environment.
+**GreenVLA-5b-stride-4-R2-calvin** is the R2 (RL-aligned) checkpoint of the [Green-VLA](https://arxiv.org/abs/2602.00919) family, fine-tuned for the CALVIN benchmark environment.
 
-Starting from [GreenVLA-5b-base-stride-1](https://huggingface.co/SberRoboticsCenter/GreenVLA-5b-base-stride-1), this model went through both R1 (supervised fine-tuning) and R2 (RL policy alignment) stages on CALVIN data.
+Starting from [GreenVLA-5b-base-stride-4](https://huggingface.co/SberRoboticsCenter/GreenVLA-5b-base-stride-4), this model went through both R1 (supervised fine-tuning) and R2 (RL policy alignment) stages on CALVIN data.
 
 ## Evaluation
 
@@ -62,7 +62,7 @@ Evaluated on the **CALVIN** benchmark:
 
 | | Details |
 |---|---|
-| **Base checkpoint** | [GreenVLA-5b-base-stride-1](https://huggingface.co/SberRoboticsCenter/GreenVLA-5b-base-stride-1) |
+| **Base checkpoint** | [GreenVLA-5b-base-stride-4](https://huggingface.co/SberRoboticsCenter/GreenVLA-5b-base-stride-4) |
 | **Stage** | R2 — RL policy alignment |
 | **Method** | Trajectory optimization (SFT + RL on collected trajectories) |
 | **Environment** | CALVIN |
@@ -91,7 +91,7 @@ from lerobot.common.utils.torch_observation import (
 
 # 1. Load policy and transforms.
 policy, input_transforms, output_transforms = load_pretrained_policy(
-    "SberRoboticsCenter/GreenVLA-5b-stride-1-R2-calvin",
+    "SberRoboticsCenter/GreenVLA-5b-stride-4-R2-calvin",
     data_config_name="calvin",
 )
 policy.to("cuda").eval()
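
Note: this commit renames the same stride identifier in seven places (front matter, title, overview, details table, and usage snippet). A partial rename would leave the card pointing at two different checkpoints. The sketch below shows one way such a mismatch could be detected. It is a standalone illustration, not part of the repository: the helper names and the regular expression are assumptions based only on the identifiers visible in this diff.

```python
import re

# Hypothetical pattern: matches identifiers such as
# "GreenVLA-5b-base-stride-4" and "GreenVLA-5b-stride-4-R2-calvin"
# and captures the stride number.
STRIDE_PATTERN = re.compile(r"GreenVLA-5b-(?:base-)?stride-(\d+)")


def strides_in_card(card_text: str) -> set[int]:
    """Return every distinct stride value mentioned in the card text."""
    return {int(value) for value in STRIDE_PATTERN.findall(card_text)}


def is_stride_consistent(card_text: str) -> bool:
    """True when the card mentions at most one distinct stride value."""
    return len(strides_in_card(card_text)) <= 1


if __name__ == "__main__":
    # A fragment resembling the card after this commit.
    card = (
        "base_model:\n"
        "- SberRoboticsCenter/GreenVLA-5b-base-stride-4\n"
        "# GreenVLA-5b-stride-4-R2-calvin\n"
    )
    print(strides_in_card(card), is_stride_consistent(card))
```

A check like this could run over `README.md` before committing. It only inspects names and says nothing about whether the referenced checkpoints exist.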