Robotics
Safetensors
tanshh97 commited on
Commit
3ce6162
Β·
verified Β·
1 Parent(s): 91a8aac

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +63 -0
README.md ADDED
@@ -0,0 +1,63 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # πŸ’ͺ RIPT-VLA: Reinforcement Interactive Post-Training for Vision-Language-Action Models
2
+
3
+ **Authors**: Shuhan Tan, Kairan Dou, Yue Zhao, Philipp KrΓ€henbΓΌhl
4
+ **Codebase**: [GitHub – RIPT-VLA](https://github.com/Ariostgx/ript-vla)
5
+ **Website**: [Project Page](https://ariostgx.github.io/ript_vla/)
6
+
7
+ > **RIPT-VLA** enables interactive post-training for any pretrained Vision-Language-Action (VLA) model using only **sparse binary success rewards**.
8
+ > With **K-rollout interaction**, **dynamic sampling**, and **leave-one-out advantage estimation**, RIPT-VLA achieves **state-of-the-art** performance in extremely low-data regimes.
9
+
10
+ ---
11
+
12
+ ## 🧠 Model Summary
13
+
14
+ RIPT-VLA takes a pretrained VLA model (e.g., QueST or OpenVLA-OFT) and improves its performance by fine-tuning it with reinforcement learning based on success/failure signals only β€” no dense rewards or value functions required.
15
+
16
+ Supported models:
17
+ - βœ… QueST (small, efficient)
18
+ - βœ… OpenVLA-OFT (large-scale, high-capacity)
19
+
20
+ ---
21
+
22
+ ## πŸ§ͺ Model Use
23
+
24
+ ### βœ… Intended Use
25
+
26
+ - Research on post-training VLA models via RL
27
+ - Evaluation on LIBERO benchmarks (LIBERO-90, Goal, Object, Spatial, Long)
28
+ - Studying low-data reinforcement learning settings
29
+
30
+ ---
31
+
32
+ ## πŸ“¦ Checkpoints
33
+
34
+ All checkpoints are hosted here in this repository.
35
+
36
+ ### βœ”οΈ QueST Checkpoints
37
+
38
+ | Suite | SFT Checkpoint | RIPT Checkpoint |
39
+ |------------------|----------------|-----------------|
40
+ | LIBERO-90 | βœ… | βœ… |
41
+ | LIBERO-GOAL | βœ… | βœ… |
42
+ | LIBERO-LONG | βœ… | βœ… |
43
+ | LIBERO-OBJECT | βœ… | βœ… |
44
+ | LIBERO-SPATIAL | βœ… | βœ… |
45
+
46
+ Each QueST checkpoint is ~80MB.
47
+
48
+ ### βœ”οΈ OpenVLA-OFT Checkpoints
49
+
50
+ | Suite | SFT Scale Head | RIPT LoRA Adaptor |
51
+ |------------------|----------------|--------------------|
52
+ | LIBERO-GOAL | βœ… | βœ… |
53
+ | LIBERO-LONG | βœ… | βœ… |
54
+ | LIBERO-OBJECT | βœ… | βœ… |
55
+ | LIBERO-SPATIAL | βœ… | βœ… |
56
+
57
+ OpenVLA-OFT scale heads are ~300MB; RIPT LoRA adaptors are ~1GB.
58
+
59
+ ---
60
+
61
+ ## πŸ›  How to Use
62
+
63
+ For usage, see [INSTALL.md](https://github.com/Ariostgx/ript-vla/blob/main/INSTALL.md) in the main GitHub repo.