Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -7,6 +7,8 @@ tags:
|
|
| 7 |
- lerobot
|
| 8 |
- libero
|
| 9 |
- manipulation
|
|
|
|
|
|
|
| 10 |
---
|
| 11 |
|
| 12 |
# SmolVLA Fine-tuned on LIBERO-Spatial
|
|
@@ -14,6 +16,14 @@ tags:
|
|
| 14 |
This is a fine-tuned version of [lerobot/smolvla_base](https://huggingface.co/lerobot/smolvla_base)
|
| 15 |
trained on the LIBERO-Spatial benchmark using the LeRobot framework.
|
| 16 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 17 |
## Model Details
|
| 18 |
|
| 19 |
- **Base model:** lerobot/smolvla_base
|
|
@@ -39,9 +49,7 @@ trained on the LIBERO-Spatial benchmark using the LeRobot framework.
|
|
| 39 |
| task_9 | 30% |
|
| 40 |
| **Overall** | **36%** |
|
| 41 |
|
| 42 |
-
## Training
|
| 43 |
-
|
| 44 |
-
Fine-tuned using the LeRobot training pipeline:
|
| 45 |
```bash
|
| 46 |
lerobot-train \
|
| 47 |
--policy.type=smolvla \
|
|
@@ -52,6 +60,20 @@ lerobot-train \
|
|
| 52 |
--seed=42
|
| 53 |
```
|
| 54 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 55 |
## Framework
|
| 56 |
|
| 57 |
- [LeRobot](https://github.com/huggingface/lerobot)
|
|
|
|
| 7 |
- lerobot
|
| 8 |
- libero
|
| 9 |
- manipulation
|
| 10 |
+
datasets:
|
| 11 |
+
- HuggingFaceVLA/libero
|
| 12 |
---
|
| 13 |
|
| 14 |
# SmolVLA Fine-tuned on LIBERO-Spatial
|
|
|
|
| 16 |
This is a fine-tuned version of [lerobot/smolvla_base](https://huggingface.co/lerobot/smolvla_base)
|
| 17 |
trained on the LIBERO-Spatial benchmark using the LeRobot framework.
|
| 18 |
|
| 19 |
+
## Demo Video
|
| 20 |
+
|
| 21 |
+
Task 8 success episode (70% success rate on this task):
|
| 22 |
+
|
| 23 |
+
<video controls>
|
| 24 |
+
<source src="https://huggingface.co/Beeface/smolvla-libero-spatial/resolve/main/videos/task_8_success.mp4" type="video/mp4">
|
| 25 |
+
</video>
|
| 26 |
+
|
| 27 |
## Model Details
|
| 28 |
|
| 29 |
- **Base model:** lerobot/smolvla_base
|
|
|
|
| 49 |
| task_9 | 30% |
|
| 50 |
| **Overall** | **36%** |
|
| 51 |
|
| 52 |
+
## Training Command
|
|
|
|
|
|
|
| 53 |
```bash
|
| 54 |
lerobot-train \
|
| 55 |
--policy.type=smolvla \
|
|
|
|
| 60 |
--seed=42
|
| 61 |
```
|
| 62 |
|
| 63 |
+
## Ablation Study — Training Duration
|
| 64 |
+
|
| 65 |
+
We evaluated checkpoints at multiple steps to understand convergence:
|
| 66 |
+
|
| 67 |
+
| Training Steps | Success Rate |
|
| 68 |
+
|---------------|-------------|
|
| 69 |
+
| 2,000 | 2% |
|
| 70 |
+
| 6,000 | 17% |
|
| 71 |
+
| 10,000 | 31% |
|
| 72 |
+
| 20,000 | 36% |
|
| 73 |
+
|
| 74 |
+
Performance improves consistently but with diminishing returns, suggesting
|
| 75 |
+
convergence begins around 10K steps on LIBERO-Spatial.
|
| 76 |
+
|
| 77 |
## Framework
|
| 78 |
|
| 79 |
- [LeRobot](https://github.com/huggingface/lerobot)
|