Upload README.md with huggingface_hub
Browse files
README.md
ADDED
|
@@ -0,0 +1,48 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
base_model: lerobot/act
|
| 4 |
+
tags:
|
| 5 |
+
- lerobot
|
| 6 |
+
- act
|
| 7 |
+
- robotics
|
| 8 |
+
- manipulation
|
| 9 |
+
- real-robot
|
| 10 |
+
- so101
|
| 11 |
+
- visuomotor
|
| 12 |
+
datasets:
|
| 13 |
+
- ShubhamK32/so101_declutter_v1
|
| 14 |
+
pipeline_tag: robotics
|
| 15 |
+
---
|
| 16 |
+
|
| 17 |
+
# ACT — SO-101 Space Decluttering
|
| 18 |
+
|
| 19 |
+
ACT (Action Chunking Transformer) policy trained on the [SO-101 Space Decluttering Dataset v1](https://huggingface.co/datasets/ShubhamK32/so101_declutter_v1) for pick-and-place decluttering tasks on a 6-DoF SO-101 robotic arm. Trained using [LeRobot](https://github.com/huggingface/lerobot).
|
| 20 |
+
|
| 21 |
+
## Training Details
|
| 22 |
+
|
| 23 |
+
- **Policy:** ACT (Action Chunking Transformer)
|
| 24 |
+
- **Steps:** 100,000
|
| 25 |
+
- **Robot:** SO-101 6-DoF leader-follower
|
| 26 |
+
- **Cameras:** Dual-view — fixed top-view + wrist-mounted egocentric
|
| 27 |
+
- **Framework:** LeRobot
|
| 28 |
+
|
| 29 |
+
## Dataset
|
| 30 |
+
|
| 31 |
+
Trained on [ShubhamK32/so101_declutter_v1](https://huggingface.co/datasets/ShubhamK32/so101_declutter_v1) — a multi-view teleoperation dataset with spatial distractors injected to prevent visual shortcut learning.
|
| 32 |
+
|
| 33 |
+
## Usage
|
| 34 |
+
```python
|
| 35 |
+
from lerobot.policies.act.modeling_act import ACTPolicy
|
| 36 |
+
|
| 37 |
+
policy = ACTPolicy.from_pretrained("ShubhamK32/act_so101_declutter")
|
| 38 |
+
```
|
| 39 |
+
|
| 40 |
+
## Camera Views
|
| 41 |
+
|
| 42 |
+
- `observation.images.topview` — Fixed overhead. Better for unoccluded pick-place tasks.
|
| 43 |
+
- `observation.images.wristview` — Egocentric wrist-mounted. Better for overlapping and cluttered scenes.
|
| 44 |
+
|
| 45 |
+
## Related
|
| 46 |
+
|
| 47 |
+
- Dataset: [ShubhamK32/so101_declutter_v1](https://huggingface.co/datasets/ShubhamK32/so101_declutter_v1)
|
| 48 |
+
- SmolVLA checkpoint: [ShubhamK32/smolvla_so101_declutter](https://huggingface.co/ShubhamK32/smolvla_so101_declutter)
|