ShubhamK32
/

act_so101_declutter

+---
+license: apache-2.0
+base_model: lerobot/act
+tags:
+  - lerobot
+  - act
+  - robotics
+  - manipulation
+  - real-robot
+  - so101
+  - visuomotor
+datasets:
+  - ShubhamK32/so101_declutter_v1
+pipeline_tag: robotics
+---
+# ACT — SO-101 Space Decluttering
+ACT (Action Chunking Transformer) policy trained on the [SO-101 Space Decluttering Dataset v1](https://huggingface.co/datasets/ShubhamK32/so101_declutter_v1) for pick-and-place decluttering tasks on a 6-DoF SO-101 robotic arm. Trained using [LeRobot](https://github.com/huggingface/lerobot).
+## Training Details
+- **Policy:** ACT (Action Chunking Transformer)
+- **Steps:** 100,000
+- **Robot:** SO-101 6-DoF leader-follower
+- **Cameras:** Dual-view — fixed top-view + wrist-mounted egocentric
+- **Framework:** LeRobot
+## Dataset
+Trained on [ShubhamK32/so101_declutter_v1](https://huggingface.co/datasets/ShubhamK32/so101_declutter_v1) — a multi-view teleoperation dataset with spatial distractors injected to prevent visual shortcut learning.
+## Usage
+```python
+from lerobot.policies.act.modeling_act import ACTPolicy
+policy = ACTPolicy.from_pretrained("ShubhamK32/act_so101_declutter")
+```
+## Camera Views
+- `observation.images.topview` — Fixed overhead. Better for unoccluded pick-place tasks.
+- `observation.images.wristview` — Egocentric wrist-mounted. Better for overlapping and cluttered scenes.
+## Related
+- Dataset: [ShubhamK32/so101_declutter_v1](https://huggingface.co/datasets/ShubhamK32/so101_declutter_v1)
+- SmolVLA checkpoint: [ShubhamK32/smolvla_so101_declutter](https://huggingface.co/ShubhamK32/smolvla_so101_declutter)