File size: 1,607 Bytes
4656358
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
---
license: apache-2.0
base_model: lerobot/act
tags:
  - lerobot
  - act
  - robotics
  - manipulation
  - real-robot
  - so101
  - visuomotor
datasets:
  - ShubhamK32/so101_declutter_v1
pipeline_tag: robotics
---

# ACT — SO-101 Space Decluttering

ACT (Action Chunking Transformer) policy trained on the [SO-101 Space Decluttering Dataset v1](https://huggingface.co/datasets/ShubhamK32/so101_declutter_v1) for pick-and-place decluttering tasks on a 6-DoF SO-101 robotic arm. Trained using [LeRobot](https://github.com/huggingface/lerobot).

## Training Details

- **Policy:** ACT (Action Chunking Transformer)
- **Steps:** 100,000
- **Robot:** SO-101 6-DoF leader-follower
- **Cameras:** Dual-view — fixed top-view + wrist-mounted egocentric
- **Framework:** LeRobot

## Dataset

Trained on [ShubhamK32/so101_declutter_v1](https://huggingface.co/datasets/ShubhamK32/so101_declutter_v1) — a multi-view teleoperation dataset with spatial distractors injected to prevent visual shortcut learning.

## Usage
```python
from lerobot.policies.act.modeling_act import ACTPolicy

policy = ACTPolicy.from_pretrained("ShubhamK32/act_so101_declutter")
```

## Camera Views

- `observation.images.topview` — Fixed overhead. Better for unoccluded pick-place tasks.
- `observation.images.wristview` — Egocentric wrist-mounted. Better for overlapping and cluttered scenes.

## Related

- Dataset: [ShubhamK32/so101_declutter_v1](https://huggingface.co/datasets/ShubhamK32/so101_declutter_v1)
- SmolVLA checkpoint: [ShubhamK32/smolvla_so101_declutter](https://huggingface.co/ShubhamK32/smolvla_so101_declutter)