File size: 2,199 Bytes
c5ccfcf
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
---
tags:
  - OpenRAL
  - rskill
  - act
  - lerobot
  - aloha
  - bimanual
  - manipulation
  - insertion
license: mit
language:
  - en
---

# rskill-act-aloha-insertion

> **OpenRAL rSkill (custom example)** — ACT (Action Chunking Transformer)
> finetuned on the ALOHA bimanual **peg-insertion** task, packaged for
> `OpenRAL`.

This package wraps
[`lerobot/act_aloha_sim_insertion_human`](https://huggingface.co/lerobot/act_aloha_sim_insertion_human)
with a `rskill.yaml` manifest that adds capability checking, license
surfacing, latency budgets, and local registry integration. It does
**not** copy model weights.

It is the harder sibling of [`rskill-act-aloha`](../act-aloha) (cube
transfer) and demonstrates how a single packaging format covers multiple
task-specific checkpoints from the same paper. The runnable demo lives at
`examples/sim/custom_act_aloha_insertion.yaml` and is wired into the
top-level `just sim-custom` recipe.

## Upstream model

| Field | Value |
| --- | --- |
| Source repo | [`lerobot/act_aloha_sim_insertion_human`](https://huggingface.co/lerobot/act_aloha_sim_insertion_human) |
| Architecture | Action Chunking Transformer (~52M params, chunk=100) |
| Task | gym-aloha `AlohaInsertion-v0` (bimanual peg-in-socket) |
| License | MIT |
| Paper | Zhao et al., 2023 — *Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware* ([arXiv 2304.13705](https://arxiv.org/abs/2304.13705)) |

## Why no `eval/` block?

This skill is shipped as a **custom-example** package, not as a
reproduced benchmark entry. The paper's headline number for sim ALOHA
insertion is markedly lower than the cube-transfer figure (the task is
harder and the upstream protocol uses different camera intrinsics). We
deliberately omit `eval/` rather than copy paper numbers without an
internal reproduction; per CLAUDE.md §6.4 that omission must be
documented — this section is that documentation. Add `eval/aloha_insertion.json`
once a local reproduction lands.

## Run it

```bash
just sim-custom
```

…which is equivalent to:

```bash
MUJOCO_GL=egl uv run --group sim ral sim run \
  --config examples/sim/custom_act_aloha_insertion.yaml \
  --save-video example_videos
```