File size: 3,416 Bytes
f19fbdd
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
d4442f6
f19fbdd
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
d4442f6
f19fbdd
 
 
 
 
d4442f6
f19fbdd
 
 
 
 
 
 
 
 
 
 
 
 
 
d4442f6
f19fbdd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
---
tags:
  - OpenRAL
  - rskill
  - act
  - lerobot
  - aloha
  - bimanual
  - manipulation
license: mit
language:
  - en
---

# rskill-act-aloha

> **OpenRAL rSkill** β€” ACT (Action Chunking Transformer) finetuned on
> the ALOHA bimanual cube-transfer task, packaged for `OpenRAL`.

This package wraps
[`lerobot/act_aloha_sim_transfer_cube_human`](https://huggingface.co/lerobot/act_aloha_sim_transfer_cube_human)
with a `rskill.yaml` manifest that adds capability checking, license
surfacing, latency budgets, and local registry integration. It does
**not** copy model weights.

## Upstream model

| Field | Value |
| --- | --- |
| Source repo | [`lerobot/act_aloha_sim_transfer_cube_human`](https://huggingface.co/lerobot/act_aloha_sim_transfer_cube_human) |
| Paper | [arxiv:2304.13705](https://arxiv.org/abs/2304.13705) β€” *Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware* (Zhao et al., 2023) |
| License | MIT |
| Parameters | ~52 M (transformer encoder-decoder) |
| Action chunk | 100 |
| Benchmark | ALOHA bimanual cube-transfer (`gym-aloha`) |

> **Note.** The published checkpoint predates lerobot's
> `PolicyProcessorPipeline` migration and ships **without normalisation
> buffers**. See `tests/sim/test_aloha_bimanual_act_aloha.py` for the resulting
> numerical-contract caveats.

## Supported robots

| Robot | Embodiment tag | Status | Notes |
| --- | --- | --- | --- |
| ALOHA bimanual (Trossen) β€” `gym-aloha` MuJoCo | `aloha`, `lerobot` | βœ“ sim | 14-DoF (2 Γ— 7-DoF arms with parallel grippers) |

## Sensors required

| Key | Type | Resolution | Format |
| --- | --- | --- | --- |
| `observation.images.top` | RGB camera | 640 Γ— 480 | `float32` |

ACT for ALOHA cube-transfer ships with a single top-down RGB stream. No
wrist or third-person view.

## Manifest summary

| Field | Value |
| --- | --- |
| `name` | `OpenRAL/rskill-act-aloha` |
| `version` | `0.1.0` |
| `license` | `mit` |
| `role` | `s1` |
| `embodiment_tags` | `aloha`, `lerobot` |
| `runtime` / `quantization.dtype` | `pytorch` / `fp32` |
| `weights_uri` | `hf://lerobot/act_aloha_sim_transfer_cube_human` |
| `latency_budget.per_chunk_ms` | 25 ms (warm; bf16 autocast β‰ˆ 12 ms on RTX 4070 Laptop) |
| `latency_budget.warmup_ms` | 5 000 ms |
| `latency_budget.load_ms` | 10 000 ms |
| `commercial_use_allowed` | `true` |

Full schema: `openral_core.RSkillManifest` β€”
`python/core/src/openral_core/schemas.py`.

## Reproduction

```bash
git clone https://github.com/OpenRAL/openral && cd OpenRAL
just bootstrap && uv sync --all-packages --group sim

# End-to-end via the canonical SimEnvironment config:
just sim-act-aloha
# which runs:
#     openral sim run --config scenes/benchmarks/act_aloha_transfer_cube.yaml --save-video

# Sim test (real gym-aloha MuJoCo with contact dynamics):
uv run pytest tests/sim/test_aloha_bimanual_act_aloha.py -v -m sim
```

## License

This rSkill package (`rskill.yaml`, `README.md`) is **MIT** to match the
upstream weights. Commercial use is allowed
(`commercial_use_allowed: true`).

## See also

- [`robots/aloha_bimanual/README.md`](../../robots/aloha_bimanual/README.md) β€” RobotDescription manifest.
- [`scenes/benchmarks/act_aloha_transfer_cube.yaml`](../../scenes/benchmarks/act_aloha_transfer_cube.yaml) β€” paired SimEnvironment config.
- [`docs/reference/vla_compatibility.md`](../../docs/reference/vla_compatibility.md) β€” VLA Γ— Robot Γ— Sim matrix.