Update README with action space documentation and delta_legacy usage instructions

126fc85 verified 2 months ago

2.88 kB

	---
	license: apache-2.0
	---

	# MIP Checkpoints

	Pre-trained checkpoints for the [MIP (Minimum Iterative Policy)](https://github.com/simchowitzlabpublic/much-ado-fresh) framework.

	## Repository Structure

	```
	robomimic/
	{task}_{env_type}_{obs_type}/
	delta_legacy/ # Original checkpoints (rot6d repr + delta controller)
	abs/ # Absolute action space checkpoints
	delta/ # Delta action space checkpoints (7D, no rot6d)
	rel/ # Relative action space checkpoints
	pusht/ # PushT environment checkpoints
	kitchen/ # Kitchen environment checkpoints
	```

	## Robomimic Action Spaces

	\| Action Space \| Config Suffix \| `abs_action` \| `action_type` \| Dataset \| `act_dim` (single/dual) \|
	\|---\|---\|---\|---\|---\|---\|
	\| delta_legacy \| `_delta_legacy` \| `true` \| `delta` \| `low_dim.hdf5` \| 10 / 20 \|
	\| absolute \| `_abs` \| `true` \| `absolute` \| `low_dim_abs.hdf5` \| 10 / 20 \|
	\| delta \| `_delta` \| `false` \| `delta` \| `low_dim.hdf5` \| 7 / 14 \|
	\| relative \| `_rel` \| `true` \| `relative` \| `low_dim_abs.hdf5` \| 10 / 20 \|

	> Important: The majority of released robomimic checkpoints (under `delta_legacy/`) were trained
	> with the delta_legacy action space. You must use the corresponding `_delta_legacy` task
	> config to evaluate them correctly. Using the default config (which uses absolute actions) will
	> result in 0% success rate due to normalizer and controller mismatches.

	## Quick Start: Evaluating a Checkpoint

	```bash
	# Download and evaluate a delta_legacy checkpoint
	uv run examples/train_robomimic.py \
	mode=eval \
	task=lift_ph_state_delta_legacy \
	network=chiunet \
	optimization.loss_type=mip \
	optimization.model_path="path/to/lift_ph_state_mip_chiunet_256_seed3_success100.pt"
	```

	### Available Task Configs

	Each robomimic task has configs for all four action spaces:

	- `lift_ph_state_delta_legacy`, `lift_ph_state_abs`, `lift_ph_state_delta`, `lift_ph_state_rel`
	- `can_ph_state_delta_legacy`, `can_ph_state_abs`, `can_ph_state_delta`, `can_ph_state_rel`
	- `square_ph_state_delta_legacy`, `square_ph_state_abs`, `square_ph_state_delta`, `square_ph_state_rel`
	- `tool_hang_ph_state_delta_legacy`, `tool_hang_ph_state_abs`, `tool_hang_ph_state_delta`, `tool_hang_ph_state_rel`
	- `transport_ph_state_delta_legacy`, `transport_ph_state_abs`, `transport_ph_state_delta`, `transport_ph_state_rel`

	The `_mh` (multi-human) variants are also available (e.g., `lift_mh_state_delta_legacy`).

	## Checkpoint Naming Convention

	```
	{loss_type}_{network}_{dim}_seed{N}_success{N}.pt
	```

	- loss_type: `mip`, `flow`, `regression`, `psd`, `lsd`, `straight_flow`
	- network: `chiunet`, `chitransformer`, `mlp`, `sudeepdit`, `rnn`
	- dim: embedding dimension (e.g., `256`, `384`, `512`)
	- seed: random seed
	- success: best evaluation success rate (%)