lsnu
/

VLAarchTestsBench

Model card Files Files and versions

VLAarchTestsBench / docs /ENVIRONMENT_NOTES.md

lsnu's picture

Add files using upload-large-folder tool

a1fc554 verified about 2 months ago

|

history blame contribute delete

2.91 kB

	# Environment Notes

	This benchmark bundle was validated on a Linux GPU machine with:

	- CoppeliaSim v4.1.0 available at `COPPELIASIM_ROOT`
	- `xvfb` and `xauth` installed
	- Qt xcb runtime libraries installed:
	- `libxrender1`
	- `libxkbcommon0`
	- `libxkbcommon-x11-0`
	- `libxcb-icccm4`
	- `libxcb-image0`
	- `libxcb-keysyms1`
	- `libxcb-randr0`
	- `libxcb-render-util0`
	- `libxcb-shape0`
	- `libxcb-shm0`
	- `libxcb-sync1`
	- `libxcb-xfixes0`
	- `libxcb-xinerama0`
	- `libxcb-xkb1`

	The successful RLBench2 oven evaluation was run with:

	```bash
	xvfb-run -a -s "-screen 0 1400x900x24" python \
	online_evaluation_rlbench/evaluate_policy.py \
	--checkpoint models/3dfa_peract2/3dfa_peract2.pth \
	--task bimanual_take_tray_out_of_oven \
	--data_dir data/3dfa/peract2_test \
	--dataset Peract2_3dfront_3dwrist \
	--image_size 256,256 \
	--model_type denoise3d \
	--bimanual true \
	--prediction_len 1 \
	--backbone clip \
	--fps_subsampling_factor 4 \
	--embedding_dim 120 \
	--num_attn_heads 8 \
	--num_vis_instr_attn_layers 3 \
	--num_history 3 \
	--num_shared_attn_layers 4 \
	--relative_action false \
	--rotation_format quat_xyzw \
	--denoise_timesteps 5 \
	--denoise_model rectified_flow
	```

	The `take shoes out of box` validation path was run from the bundled
	PointFlowMatch source on a Blackwell GPU machine after upgrading the RLBench
	environment to Torch `2.11.0+cu128` / torchvision `0.26.0+cu128` /
	torchaudio `2.11.0+cu128`. The bundled PointFlowMatch tree also contains two
	local compatibility fixes required for this workspace:

	- `pfp/envs/rlbench_env.py`
	- adapts PointFlowMatch to the local RLBench camera naming and observation API
	- broadens motion-planning failure recovery to handle simulator-side runtime failures
	- `pfp/policy/fm_policy.py`
	- adds an inference-only fallback when legacy `composer` cannot import on modern Torch
	- loads checkpoints with `weights_only=False` for PyTorch 2.6+

	The shoes evaluation command used here was:

	```bash
	scripts/run_pointflowmatch_take_shoes_out_of_box.sh 10 50
	```

	That wrapper expands to the equivalent raw command:

	```bash
	export PYTHONPATH=third_party/diffusion_policy:third_party/PointFlowMatch:${PYTHONPATH:-}
	export COPPELIASIM_ROOT=/path/to/CoppeliaSim
	export LD_LIBRARY_PATH=${LD_LIBRARY_PATH:-}:$COPPELIASIM_ROOT
	export QT_QPA_PLATFORM_PLUGIN_PATH=$COPPELIASIM_ROOT
	xvfb-run -a -s "-screen 0 1400x900x24" python \
	third_party/PointFlowMatch/scripts/evaluate.py \
	log_wandb=False \
	env_runner.env_config.vis=False \
	env_runner.num_episodes=10 \
	env_runner.max_episode_length=200 \
	policy.ckpt_name=1717447341-indigo-quokka/1717447341-indigo-quokka \
	policy.num_k_infer=50
	```

	Result note for shoes:

	- `reports/pointflowmatch_take_shoes_out_of_box_ep10_k50_gpu/summary.json`
	records a verified non-zero result before a later RLBench/PyRep crash in the
	same longer rollout.