Environment Notes
This benchmark bundle was validated on a Linux GPU machine with:
- CoppeliaSim v4.1.0 available at COPPELIASIM_ROOT
- xvfb and xauth installed
- Qt xcb runtime libraries installed:
  - libxrender1
  - libxkbcommon0
  - libxkbcommon-x11-0
  - libxcb-icccm4
  - libxcb-image0
  - libxcb-keysyms1
  - libxcb-randr0
  - libxcb-render-util0
  - libxcb-shape0
  - libxcb-shm0
  - libxcb-sync1
  - libxcb-xfixes0
  - libxcb-xinerama0
  - libxcb-xkb1
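A quick pre-flight check can catch a missing prerequisite before a long evaluation starts. The following is a minimal sketch, not part of the bundle: the function name check_prerequisites is hypothetical, and it only verifies that COPPELIASIM_ROOT points at a directory and that xvfb-run and xauth are on PATH. It does not check the CoppeliaSim version or the Qt xcb library packages.

```python
import os
import shutil


def check_prerequisites(env=None):
    """Return a list of human-readable problems; an empty list means all checks passed."""
    env = os.environ if env is None else env
    problems = []

    # COPPELIASIM_ROOT must be set and point at an existing directory.
    root = env.get("COPPELIASIM_ROOT")
    if not root:
        problems.append("COPPELIASIM_ROOT is not set")
    elif not os.path.isdir(root):
        problems.append(f"COPPELIASIM_ROOT is not a directory: {root}")

    # The evaluation commands call xvfb-run, which in turn relies on xauth.
    for tool in ("xvfb-run", "xauth"):
        if shutil.which(tool, path=env.get("PATH")) is None:
            problems.append(f"{tool} not found on PATH")

    return problems


if __name__ == "__main__":
    for problem in check_prerequisites():
        print("MISSING:", problem)
```

Running the script prints one line per missing prerequisite and prints nothing when these checks pass.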
The successful RLBench2 oven evaluation was run with:
xvfb-run -a -s "-screen 0 1400x900x24" python \
online_evaluation_rlbench/evaluate_policy.py \
--checkpoint models/3dfa_peract2/3dfa_peract2.pth \
--task bimanual_take_tray_out_of_oven \
--data_dir data/3dfa/peract2_test \
--dataset Peract2_3dfront_3dwrist \
--image_size 256,256 \
--model_type denoise3d \
--bimanual true \
--prediction_len 1 \
--backbone clip \
--fps_subsampling_factor 4 \
--embedding_dim 120 \
--num_attn_heads 8 \
--num_vis_instr_attn_layers 3 \
--num_history 3 \
--num_shared_attn_layers 4 \
--relative_action false \
--rotation_format quat_xyzw \
--denoise_timesteps 5 \
--denoise_model rectified_flow
The take-shoes-out-of-box validation path was run from the bundled
PointFlowMatch source on a Blackwell GPU machine after upgrading the RLBench
environment to Torch 2.11.0+cu128 / torchvision 0.26.0+cu128 /
torchaudio 2.11.0+cu128. The bundled PointFlowMatch tree also contains local
compatibility fixes in two files, both required for this workspace:
- pfp/envs/rlbench_env.py
  - adapts PointFlowMatch to the local RLBench camera naming and observation API
  - broadens motion-planning failure recovery to handle simulator-side runtime failures
- pfp/policy/fm_policy.py
  - adds an inference-only fallback when legacy composer cannot import on modern Torch
  - loads checkpoints with weights_only=False for PyTorch 2.6+
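The two fm_policy.py fixes follow a common pattern, sketched below. This is an illustration only and not the actual patch: the helper names optional_import and load_checkpoint are hypothetical. The one library fact it relies on is that PyTorch 2.6 changed the default of the weights_only argument of torch.load to True, so checkpoints that pickle arbitrary Python objects must be loaded with weights_only=False, which should only be done for checkpoints from a trusted source.

```python
import importlib


def optional_import(module_name):
    """Return the imported module, or None if it cannot be imported."""
    try:
        return importlib.import_module(module_name)
    except Exception:
        # Broader than ImportError on purpose: a legacy package may fail
        # in other ways while importing against a newer Torch.
        return None


def load_checkpoint(path):
    """Load a full pickled checkpoint onto the CPU (trusted files only)."""
    import torch  # imported lazily so optional_import works without torch

    return torch.load(path, map_location="cpu", weights_only=False)


if __name__ == "__main__":
    # Assumed usage: training utilities need composer, inference does not.
    composer = optional_import("composer")
    print("composer available:", composer is not None)
```

With this shape, inference code checks whether the optional module is None and skips the training-only paths instead of failing at import time.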
The shoes evaluation command used here was:
scripts/run_pointflowmatch_take_shoes_out_of_box.sh 10 50
That wrapper expands to the equivalent raw command:
export PYTHONPATH=third_party/diffusion_policy:third_party/PointFlowMatch:${PYTHONPATH:-}
export COPPELIASIM_ROOT=/path/to/CoppeliaSim
export LD_LIBRARY_PATH=${LD_LIBRARY_PATH:-}:$COPPELIASIM_ROOT
export QT_QPA_PLATFORM_PLUGIN_PATH=$COPPELIASIM_ROOT
xvfb-run -a -s "-screen 0 1400x900x24" python \
third_party/PointFlowMatch/scripts/evaluate.py \
log_wandb=False \
env_runner.env_config.vis=False \
env_runner.num_episodes=10 \
env_runner.max_episode_length=200 \
policy.ckpt_name=1717447341-indigo-quokka/1717447341-indigo-quokka \
policy.num_k_infer=50
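To vary the episode count or the number of inference steps without editing the wrapper, the raw command can be assembled programmatically. The sketch below is hypothetical and not part of the bundle. It assumes that the wrapper's two positional arguments (10 and 50) map to env_runner.num_episodes and policy.num_k_infer, which is inferred from the expanded command rather than stated by the script itself. The environment variable exports are not reproduced and still need to be set beforehand.

```python
import shlex


def build_eval_command(num_episodes, num_k_infer):
    """Assemble the evaluate.py invocation quoted above as an argv list."""
    return [
        "xvfb-run", "-a", "-s", "-screen 0 1400x900x24",
        "python", "third_party/PointFlowMatch/scripts/evaluate.py",
        "log_wandb=False",
        "env_runner.env_config.vis=False",
        f"env_runner.num_episodes={num_episodes}",
        "env_runner.max_episode_length=200",
        "policy.ckpt_name=1717447341-indigo-quokka/1717447341-indigo-quokka",
        f"policy.num_k_infer={num_k_infer}",
    ]


if __name__ == "__main__":
    # Print a shell-quoted command line for inspection or copy-paste.
    print(shlex.join(build_eval_command(10, 50)))
```

The returned list can be passed directly to subprocess.run, which avoids shell quoting issues with the -screen argument.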
Result note for shoes:
reports/pointflowmatch_take_shoes_out_of_box_ep10_k50_gpu/summary.json records a verified non-zero result before a later RLBench/PyRep crash in the same longer rollout.