paulpacaud/rlbenchfail_train_dataset
Viewer
•
Updated
•
54.5k
•
54
VQA images-pair for fine-grained, multi-class failure detection annotated with multi-step reasoning traces. Features a simulated Franka Emika Panda.