RoboLab: A High-Fidelity Simulation Benchmark for Analysis of Task Generalist Policies Paper • 2604.09860 • Published 23 days ago • 8
STRIDE Applications Collection Benchmarks, proxy corpora, contamination manifests, and checkpoints for STRIDE data-attribution and benchmark-leakage experiments. • 2 items • Updated 12 days ago