EmbodiedEvalKit

IffYuan 's Collections

updated 18 days ago

A unified evaluation framework that simplifies embodied AI benchmarking with clean interfaces, supporting 25+ benchmarks and diverse model backends.

Upvote

IffYuan/VABench-P

Viewer • Updated Dec 25, 2025 • 300 • 90
IffYuan/vabench-v

Viewer • Updated Jan 9 • 300 • 35
IffYuan/vsi-bench

Viewer • Updated Dec 27, 2025 • 5.13k • 62
IffYuan/PointBench

Viewer • Updated Dec 28, 2025 • 966 • 69
IffYuan/pixmo-points-eval

Viewer • Updated Jan 2 • 330 • 27
FlagEval/ERQA

Viewer • Updated Apr 22, 2025 • 400 • 3.34k • 5
FlagEval/ERQAPlus

Viewer • Updated 4 days ago • 800 • 60 • 1
nyu-visionx/CV-Bench

Viewer • Updated Jul 20, 2025 • 5.28k • 5.93k • 47
FlagEval/EmbSpatial-Bench

Viewer • Updated Apr 21, 2025 • 3.64k • 2.58k • 5
FlagEval/SAT

Viewer • Updated May 6, 2025 • 150 • 29
chanhee-luke/RoboSpatial-Home

Viewer • Updated May 20 • 350 • 591 • 24
IffYuan/RoboVQA

Viewer • Updated Dec 26, 2025 • 1.92k • 41
FlagEval/Where2Place

Viewer • Updated May 29, 2025 • 100 • 246
BAAI/RefSpatial-Bench

Viewer • Updated Oct 23, 2025 • 277 • 3.42k • 17
IffYuan/Part-Affordance-2K

Viewer • Updated Jun 4, 2025 • 2k • 51 • 2
IffYuan/Roborefit

Viewer • Updated Dec 25, 2025 • 2k • 44
Zray26/roboafford-eval

Viewer • Updated Sep 1, 2025 • 338 • 29 • 1
IffYuan/open-eqa

Viewer • Updated Dec 27, 2025 • 1.64k • 46
IffYuan/PIO-Bench

Viewer • Updated Dec 29, 2025 • 500 • 22
BLINK-Benchmark/BLINK

Viewer • Updated Sep 3, 2025 • 3.81k • 12.8k • 48
IffYuan/COSMOS

Viewer • Updated May 11 • 510 • 42
IffYuan/sharerobot_trajectory

Viewer • Updated Feb 7 • 200 • 11
MINT-SJTU/RoboFAC-dataset

Updated Apr 28 • 4.47k • 6
VLABench/vlm_evaluation_v1.0

Viewer • Updated Mar 28, 2025 • 7.1k • 925
IffYuan/PIO-S3

Viewer • Updated Jan 9 • 100 • 20

Upvote

Collection guide
Browse collections