Replace open/closed model distinction with lock emojis in tables
8a3a9eb
openhandsopenhandscommited on
Remove open/closed distinction from graph, use company logos as data points
b6ec318
openhandsopenhandscommited on
Add company logos to graphs and tables, label frontier points with model names
800e404
openhandsopenhandscommited on
Replace total_cost with cost_per_instance (average cost per instance)
b1f3e49
openhandsopenhandscommited on
fix: Column naming and incomplete entries toggle
4ab5f97
openhandsopenhandscommited on
feat: Update leaderboard calculations and add incomplete entries toggle
5998027
openhandsopenhandscommited on
Fix UI score formatting: do not coerce NaN to 0; rely on format_score_column to show 'Not Submitted'.\n\nCo-authored-by: openhands <openhands@all-hands.dev>
c68aa7d
openhandscommited on
Fix data plotting requirements and server port handling; ensure per-benchmark plots use correct agent column.\n\n- Respect HOST/PORT env for local runs\n- Use 'OpenHands Version' in plot requirements\n- Avoid plotting when use_plotly=False\n\nCo-authored-by: openhands <openhands@all-hands.dev>
fb3d0db
openhandscommited on
Remove unused AstaBench category files and update UI to OpenHands categories
6a0d1cb
openhandscommited on
Fix score calculation to match AstaBench methodology and update categories
e734bf6
openhandscommited on
Swap column order and fix duplicate column warnings