eval: add eval_results.json, model-index, fix evaluate.py NaN→null 4512ee5 verified RhodWeo commited on 23 days ago
docs: clean benchmark section + proper model-index (mean metrics, no per-dataset wall) 452b4a9 verified RhodWeo commited on 24 days ago
eval: add model-index eval results + benchmark section to README 45969f6 verified RhodWeo commited on 24 days ago