burtenshaw HF Staff
Fix task_id to match benchmark eval.yaml
7524de5 verified