Commit History

Fix task_id to diamond (matching benchmark eval.yaml)
2bc02b7
verified

burtenshaw HF Staff committed on

Add GPQA evaluation result
29b5ff8
verified

burtenshaw HF Staff committed on

Update README.md
8dc4e68
verified

mlabonne committed on

Update README.md
47c527b
verified

mlabonne committed on

Update README.md
c20ca41
verified

mlabonne committed on

Update README.md
43f1e80
verified

mlabonne committed on

Update README.md
286966c
verified

ykhrustalev committed on

Upload README.md with huggingface_hub
4393ab8
verified

ykhrustalev committed on

Update README.md
a4ba0a6
verified

mlabonne committed on

Add files using upload-large-folder tool
205939a
verified

mlabonne committed on

initial commit
f3a7622
verified

mlabonne committed on