Aletheia Datasets - an Aletheia-Bench Collection


Updated Jan 8

The datasets used in the Aletheia paper

Aletheia-Bench/Aletheia-Train

Viewer • Updated Jan 8 • 50k • 58

Note: The training dataset used for the GRPO and RAFT models

Aletheia-Bench/Aletheia-DPO

Viewer • Updated Jan 8 • 50k • 38

Note: A companion dataset to Aletheia-Train containing preference pairs for contrastive learning

Aletheia-Bench/Aletheia-Heldout

Viewer • Updated Jan 8 • 33.3k • 21

Note: A fully in-distribution test set for our models

Aletheia-Bench/Aletheia-Strong

Viewer • Updated Jan 8 • 57.3k • 46

Note: An out-of-distribution test set with code generated by stronger models

Aletheia-Bench/Aletheia-Hard

Viewer • Updated Jan 8 • 18k • 92

Note: An out-of-distribution test set with more difficult code comparisons

Aletheia-Bench/Aletheia-Adv

Viewer • Updated Jan 8 • 18k • 45

Note: An out-of-distribution test set modified to exploit LLM biases and test verifier robustness
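
The repositories listed above can presumably be pulled with the Hugging Face `datasets` library. The following is a minimal sketch, not an official loader: the short names in `ALETHEIA_DATASETS` and the helper `load_aletheia` are hypothetical, and the split and column names of each repository are not stated on this page, so the helper loads whatever splits the repository defines.

```python
# Repository IDs exactly as listed in this collection.
# The short keys ("train", "dpo", ...) are illustrative names, not part of the source.
ALETHEIA_DATASETS = {
    "train": "Aletheia-Bench/Aletheia-Train",      # training data for the GRPO and RAFT models
    "dpo": "Aletheia-Bench/Aletheia-DPO",          # preference pairs
    "heldout": "Aletheia-Bench/Aletheia-Heldout",  # in-distribution test set
    "strong": "Aletheia-Bench/Aletheia-Strong",    # OOD: code from stronger models
    "hard": "Aletheia-Bench/Aletheia-Hard",        # OOD: more difficult comparisons
    "adv": "Aletheia-Bench/Aletheia-Adv",          # OOD: modified to exploit LLM biases
}


def load_aletheia(name):
    """Load one dataset from the collection by short name (hypothetical helper).

    Assumes the `datasets` package is installed and the repository is
    publicly accessible. No split is requested, so the result is a
    DatasetDict keyed by whatever splits the repository provides.
    """
    from datasets import load_dataset  # imported lazily so the mapping works without it

    return load_dataset(ALETHEIA_DATASETS[name])
```

Calling `load_aletheia("heldout")` would download the in-distribution test set; inspecting the returned object is the safest way to discover its actual splits and columns before writing evaluation code.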