Commit History
Add verifyToken field to verify evaluation results are produced by Hugging Face's automatic model evaluator (#19) a2de279
Add evaluation results on the mathemakitten--winobias_antistereotype_test_v5 config and test split of mathemakitten/winobias_antistereotype_test_v5 (#18) 7efae61
Add evaluation results on the mathemakitten--winobias_antistereotype_test_v5 config and test split of mathemakitten/winobias_antistereotype_test_v5 (#17) d4491fe
Add evaluation results on the mathemakitten--winobias_antistereotype_test_v5 config and test split of mathemakitten/winobias_antistereotype_test_v5 (#16) 44a34de
Add evaluation results on the mathemakitten--winobias_antistereotype_test_cot_v3 config and test split of mathemakitten/winobias_antistereotype_test_cot_v3 (#13) 7e07887
Add evaluation results on the mathemakitten--winobias_antistereotype_test_cot_v1 config and test split of mathemakitten/winobias_antistereotype_test_cot_v1 (#12) f454423
Add evaluation results on the inverse-scaling--hindsight-neglect-10shot config and train split of inverse-scaling/hindsight-neglect-10shot (#6) 99c9b7f
Update README.md f031bd2
Update README.md eaa0bb2
Add evaluation results on the inverse-scaling--NeQA config and train split of inverse-scaling/NeQA (#3) 92250f6
Update README.md bdc8ddd
Add evaluation results on the inverse-scaling--41 config and train split of inverse-scaling/41 (#2) 7e749b5
cp opt-2.7b 2a83fcf
Michael Pieler commited on