Add GSM8K eval result (self-reported, symbolic verifier)

#1
by codelion - opened
No description provided.

Superseded by result committed directly to main (with date field), matching the working .eval_results convention.

codelion changed pull request status to closed

Sign up or log in to comment