Running Featured 69 QED-Nano: Teaching a Tiny Model to Prove Hard Theorems π 69 Who needs 1T parameters? Olympiad proofs with a 4B model
view article Article How I contributed a new model to the Transformers library using Codex 5 days ago β’ 39