QED Nano - a lm-provers Collection

lm-provers 's Collections

QED Nano

updated Mar 2

Artifacts for the QED Nano release

Running

Featured

79

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

📝

79

Who needs 1T parameters? Olympiad proofs with a 4B model
lm-provers/QED-Nano

Text Generation • 4B • Updated Mar 23 • 102 • • 88

Note A compact 4B model that can write Olympiad-level proofs through extended reasoning
lm-provers/QED-Nano-SFT

Text Generation • 4B • Updated Mar 9 • 34 • • 6

Note A variant of Qwen3-4B-Thinking-2507, fine-tuned on reasoning traces from DeepSeek-Math-V2
lm-provers/FineProofs-SFT

Viewer • Updated Feb 14 • 12.1k • 249 • 43

Note The dataset used to train QED-Nano-SFT
lm-provers/FineProofs-RL

Viewer • Updated Feb 14 • 5.23k • 147 • 7

Note The dataset used to train QED-Nano
lm-provers/IMOProofBench

Viewer • Updated Nov 5, 2025 • 60 • 1.31k • 2

Note IMOProofBench, a benchmark for mathematical theorem proving by Google we used in our evaluation.
lm-provers/ProofBench

Viewer • Updated Jan 9 • 290 • 47 • 3

Note Benchmark constructed by converting the original ProofBench benchmark to one for mathematical theorem proving.
lm-provers/matharena-gradingbench

Viewer • Updated Dec 4, 2025 • 438 • 27 • 2

Note A collection of all human annotations in MathArena. Used to decide on our RL grader.
lm-provers/olympiads-proof-graderbench

Viewer • Updated Dec 17, 2025 • 480 • 17 • 2

Note An in-distribution benchmark for grading proofs constructed from our training data. Used to decide on our RL grader.