nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 Text Generation • 124B • Updated 4 days ago • 103k • 283
Running Featured 68 QED-Nano: Teaching a Tiny Model to Prove Hard Theorems 📝 68 Who needs 1T parameters? Olympiad proofs with a 4B model