Running Featured 18 ColBERT Tool Selection 🧭 18 Send a request, the retriever pre-selects the right tools
Running Featured 88 Distilling 100B+ Models 40x Faster with TRL 📝 88 TRL distillation for 100B+ teachers, 40x faster
Running Featured 74 QED-Nano: Teaching a Tiny Model to Prove Hard Theorems 📝 74 Who needs 1T parameters? Olympiad proofs with a 4B model