QED-Nano: Teaching a Tiny Model to Prove Hard Theorems
Who needs 1T parameters? Olympiad proofs with a 4B model
Explore the complex relationships between 400+ machine learning models
A new open-source dataset for training VLMs
Evaluate multilingual models using FineTasks
Visualize synthetic‑data experiments as an interactive bookshelf
The ultimate guide to training LLMs on large GPU clusters
Generate speech from text using a reference voice
Boost LLM answers with flexible test‑time search strategies
Explore and download the FineWeb web‑scale text dataset