Running 114 Unlocking On-Policy Distillation for Any Model Family 📝 114 Explore on-policy distillation visualization for any model
Running 82 Maintain the unmaintainable 📚 82 Explore the complex relationships between 400+ machine learning models
Running Agents 80 Transformers Timeline 🤗 80 Interactive timeline to explore the 🤗Transformers models
Running 3.9k The Ultra-Scale Playbook 🌌 3.9k The ultimate guide to training LLM on large GPU Clusters
Running 601 Scaling test-time compute 📈 601 Boost LLM answers with flexible test‑time search strategies