Running 116 Unfolding Robotics: Open-Source Shirt Folding from Data to Deployment π€ 116 Explore the open-source guide to robot shirt folding
Running on CPU Upgrade 262 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens π 262 Visualize syntheticβdata experiments as an interactive bookshelf
Running Featured 74 QED-Nano: Teaching a Tiny Model to Prove Hard Theorems π 74 Who needs 1T parameters? Olympiad proofs with a 4B model
Running 80 Bringing paper to life: A modern template for scientific writing π 80 Explore an interactive galaxy visualization of scientific article
Running 3.9k The Ultra-Scale Playbook π 3.9k The ultimate guide to training LLM on large GPU Clusters
Running 94 Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks π 94 Evaluate multilingual models using FineTasks
Running 134 TxT360: Trillion Extracted Text π 134 Explore the TxT360 LLM preβtraining dataset online