Running on CPU Upgrade • Featured • 3.08k • The Smol Training Playbook • The secrets to building world-class LLMs
luhua/chinese_pretrain_mrc_roberta_wwm_ext_large Question Answering • Updated Jun 12, 2021 • 330 • 86
nvidia/OpenMath-Nemotron-14B-Kaggle Text Generation • 15B • Updated May 29, 2025 • 280 • 20
Congliu/Chinese-DeepSeek-R1-Distill-data-110k Viewer • Updated Feb 21, 2025 • 110k • 1.09k • 736
Running • 595 • Scaling test-time compute • Run advanced search strategies to boost LLM problem solving
Running on CPU Upgrade • Featured • 1.3k • Open ASR Leaderboard • Explore speech model benchmarks and request new evaluations
unsloth/Llama-3.3-70B-Instruct-bnb-4bit Text Generation • 71B • Updated Nov 25, 2025 • 10.5k • 52