deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B • Text Generation • 2B • Updated Feb 24, 2025 • 427k • 1.5k
The Ultra-Scale Playbook 🌌 • Running • 3.83k • The ultimate guide to training LLMs on large GPU clusters
oliverguhr/fullstop-punctuation-multilang-large • Token Classification • Updated Nov 16, 2023 • 768k • 176
FineWeb: decanting the web for the finest text data at scale 🍷 • Running • Featured • 1.33k • Explore and download the FineWeb web-text dataset
ControlNet V1.1 📉 • Build error • Agents • 1.19k • Generate edited images using edge, pose, and other guides