view article Article 混合专家模型(MoE)详解 +4 osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq • Dec 11, 2023 • 87
view article Article Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler +3 ariG23498, sayakpaul, sergiopaniego, ror, pcuenq • May 29 • 131
Running on CPU Upgrade Featured 3.22k The Smol Training Playbook 📚 3.22k The secrets to building world-class LLMs
cardiffnlp/twitter-roberta-base-sentiment-latest Text Classification • Updated Aug 4, 2025 • 3.59M • • 815
sentence-transformers/all-mpnet-base-v2 Sentence Similarity • 0.1B • Updated Aug 19, 2025 • 32.7M • • 1.32k
Running Featured 1.38k FineWeb: decanting the web for the finest text data at scale 🍷 1.38k Explore and download the FineWeb web‑scale text dataset