Running 3.9k The Ultra-Scale Playbook 🌌 3.9k The ultimate guide to training LLM on large GPU Clusters
Apriel-H1: Towards Efficient Enterprise Reasoning Models Paper • 2511.02651 • Published Nov 4, 2025
DiffuMamba: High-Throughput Diffusion LMs with Mamba Backbone Paper • 2511.15927 • Published Nov 19, 2025
EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings Paper • 2603.13594 • Published Mar 13 • 149
view article Article Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models ServiceNow-AI • Nov 19, 2025 • 34
view article Article Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models ServiceNow-AI • Nov 19, 2025 • 34
Challenging Common Assumptions about Catastrophic Forgetting Paper • 2207.04543 • Published Jul 10, 2022
Towards Modular LLMs by Building and Reusing a Library of LoRAs Paper • 2405.11157 • Published May 18, 2024 • 31
ServiceNow-AI/Apriel-H1-15b-Thinker-SFT Text Generation • 16B • Updated Nov 3, 2025 • 31 • 29