DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22, 2025 • 454
view article Article Optimizing GLM4-MoE for Production: 65% Faster TTFT with SGLang novita • Jan 22 • 10