Article: Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models • imargulis, ofirzaf, sguskin, guybd, pcuenq • Sep 29, 2025 • 25
Article: Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs • wenhuach, Haihao, weiweiz1, n1ck-guo, isaacmac, kding1, IlyasMoutawwakil, marcsun13, medmekk • Apr 29, 2025 • 44
The Ultra-Scale Playbook 🌌 (Running): The ultimate guide to training LLMs on large GPU clusters • 3.83k
Article: Build AI on premise with Dell Enterprise Hub • jeffboudier, philschmid, balaatdell, ianr007 • May 21, 2024 • 27
Article: Benchmarking Language Model Performance on 5th Gen Xeon at GCP • MatrixYao, kding1, IlyasMoutawwakil • Dec 17, 2024 • 7
Article: Open-R1: a fully open reproduction of DeepSeek-R1 • eliebak, lvwerra, lewtun • Jan 28, 2025 • 888
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22, 2025 • 449
Article: Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference • mfuntowicz, hlarcher • Jan 16, 2025 • 76