Getting More from Your Test-Time Compute Budget with Portfolio Beam Search (17 days ago)
Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models (Sep 29, 2025)
Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding (Jan 30, 2024)