We Pitted the Cheapest TPU Against an NVIDIA L4. Here's What 6 Experiments Revealed. • lujangusface • 26 days ago • 1
1.37x Faster on Alibaba's 80B Code Model: EAGLE3 for Qwen3-Coder-Next • lujangusface • 28 days ago
1.7x Faster on a 218B Model: EAGLE3 Speculative Decoding for GLM-4.7 • lujangusface • 29 days ago • 1
2x Faster on a 229B MoE: EAGLE3 Speculative Decoding for MiniMax-M2.5 • lujangusface • Apr 9 • 3
Google Released Gemma-4 Four Days Ago. We Already Made It 1.72× Faster. • lujangusface • Apr 7 • 2