Google Released Gemma-4 Four Days Ago. We Already Made It 1.72× Faster. • lujangusface • Apr 7
Speculative Decoding in Practice: How EAGLE3 Makes LLMs Faster Without Changing Their Outputs • lujangusface • Apr 3