view article Article Speculative Decoding in Practice: How EAGLE3 Makes LLMs Faster Without Changing Their Outputs Apr 3 โข 8
view article Article 2x Faster on a 229B MoE: EAGLE3 Speculative Decoding for MiniMax-M2.5 27 days ago โข 3
cyankiwi/Qwen3-30B-A3B-Instruct-2507-AWQ-4bit Text Generation โข 5B โข Updated about 13 hours ago โข 54.8k โข 31
cyankiwi/Qwen3-Next-80B-A3B-Thinking-AWQ-8bit Text Generation โข 84B โข Updated about 13 hours ago โข 37 โข 5