FastKernels: Benchmarking GPU Kernel Generation in Production Paper • 2605.23215 • Published 7 days ago • 5
SuffixDecoding: Extreme Speculative Decoding for Emerging AI Applications Paper • 2411.04975 • Published Nov 7, 2024 • 1