view article Article Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding nvidia • Mar 19 • 47
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 ggerganov, ngxson, allozaur, lysandre, victor, julien-c • Feb 20 • 505