RedHatAI/DeepSeek-R1-Distill-Llama-70B-FP8-dynamic
Text Generation • 71B • 5.63k • 10
OpenSource and AI
SNLP: Layer-Parallel Inference via Structured Newton Corrections
S2D2: Fast Decoding for Diffusion LLMs via Training-Free Self-Speculation