furiosa-ai/Llama-3.3-70B-Instruct
Text Generation • Updated • 572 • 1
None defined yet.
EfficientRollout: System-Aware Self-Speculative Decoding for RL Rollouts
ParallelBench: Understanding the Trade-offs of Parallel Decoding in Diffusion LLMs