DFlash: Block Diffusion for Flash Speculative Decoding Paper • 2602.06036 • Published Feb 5 • 83
ToolGym: an Open-world Tool-using Environment for Scalable Agent Testing and Data Curation Paper • 2601.06328 • Published Jan 9 • 1
Efficient Long-context Language Model Training by Core Attention Disaggregation Paper • 2510.18121 • Published Oct 20, 2025 • 124