DFlash: Block Diffusion for Flash Speculative Decoding Paper โข 2602.06036 โข Published Feb 5 โข 83
ToolGym: an Open-world Tool-using Environment for Scalable Agent Testing and Data Curation Paper โข 2601.06328 โข Published Jan 9 โข 1
Efficient Long-context Language Model Training by Core Attention Disaggregation Paper โข 2510.18121 โข Published Oct 20, 2025 โข 124