Mnemis: Dual-Route Retrieval on Hierarchical Graphs for Long-Term LLM Memory Paper • 2602.15313 • Published 2 days ago • 2
Improving Data and Reward Design for Scientific Reasoning in Large Language Models Paper • 2602.08321 • Published 10 days ago • 40
DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle Paper • 2512.04324 • Published Dec 3, 2025 • 154
Recycling Pretrained Checkpoints: Orthogonal Growth of Mixture-of-Experts for Efficient Large Language Model Pre-Training Paper • 2510.08008 • Published Oct 9, 2025 • 6
TL;DR: Too Long, Do Re-weighting for Effcient LLM Reasoning Compression Paper • 2506.02678 • Published Jun 3, 2025 • 5
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models Paper • 2501.13629 • Published Jan 23, 2025 • 48