Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability Paper • 2601.18778 • Published 1 day ago • 25
Scientific Image Synthesis: Benchmarking, Methodologies, and Downstream Utility Paper • 2601.17027 • Published 11 days ago • 36
Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs Paper • 2601.17058 • Published 6 days ago • 132
TwinBrainVLA: Unleashing the Potential of Generalist VLMs for Embodied Tasks via Asymmetric Mixture-of-Transformers Paper • 2601.14133 • Published 8 days ago • 56
HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding Paper • 2601.14724 • Published 7 days ago • 71
EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience Paper • 2601.15876 • Published 6 days ago • 87
The AI Hippocampus: How Far are We From Human Memory? Paper • 2601.09113 • Published 14 days ago • 5
MAXS: Meta-Adaptive Exploration with LLM Agents Paper • 2601.09259 • Published 14 days ago • 94
Controlled Self-Evolution for Algorithmic Code Optimization Paper • 2601.07348 • Published 16 days ago • 112
KnowMe-Bench: Benchmarking Person Understanding for Lifelong Digital Companions Paper • 2601.04745 • Published 20 days ago • 56
EpiCaR: Knowing What You Don't Know Matters for Better Reasoning in LLMs Paper • 2601.06786 • Published 17 days ago • 6
MemGovern: Enhancing Code Agents through Learning from Governed Human Experiences Paper • 2601.06789 • Published 17 days ago • 77
ThinkRL-Edit: Thinking in Reinforcement Learning for Reasoning-Centric Image Editing Paper • 2601.03467 • Published 21 days ago • 7
EpiQAL: Benchmarking Large Language Models in Epidemiological Question Answering for Enhanced Alignment and Reasoning Paper • 2601.03471 • Published 21 days ago • 7
MDAgent2: Large Language Model for Code Generation and Knowledge Q&A in Molecular Dynamics Paper • 2601.02075 • Published 23 days ago • 8