Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention Paper • 2605.22791 • Published 6 days ago • 27
view article Article vLLM V0 to V1: Correctness Before Corrections in RL ServiceNow-AI • 21 days ago • 10
G-Zero: Self-Play for Open-Ended Generation from Zero Data Paper • 2605.09959 • Published 16 days ago • 17
MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning Paper • 2605.07850 • Published 19 days ago • 18
ClawGym: A Scalable Framework for Building Effective Claw Agents Paper • 2604.26904 • Published 28 days ago • 50
Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows Paper • 2604.28139 • Published 27 days ago • 42
Better Models, Faster Training: Sigmoid Attention for single-cell Foundation Models Paper • 2604.27124 • Published 28 days ago • 11
From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills Paper • 2604.24026 • Published about 1 month ago • 21
T^2PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning Paper • 2605.02178 • Published 23 days ago • 10
AcademiClaw: When Students Set Challenges for AI Agents Paper • 2605.02661 • Published 23 days ago • 16
Hallucinations Undermine Trust; Metacognition is a Way Forward Paper • 2605.01428 • Published 25 days ago • 24
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published 24 days ago • 164
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published 23 days ago • 341
HeavySkill: Heavy Thinking as the Inner Skill in Agentic Harness Paper • 2605.02396 • Published 23 days ago • 24