FlashMemory-DeepSeek-V4: Lightning Index Ultra-Long Context via Lookahead Sparse Attention Paper • 2606.09079 • Published 22 days ago • 65
Autodata: An agentic data scientist to create high quality synthetic data Paper • 2606.25996 • Published 6 days ago • 16
FLAT: Feedforward Latent Triangle Splatting for Geometrically Accurate Scene Generation Paper • 2606.24876 • Published 7 days ago • 22
Epicure: Navigating the Emergent Geometry of Food Ingredient Embeddings Paper • 2605.22391 • Published May 21 • 41
Your Agents Are Aging Too: Agent Lifespan Engineering for Deployed Systems Paper • 2605.26302 • Published May 25 • 33
Compiling Agentic Workflows into LLM Weights: Near-Frontier Quality at Two Orders of Magnitude Less Cost Paper • 2605.22502 • Published May 21 • 1
OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization Paper • 2605.17757 • Published May 18 • 66
Negligible in Size, Significant in Effect: On Scale Vectors in Large Language Models Paper • 2605.26895 • Published May 26 • 22
Less is More: Early Stopping Rollout for On-Policy Distillation Paper • 2605.27028 • Published May 26 • 15
Share More, Search Less: Collaborative Parallel Thinking for Efficient Test-Time Scaling Paper • 2605.27030 • Published May 26 • 32
STALE: Can LLM Agents Know When Their Memories Are No Longer Valid? Paper • 2605.06527 • Published May 7 • 47
Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation Paper • 2605.11739 • Published May 13 • 60
MixSD: Mixed Contextual Self-Distillation for Knowledge Injection Paper • 2605.16865 • Published May 16 • 10