MixSD: Mixed Contextual Self-Distillation for Knowledge Injection Paper • 2605.16865 • Published 26 days ago • 8
LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents Paper • 2606.06087 • Published 7 days ago • 57 • 5
Socratic-SWE: Self-Evolving Coding Agents via Trace-Derived Agent Skills Paper • 2606.07412 • Published 6 days ago • 12 • 3
Socratic-SWE: Self-Evolving Coding Agents via Trace-Derived Agent Skills Paper • 2606.07412 • Published 6 days ago • 12
SoCRATES: Towards Reliable Automated Evaluation of Proactive LLM Mediation across Domains and Socio-cognitive Variations Paper • 2606.05563 • Published 7 days ago • 47
LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents Paper • 2606.06087 • Published 7 days ago • 57 • 5
SWE-Explore: Benchmarking How Coding Agents Explore Repositories Paper • 2606.07297 • Published 6 days ago • 106
LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents Paper • 2606.06087 • Published 7 days ago • 57
KVarN: Variance-Normalized KV-Cache Quantization Mitigates Error Accumulation in Reasoning Tasks Paper • 2606.03458 • Published 9 days ago • 59
NITP: Next Implicit Token Prediction for LLM Pre-training Paper • 2605.24956 • Published 18 days ago • 35
Where to Look: Can Foundation Models Reach a Target Viewpoint Through Active Exploration? Paper • 2606.01247 • Published 11 days ago • 30
X-Stream: Exploring MLLMs as Multiplexers for Multi-Stream Understanding Paper • 2606.02482 • Published 10 days ago • 35
Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding Paper • 2605.29707 • Published 14 days ago • 145
Ψ-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues Paper • 2606.02754 • Published 10 days ago • 13
Running on Zero Agents 1 Cosmos3-Super-Text2Image (NVFP4) 🌌 1 NVIDIA Cosmos3-Super 64B text-to-image, NVFP4 quantization