The Chain Holds, the Answer Folds: Trace-Answer Dissociation in Reasoning Models Under Adversarial Pressure Paper • 2605.29087 • Published 7 days ago • 1
Same Question, Different Source, Different Answer: Auditing Source-Dependence in Medical Multi-Source RAG Paper • 2605.29084 • Published 7 days ago • 1
Same Question, Different Source, Different Answer: Auditing Source-Dependence in Medical Multi-Source RAG Paper • 2605.29084 • Published 7 days ago • 1
The Chain Holds, the Answer Folds: Trace-Answer Dissociation in Reasoning Models Under Adversarial Pressure Paper • 2605.29087 • Published 7 days ago • 1
CONF-KV: Confidence-Aware KV Cache Eviction with Mixed-Precision Storage for Long-Horizon LLM Paper • 2605.24786 • Published 10 days ago • 6
PANDO: Efficient Multimodal AI Agents via Online Skill Distillation Paper • 2605.24785 • Published 8 days ago • 8
CONF-KV: Confidence-Aware KV Cache Eviction with Mixed-Precision Storage for Long-Horizon LLM Paper • 2605.24786 • Published 10 days ago • 6
PANDO: Efficient Multimodal AI Agents via Online Skill Distillation Paper • 2605.24785 • Published 8 days ago • 8
When Documents Disagree: Measuring Institutional Variation in Transplant Guidance with Retrieval-Augmented Language Models Paper • 2603.21460 • Published Mar 23 • 6
When Documents Disagree: Measuring Institutional Variation in Transplant Guidance with Retrieval-Augmented Language Models Paper • 2603.21460 • Published Mar 23 • 6
The Model Says Walk: How Surface Heuristics Override Implicit Constraints in LLM Reasoning Paper • 2603.29025 • Published Mar 30 • 13
The Model Says Walk: How Surface Heuristics Override Implicit Constraints in LLM Reasoning Paper • 2603.29025 • Published Mar 30 • 13
SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks Paper • 2602.12670 • Published Feb 13 • 62