MindZero: Learning Online Mental Reasoning With Zero Annotations Paper • 2606.00240 • Published 6 days ago • 2
MindZero: Learning Online Mental Reasoning With Zero Annotations Paper • 2606.00240 • Published 6 days ago • 2
MARQUIS: A Three-Stage Pipeline for Video Retrieval-Augmented Generation Paper • 2605.17640 • Published 18 days ago
Principled Context Engineering for RAG: Statistical Guarantees via Conformal Prediction Paper • 2511.17908 • Published Jan 19
The First Drop of Ink: Nonlinear Impact of Misleading Information in Long-Context Reasoning Paper • 2605.10828 • Published 24 days ago • 2
ThoughtTrace: Understanding User Thoughts in Real-World LLM Interactions Paper • 2605.20087 • Published 16 days ago • 18
ThoughtTrace: Understanding User Thoughts in Real-World LLM Interactions Paper • 2605.20087 • Published 16 days ago • 18
ORBIT: Preserving Foundational Language Capabilities in GenRetrieval via Origin-Regulated Merging Paper • 2605.12419 • Published 23 days ago • 1
ORBIT: Preserving Foundational Language Capabilities in GenRetrieval via Origin-Regulated Merging Paper • 2605.12419 • Published 23 days ago • 1
What Is Seen Cannot Be Unseen: The Disruptive Effect of Knowledge Conflict on Large Language Models Paper • 2506.06485 • Published Jun 6, 2025 • 5
What do Language Models Learn and When? The Implicit Curriculum Hypothesis Paper • 2604.08510 • Published Apr 9 • 4
What do Language Models Learn and When? The Implicit Curriculum Hypothesis Paper • 2604.08510 • Published Apr 9 • 4
Jailbreak Distillation: Renewable Safety Benchmarking Paper • 2505.22037 • Published May 28, 2025 • 1
The Alignment Waltz: Jointly Training Agents to Collaborate for Safety Paper • 2510.08240 • Published Oct 9, 2025 • 41
Beyond Reasoning Gains: Mitigating General Capabilities Forgetting in Large Reasoning Models Paper • 2510.21978 • Published Oct 24, 2025 • 16
Reasoning over mathematical objects: on-policy reward modeling and test time aggregation Paper • 2603.18886 • Published Mar 19 • 6