Test-Time Strategies for More Efficient and Accurate Agentic RAG Paper • 2603.12396 • Published 7 days ago • 2
MEMO: Memory-Augmented Model Context Optimization for Robust Multi-Turn Multi-Agent LLM Games Paper • 2603.09022 • Published 10 days ago • 21 • 2
FinToolBench: Evaluating LLM Agents for Real-World Financial Tool Use Paper • 2603.08262 • Published 10 days ago • 32 • 2
M^3: Dense Matching Meets Multi-View Foundation Models for Monocular Gaussian Splatting SLAM Paper • 2603.16844 • Published 2 days ago • 9 • 2
V-Co: A Closer Look at Visual Representation Alignment via Co-Denoising Paper • 2603.16792 • Published 2 days ago • 3 • 2
I Know What I Don't Know: Latent Posterior Factor Models for Multi-Evidence Probabilistic Reasoning Paper • 2603.15670 • Published 6 days ago • 1 • 2
HistoAtlas: A Pan-Cancer Morphology Atlas Linking Histomics to Molecular Programs and Clinical Outcomes Paper • 2603.16587 • Published 2 days ago • 2
Recursive Language Models Meet Uncertainty: The Surprising Effectiveness of Self-Reflective Program Search for Long Context Paper • 2603.15653 • Published 13 days ago • 4 • 2
SparkVSR: Interactive Video Super-Resolution via Sparse Keyframe Propagation Paper • 2603.16864 • Published 2 days ago • 14 • 2
ARISE: Agent Reasoning with Intrinsic Skill Evolution in Hierarchical Reinforcement Learning Paper • 2603.16060 • Published 3 days ago • 2
Reliable Reasoning in SVG-LLMs via Multi-Task Multi-Reward Reinforcement Learning Paper • 2603.16189 • Published 3 days ago • 9 • 2
ECG-Reasoning-Benchmark: A Benchmark for Evaluating Clinical Reasoning Capabilities in ECG Interpretation Paper • 2603.14326 • Published 4 days ago • 1 • 1
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence Paper • 2603.13398 • Published 8 days ago • 135 • 4
VAREX: A Benchmark for Multi-Modal Structured Extraction from Documents Paper • 2603.15118 • Published 3 days ago • 2