Chain of Evidence: Pixel-Level Visual Attribution for Iterative Retrieval-Augmented Generation Paper • 2605.01284 • Published 7 days ago • 2
How Fast Should a Model Commit to Supervision? Training Reasoning Models on the Tsallis Loss Continuum Paper • 2604.25907 • Published 11 days ago • 3
A Benchmark for Interactive World Models with a Unified Action Generation Framework Paper • 2605.03941 • Published 4 days ago • 4
Skills-Coach: A Self-Evolving Skill Optimizer via Training-Free GRPO Paper • 2604.27488 • Published 9 days ago • 5
SplAttN: Bridging 2D and 3D with Gaussian Soft Splatting and Attention for Point Cloud Completion Paper • 2605.01466 • Published 7 days ago • 5
ESARBench: A Benchmark for Agentic UAV Embodied Search and Rescue Paper • 2605.01371 • Published 7 days ago • 5
TCDA: Thread-Constrained Discourse-Aware Modeling for Conversational Sentiment Quadruple Analysis Paper • 2605.01717 • Published 6 days ago • 5
StateSMix: Online Lossless Compression via Mamba State Space Models and Sparse N-gram Context Mixing Paper • 2605.02904 • Published Apr 5 • 6
Reinforcement Learning for LLM-based Multi-Agent Systems through Orchestration Traces Paper • 2605.02801 • Published 5 days ago • 6
Workspace-Bench 1.0: Benchmarking AI Agents on Workspace Tasks with Large-Scale File Dependencies Paper • 2605.03596 • Published 4 days ago • 6
PatRe: A Full-Stage Office Action and Rebuttal Generation Benchmark for Patent Examination Paper • 2605.03571 • Published 4 days ago • 6
Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning Paper • 2605.02913 • Published Apr 8 • 7
SVGS: Enhancing Gaussian Splatting Using Primitives with Spatially Varying Colors Paper • 2411.18966 • Published 5 days ago • 8
SymptomAI: Towards a Conversational AI Agent for Everyday Symptom Assessment Paper • 2605.04012 • Published 4 days ago • 10
StableI2I: Spotting Unintended Changes in Image-to-Image Transition Paper • 2605.04453 • Published 3 days ago • 10
HeavySkill: Heavy Thinking as the Inner Skill in Agentic Harness Paper • 2605.02396 • Published 5 days ago • 19
Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL Paper • 2604.28123 • Published 8 days ago • 42
OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories Paper • 2605.04036 • Published 4 days ago • 61