Configurable Clinical Information Extraction with Agentic RAG: What Works, What Breaks, and Why Paper • 2606.19602 • Published 12 days ago • 4
Forecasting Downstream Performance of LLMs With Proxy Metrics Paper • 2605.18607 • Published May 18 • 14
MINTEval: Evaluating Memory under Multi-Target Interference in Long-Horizon Agent Systems Paper • 2605.18565 • Published May 19 • 5
Sparse Autoencoders enable Robust and Interpretable Fine-tuning of CLIP models Paper • 2605.15961 • Published May 15 • 10
Multi-Objective and Mixed-Reward Reinforcement Learning via Reward-Decorrelated Policy Optimization Paper • 2605.13641 • Published May 13 • 51
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information Paper • 2605.11609 • Published May 12 • 196
Q-RAG: Long Context Multi-step Retrieval via Value-based Embedder Training Paper • 2511.07328 • Published May 4 • 16
EDU-CIRCUIT-HW: Evaluating Multimodal Large Language Models on Real-World University-Level STEM Student Handwritten Solutions Paper • 2602.00095 • Published Apr 30 • 3
Significance and Stability Analysis of Gene-Environment Interaction using RGxEStat Paper • 2604.03337 • Published Apr 3 • 1
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published Apr 22 • 244