TRIAGE: Dialectical Reasoning for Explainable Risk Prediction on Irregularly Sampled Medical Time Series with LLMs Paper • 2606.09030 • Published 18 days ago • 30
Who Should Lead Decoding Now? Tracking Reliable Trajectories for Ensembling Masked Diffusion Language Models Paper • 2606.16281 • Published 11 days ago • 34
Argument Reconstruction as Supervision for Critical Thinking in LLMs Paper • 2603.17432 • Published Mar 18 • 3
Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs Paper • 2605.09063 • Published May 9 • 82
CollabVR: Collaborative Video Reasoning with Vision-Language and Video Generation Models Paper • 2605.08735 • Published May 9 • 71
ReviewScore: Misinformed Peer Review Detection with Large Language Models Paper • 2509.21679 • Published Sep 25, 2025 • 64 • 4
Argument Reconstruction as Supervision for Critical Thinking in LLMs Paper • 2603.17432 • Published Mar 18 • 3
CXReasonAgent: Evidence-Grounded Diagnostic Reasoning Agent for Chest X-rays Paper • 2602.23276 • Published Feb 26 • 16
view article Article Argunauts Update: Learning Formal Argument Analysis with RLVF and HIRPO ggbetz • Dec 2, 2025 • 2
Lost in the Noise: How Reasoning Models Fail with Contextual Distractors Paper • 2601.07226 • Published Jan 12 • 33