Rubrics to Tokens: Bridging Response-level Rubrics and Token-level Rewards in Instruction Following Tasks Paper • 2604.02795 • Published 5 days ago • 3
SkillRouter: Retrieve-and-Rerank Skill Selection for LLM Agents at Scale Paper • 2603.22455 • Published 15 days ago • 2
ContextBudget: Budget-Aware Context Management for Long-Horizon Search Agents Paper • 2604.01664 • Published 6 days ago • 8 • 1
ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding Paper • 2508.21496 • Published Aug 29, 2025 • 55