DR-Venus: Towards Frontier Edge-Scale Deep Research Agents with Only 10K Open Data Paper • 2604.19859 • Published 17 days ago • 51
LoopCTR: Unlocking the Loop Scaling Power for Click-Through Rate Prediction Paper • 2604.19550 • Published 17 days ago • 4
LRAT Collection Official resources for LRAT, including trajectory-trained dense retrievers and the LRAT training dataset for agentic search. • 4 items • Updated 30 days ago • 4
AgentProcessBench: Diagnosing Step-Level Process Quality in Tool-Using Agents Paper • 2603.14465 • Published Mar 15 • 23
MemSifter: Offloading LLM Memory Retrieval via Outcome-Driven Proxy Reasoning Paper • 2603.03379 • Published Mar 3 • 32
DLLM-Searcher: Adapting Diffusion Large Language Model for Search Agents Paper • 2602.07035 • Published Feb 3 • 30
GISA: A Benchmark for General Information-Seeking Assistant Paper • 2602.08543 • Published Feb 9 • 26
Unlocking Implicit Experience: Synthesizing Tool-Use Trajectories from Text Paper • 2601.10355 • Published Jan 15 • 39
When Personalization Misleads: Understanding and Mitigating Hallucinations in Personalized LLMs Paper • 2601.11000 • Published Jan 16 • 27
MatchTIR Collection The official datasets and model checkpoints of MatchTIR. • 6 items • Updated Jan 16 • 3
MatchTIR: Fine-Grained Supervision for Tool-Integrated Reasoning via Bipartite Matching Paper • 2601.10712 • Published Jan 15 • 24
Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents Paper • 2510.14967 • Published Oct 16, 2025 • 34
OnePiece: Bringing Context Engineering and Reasoning to Industrial Cascade Ranking System Paper • 2509.18091 • Published Sep 22, 2025 • 34