Sunhao Dai

KID-22

AI & ML interests

None yet

Recent Activity

authored a paper 10 days ago

OneRank: Unified Transformer-Native Ranking Architecture for Multi-Task Recommendation

Organizations

upvoted a paper 3 days ago

SingGuard: A Policy-Adaptive Multimodal LLM Guardrail with Dynamic Reasoning

Paper • 2606.22873 • Published 4 days ago • 8

upvoted a paper 10 days ago

OneRank: Unified Transformer-Native Ranking Architecture for Multi-Task Recommendation

Paper • 2606.16838 • Published 11 days ago • 18

upvoted a collection 2 months ago

DR-Venus

Collection

5 items • Updated 11 days ago • 17

upvoted 2 papers 2 months ago

DR-Venus: Towards Frontier Edge-Scale Deep Research Agents with Only 10K Open Data

Paper • 2604.19859 • Published Apr 21 • 54

LoopCTR: Unlocking the Loop Scaling Power for Click-Through Rate Prediction

Paper • 2604.19550 • Published Apr 21 • 5

upvoted 2 papers 3 months ago

RAGEN-2: Reasoning Collapse in Agentic RL

Paper • 2604.06268 • Published Apr 7 • 69

Learning to Retrieve from Agent Trajectories

Paper • 2604.04949 • Published Mar 30 • 72

upvoted a collection 3 months ago

LRAT

Collection

Official resources for LRAT, including trajectory-trained dense retrievers and the LRAT training dataset for agentic search. • 4 items • Updated Apr 8 • 5

upvoted a paper 3 months ago

AgentProcessBench: Diagnosing Step-Level Process Quality in Tool-Using Agents

Paper • 2603.14465 • Published Mar 15 • 23

upvoted 4 papers 4 months ago

upvoted 3 papers 5 months ago

GISA: A Benchmark for General Information-Seeking Assistant

Paper • 2602.08543 • Published Feb 9 • 26

Unlocking Implicit Experience: Synthesizing Tool-Use Trajectories from Text

Paper • 2601.10355 • Published Jan 15 • 39

When Personalization Misleads: Understanding and Mitigating Hallucinations in Personalized LLMs

Paper • 2601.11000 • Published Jan 16 • 27

upvoted a collection 5 months ago

MatchTIR

Collection

The official datasets and model checkpoints of MatchTIR. • 6 items • Updated Jan 16 • 3

upvoted a paper 5 months ago

MatchTIR: Fine-Grained Supervision for Tool-Integrated Reasoning via Bipartite Matching

Paper • 2601.10712 • Published Jan 15 • 24

upvoted a paper 6 months ago

RecGPT-V2 Technical Report

Paper • 2512.14503 • Published Dec 16, 2025 • 18

upvoted a paper 8 months ago

Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents

Paper • 2510.14967 • Published Oct 16, 2025 • 34
