Beyond Monolingual Deep Research: Evaluating Agents and Retrievers with Cross-Lingual BrowseComp-Plus Paper • 2606.15345 • Published 12 days ago • 16
Synatra: Turning Indirect Knowledge into Direct Demonstrations for Digital Agents at Scale Paper • 2409.15637 • Published Sep 24, 2024
Implicit Personalization in Language Models: A Systematic Study Paper • 2405.14808 • Published May 23, 2024
Automatic Generation of Model and Data Cards: A Step Towards Responsible AI Paper • 2405.06258 • Published May 10, 2024
BIG5-CHAT: Shaping LLM Personalities Through Training on Human-Grounded Data Paper • 2410.16491 • Published Oct 21, 2024 • 2
Chumor 1.0: A Truly Funny and Challenging Chinese Humor Understanding Dataset from Ruo Zhi Ba Paper • 2406.12754 • Published Jun 18, 2024
Chumor 2.0: Towards Benchmarking Chinese Humor Understanding Paper • 2412.17729 • Published Dec 23, 2024
Towards Global AI Inclusivity: A Large-Scale Multilingual Terminology Dataset (GIST) Paper • 2412.18367 • Published Dec 24, 2024
CORE: Measuring Multi-Agent LLM Interaction Quality under Game-Theoretic Pressures Paper • 2508.11915 • Published Aug 16, 2025
Humanizing Machines: Rethinking LLM Anthropomorphism Through a Multi-Level Framework of Design Paper • 2508.17573 • Published Aug 25, 2025 • 1
Taming Object Hallucinations with Verified Atomic Confidence Estimation Paper • 2511.09228 • Published Nov 12, 2025
Self-Distillation Zero: Self-Revision Turns Binary Rewards into Dense Supervision Paper • 2604.12002 • Published Apr 13 • 12
MixSD: Mixed Contextual Self-Distillation for Knowledge Injection Paper • 2605.16865 • Published May 16 • 9
PaperMentor: A Human-Centered Multi-Agent Writing Tutor for AI Research Papers on Overleaf Paper • 2606.08857 • Published 18 days ago • 2