Jiarui Liu

Jerry999

Recent Activity

updated a dataset 1 day ago

Jerry999/ARS

Updated about 23 hours ago • 50

published a dataset 2 days ago

Jerry999/ARS

Updated about 23 hours ago • 50

upvoted a paper 7 days ago

Beyond Monolingual Deep Research: Evaluating Agents and Retrievers with Cross-Lingual BrowseComp-Plus

Paper • 2606.15345 • Published 12 days ago • 16

authored 17 papers 12 days ago

Synatra: Turning Indirect Knowledge into Direct Demonstrations for Digital Agents at Scale

Paper • 2409.15637 • Published Sep 24, 2024

Chumor 1.0: A Truly Funny and Challenging Chinese Humor Understanding Dataset from Ruo Zhi Ba

Paper • 2406.12754 • Published Jun 18, 2024

Chumor 2.0: Towards Benchmarking Chinese Humor Understanding

Paper • 2412.17729 • Published Dec 23, 2024

Towards Global AI Inclusivity: A Large-Scale Multilingual Terminology Dataset (GIST)

Paper • 2412.18367 • Published Dec 24, 2024

CORE: Measuring Multi-Agent LLM Interaction Quality under Game-Theoretic Pressures

Paper • 2508.11915 • Published Aug 16, 2025

Humanizing Machines: Rethinking LLM Anthropomorphism Through a Multi-Level Framework of Design

Paper • 2508.17573 • Published Aug 25, 2025 • 1

Social World Models

Paper • 2509.00559 • Published Aug 30, 2025 • 1

Toward Honest Language Models for Deductive Reasoning

Paper • 2511.09222 • Published Nov 12, 2025

Taming Object Hallucinations with Verified Atomic Confidence Estimation

Paper • 2511.09228 • Published Nov 12, 2025

Mind the Sim2Real Gap in User Simulation for Agentic Tasks

Paper • 2603.11245 • Published Mar 11

Self-Distillation Zero: Self-Revision Turns Binary Rewards into Dense Supervision

Paper • 2604.12002 • Published Apr 13 • 12

MixSD: Mixed Contextual Self-Distillation for Knowledge Injection

Paper • 2605.16865 • Published May 16 • 9

Reinforcing Human Behavior Simulation via Verbal Feedback

Paper • 2605.20506 • Published May 19

PaperMentor: A Human-Centered Multi-Agent Writing Tutor for AI Research Papers on Overleaf

Paper • 2606.08857 • Published 18 days ago • 2
