Cheng Qian

chengq9

https://qiancheng0.github.io

qiancheng0

AI & ML interests

Agent, Tool Learning

Recent Activity

upvoted a paper 19 days ago

Brick-Composer: Using MLLMs for Assembly with Diverse Bricks

Paper • 2606.05445 • Published 22 days ago • 8

upvoted a paper 20 days ago

AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints

Paper • 2606.05622 • Published 21 days ago • 43

upvoted a paper 22 days ago

Ψ-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues

Paper • 2606.02754 • Published 24 days ago • 13

upvoted a paper 28 days ago

Advancing Creative Physical Intelligence in Large Multimodal Models

Paper • 2605.26396 • Published May 25 • 21

submitted a paper to Daily Papers 28 days ago

Advancing Creative Physical Intelligence in Large Multimodal Models

Paper • 2605.26396 • Published May 25 • 21

updated a dataset about 1 month ago

chengq9/CreativityBench-MM

Viewer • Updated about 1 month ago • 1.2k • 89

published a dataset about 1 month ago

chengq9/CreativityBench-MM

Viewer • Updated about 1 month ago • 1.2k • 89

upvoted 2 papers about 1 month ago

Code as Agent Harness

Paper • 2605.18747 • Published May 18 • 223

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 113

updated a dataset about 2 months ago

chengq9/CreativityBench

Viewer • Updated May 7 • 3.29k • 176 • 2

submitted a paper to Daily Papers about 2 months ago

CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing

Paper • 2605.02910 • Published May 6 • 23

upvoted a paper about 2 months ago

CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing

Paper • 2605.02910 • Published May 6 • 23

upvoted a paper 2 months ago

PEARL: Self-Evolving Assistant for Time Management with Reinforcement Learning

Paper • 2601.11957 • Published Jan 28 • 3

upvoted a paper 3 months ago

RAGEN-2: Reasoning Collapse in Agentic RL

Paper • 2604.06268 • Published Apr 7 • 69

published a dataset 3 months ago

chengq9/CreativityBench

Viewer • Updated May 7 • 3.29k • 176 • 2

upvoted a paper 3 months ago

NarrativeTrack: Evaluating Video Language Models Beyond the Frame

Paper • 2601.01095 • Published Jan 3 • 8

upvoted 2 papers 4 months ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 60

Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data

Paper • 2602.21320 • Published Feb 24 • 12

upvoted a collection 5 months ago

AgentDoG

Collection

A Diagnostic Guardrail Framework for AI Agent Safety and Security • 12 items • Updated 4 days ago • 112

upvoted a paper 6 months ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published Dec 18, 2025 • 31
