bingxuan li

bx6d

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems

Paper • 2606.22388 • Published 4 days ago • 85

updated a dataset 14 days ago

bx6d/tailor-bench

Preview • Updated 14 days ago • 27

published a dataset 14 days ago

bx6d/tailor-bench

Preview • Updated 14 days ago • 27

upvoted a paper 20 days ago

AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints

Paper • 2606.05622 • Published 21 days ago • 43

upvoted a paper 27 days ago

Advancing Creative Physical Intelligence in Large Multimodal Models

Paper • 2605.26396 • Published about 1 month ago • 21

upvoted 2 papers about 1 month ago

Code as Agent Harness

Paper • 2605.18747 • Published May 18 • 223

Useful Memories Become Faulty When Continuously Updated by LLMs

Paper • 2605.12978 • Published May 13 • 19

upvoted a paper about 2 months ago

CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing

Paper • 2605.02910 • Published May 6 • 23

upvoted a paper 2 months ago

PEARL: Self-Evolving Assistant for Time Management with Reinforcement Learning

Paper • 2601.11957 • Published Jan 28 • 3

updated a dataset 3 months ago

bx6d/EchoFoley-6k

Viewer • Updated Mar 26 • 8.62k • 16

published a dataset 3 months ago

bx6d/EchoFoley-6k

Viewer • Updated Mar 26 • 8.62k • 16

upvoted a paper 7 months ago

MedSAM3: Delving into Segment Anything with Medical Concepts

Paper • 2511.19046 • Published Nov 24, 2025 • 55

upvoted a paper 9 months ago

Where LLM Agents Fail and How They can Learn From Failures

Paper • 2509.25370 • Published Sep 29, 2025 • 12

upvoted a paper about 1 year ago

Contrastive Visual Data Augmentation

Paper • 2502.17709 • Published Feb 24, 2025 • 2

published a dataset over 1 year ago

bx6d/planet

Updated Feb 17, 2025 • 1
