Jeff

JiayuJeff

6 26 5

JiayuJeff

AI & ML interests

None yet

Recent Activity

upvoted a paper about 24 hours ago

Trimming the Long-Tail of Visual World Modeling Evaluation

upvoted a paper 1 day ago

GBC: Gradient-Based Connections for Optimizing Multi-Agent Systems

updated a dataset 2 days ago

JiayuJeff/PlanBench-XL

View all activity

Organizations

None yet

upvoted a paper about 24 hours ago

Trimming the Long-Tail of Visual World Modeling Evaluation

Paper • 2606.24256 • Published 8 days ago • 35

upvoted a paper 1 day ago

GBC: Gradient-Based Connections for Optimizing Multi-Agent Systems

Paper • 2606.28187 • Published 5 days ago • 11

updated a dataset 2 days ago

JiayuJeff/PlanBench-XL

Viewer • Updated 2 days ago • 327 • 118 • 4

authored a paper 6 days ago

BioInsight: Multi-Agent Orchestration for Interactive Biomedical Knowledge Discovery

Paper • 2606.20997 • Published 12 days ago • 3

upvoted 2 papers 7 days ago

BioInsight: Multi-Agent Orchestration for Interactive Biomedical Knowledge Discovery

Paper • 2606.20997 • Published 12 days ago • 3

PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems

Paper • 2606.22388 • Published 10 days ago • 95

commented a paper 8 days ago

PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems

Paper • 2606.22388 • Published 10 days ago • 95 •

authored a paper 8 days ago

PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems

Paper • 2606.22388 • Published 10 days ago • 95

upvoted a collection 8 days ago

awesome-agentic-benchmarks

Collection

3 items • Updated 8 days ago • 2

upvoted a paper 8 days ago

GeoBrowse: A Geolocation Benchmark for Agentic Tool Use with Expert-Annotated Reasoning Traces

Paper • 2604.04017 • Published Apr 5 • 8

submitted a paper to Daily Papers 8 days ago

PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems

Paper • 2606.22388 • Published 10 days ago • 95

liked a dataset 11 days ago

JiayuJeff/PlanBench-XL

Viewer • Updated 2 days ago • 327 • 118 • 4

published a dataset 11 days ago

JiayuJeff/PlanBench-XL

Viewer • Updated 2 days ago • 327 • 118 • 4

commented a paper 25 days ago

AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints

Paper • 2606.05622 • Published 27 days ago • 44 •

upvoted a paper 25 days ago

Brick-Composer: Using MLLMs for Assembly with Diverse Bricks

Paper • 2606.05445 • Published 28 days ago • 8

upvoted a paper 26 days ago

AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints

Paper • 2606.05622 • Published 27 days ago • 44

submitted a paper to Daily Papers 26 days ago

AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints

Paper • 2606.05622 • Published 27 days ago • 44

updated a dataset 26 days ago

JiayuJeff/AdaPlanBench

Updated 26 days ago • 129 • 3

liked a dataset 27 days ago

JiayuJeff/AdaPlanBench

Updated 26 days ago • 129 • 3

published a dataset 27 days ago

JiayuJeff/AdaPlanBench

Updated 26 days ago • 129 • 3

Jeff

AI & ML interests

Recent Activity

Organizations

JiayuJeff's activity