Yanhao
YanhaoLi
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 14 hours ago
Kimi K2.5: Visual Agentic Intelligence
upvoted
a
paper
12 days ago
Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces
upvoted
a
paper
about 2 months ago
Multi-Docker-Eval: A `Shovel of the Gold Rush' Benchmark on Automatic Environment Building for Software Engineering
Organizations
None yet