Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
dylu's picture
8 13

dylu

ludybupt
Mi6paulino's profile picture sioyang's profile picture tahamajs's profile picture
·
  • ludybupt

AI & ML interests

None yet

Recent Activity

authored a paper about 8 hours ago
Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces
new activity about 2 months ago
Qwen/Qwen3-Next-80B-A3B-Thinking:Request for SWE-bench-Verified Evaluation Metrics of Qwen3-Next-80B-A3B.
new activity 4 months ago
SWE-bench/SWE-smith:How to get Previous version images(May 8 version in docker hub)
View all activity

Organizations

None yet

authored a paper about 8 hours ago

Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

Paper • 2601.11868 • Published 10 days ago • 28
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs