SWE-bench-Live's Collections
Cross-platform-bench
Updated 4 days ago
These benchmarks evaluate LM agents on software engineering (SWE) and computer-use tasks across different operating systems.
SWE-bench-Live/Windows • Updated 4 days ago • 61 • 88
SWE-bench-Live/OS-bench • Updated about 2 hours ago • 87 • 121