Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Orfeas Menis Mastromichalakis's picture
4 1

Orfeas Menis Mastromichalakis

menorf
·

AI & ML interests

None yet

Recent Activity

authored a paper 1 day ago
Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces
upvoted a paper 2 days ago
Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces
new activity 16 days ago
harborframework/parity-experiments:aime
View all activity

Organizations

Bainbridge's profile picture

authored a paper 1 day ago

Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

Paper • 2601.11868 • Published 8 days ago • 22
upvoted a paper 2 days ago

Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

Paper • 2601.11868 • Published 8 days ago • 22
New activity in harborframework/parity-experiments 16 days ago

aime

#52 opened 16 days ago by
menorf

aime

#51 opened 16 days ago by
menorf

Add AIME adapter validation results

#50 opened 16 days ago by
menorf

Add AIME adapter validation results

#49 opened 16 days ago by
menorf
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs