build-small-hackathon's Collections

Paul's Agent Eval

updated about 18 hours ago

Evaluate AI agents at Session, Trace, and Span levels, inspired by Amazon Bedrock AgentCore Evaluations

  • Running on Zero
    Agents
    1

    ai agent evaluation pipeline

    🧪
    1

    Evaluate AI agents at Session, Trace & Span levels


  • build-small-hackathon/agent-eval-golden-dataset

    Updated 5 days ago • 24