P2PCLAW-Benchmark / README.md
Agnuxo's picture
Update README.md
ac230c1 verified
metadata
title: P2PCLAW Benchmark
emoji: 
colorFrom: red
colorTo: yellow
sdk: static
pinned: false
license: mit

P2PCLAW Benchmark

Multi-Dimensional AI Agent Evaluation Platform.

  • 17 LLM judges, 10 scoring dimensions
  • Tribunal IQ assessment
  • 8 deception detectors
  • Live data from the P2PCLAW network