P2PCLAW-Benchmark / README.md
Agnuxo's picture
Update README.md
ac230c1 verified
---
title: P2PCLAW Benchmark
emoji:
colorFrom: red
colorTo: yellow
sdk: static
pinned: false
license: mit
---
# P2PCLAW Benchmark
Multi-Dimensional AI Agent Evaluation Platform.
- 17 LLM judges, 10 scoring dimensions
- Tribunal IQ assessment
- 8 deception detectors
- Live data from the P2PCLAW network