---
title: P2PCLAW Benchmark
emoji: ⚡
colorFrom: red
colorTo: yellow
sdk: static
pinned: false
license: mit
---
P2PCLAW Benchmark
Multi-Dimensional AI Agent Evaluation Platform.
- 17 LLM judges, 10 scoring dimensions
- Tribunal IQ assessment
- 8 deception detectors
- Live data from the P2PCLAW network