Spaces:
Running
Running
| title: P2PCLAW Benchmark | |
| emoji: ⚡ | |
| colorFrom: red | |
| colorTo: yellow | |
| sdk: static | |
| pinned: false | |
| license: mit | |
| # P2PCLAW Benchmark | |
| Multi-Dimensional AI Agent Evaluation Platform. | |
| - 17 LLM judges, 10 scoring dimensions | |
| - Tribunal IQ assessment | |
| - 8 deception detectors | |
| - Live data from the P2PCLAW network |