Tasks
Difficulty Levels
| Task | Steps | Threshold | Pass Constraints |
|---|---|---|---|
| Easy | 200 | 0.70 | detection ≥ 0.35 and fp_complement ≥ 0.65 |
| Medium | 500 | 0.50 | detection ≥ 0.35 and fp_complement ≥ 0.60 |
| Hard | 1000 | 0.45 | detection ≥ 0.35 and fp_complement ≥ 0.55 |
Why Constraints Exist
Weighted scores alone can be gamed by degenerate policies:
- `allow_all` inflates availability/efficiency.
- `block_all` inflates detection.
The pass constraints ensure any passing policy must satisfy both:
- meaningful threat detection,
- acceptable benign-traffic handling.
Task scoring logic is implemented in server/graders.py.
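
The pass rule described by the table can be sketched as below. This is an illustration only, not the code in server/graders.py: the names `TaskSpec`, `TASKS`, and `passes` are hypothetical, and it assumes the weighted score is compared to the threshold with `>=`, which the table does not state explicitly.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TaskSpec:
    """One row of the difficulty table (hypothetical container)."""
    steps: int
    threshold: float          # minimum weighted score
    min_detection: float      # pass constraint on detection
    min_fp_complement: float  # pass constraint on fp_complement


# Values copied from the difficulty table above.
TASKS = {
    "easy": TaskSpec(200, 0.70, 0.35, 0.65),
    "medium": TaskSpec(500, 0.50, 0.35, 0.60),
    "hard": TaskSpec(1000, 0.45, 0.35, 0.55),
}


def passes(task: str, score: float, detection: float, fp_complement: float) -> bool:
    """Return True only if the score and both constraints are met.

    Assumption: the weighted score must be >= the task threshold.
    """
    spec = TASKS[task]
    return (
        score >= spec.threshold
        and detection >= spec.min_detection
        and fp_complement >= spec.min_fp_complement
    )


# Illustrative inputs (made-up numbers, not measured results):
# a policy that never blocks may score well overall but detect nothing.
print(passes("easy", score=0.90, detection=0.0, fp_complement=1.0))
# a policy that blocks everything detects all threats but fails benign traffic.
print(passes("easy", score=0.90, detection=1.0, fp_complement=0.0))
# a balanced policy clears the threshold and both constraints.
print(passes("easy", score=0.75, detection=0.40, fp_complement=0.70))
```

Under these assumptions, both degenerate policies fail regardless of how high their weighted score is, because each violates one of the two constraints.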