CounterFeint / openenv.yaml
QuantumTransformer's picture
Upload folder using huggingface_hub
26bf1c9 verified
Raw
History Blame Contribute Delete
1.62 kB
spec_version: 1
name: counterfeint
description: >
Three-agent adversarial ad-fraud environment. A Fraudster proposes scam
ads into a review queue, an Investigator audits each ad with a six-tool
surface (advertiser_history, landing_page, payment_method, targeting_overlap,
campaign_structure, policy_classifier a Llama Guard 3 / Purple Llama
mock), and an Auditor grades both sides with deterministic Track A
(reasoning quality) + Track B (plausibility) rubrics. Designed for the
Multi-Agent Interactions and Self-Improvement hackathon themes.
type: space
sdk: docker
runtime: fastapi
app: server.app:app
port: 8000
# --- Multi-agent / OpenEnv metadata (additive, optional) ------------------
roles:
- name: fraudster
description: Proposes/modifies/skips ad creatives to slip fraud past the Investigator.
- name: investigator
description: Reviews queued ads, calls six investigation tools, issues verdicts.
- name: auditor
description: Oversight layer. Grades Investigator reasoning (Track A) + Fraudster plausibility (Track B). Not trained in Phase 3; retained for evaluation.
themes:
- multi-agent-interactions
- self-improving-agents
- long-horizon-planning
tags:
- multi-agent
- adversarial
- ads
- trust-and-safety
- self-play
- rubric-based
- llama-guard
# Endpoints exposed for external judges / evaluators (see server/public_api.py).
public_endpoints:
- method: GET
path: /api/v1/match/{match_id}
- method: POST
path: /api/v1/tools/policy_classifier
- method: GET
path: /api/v1/metrics