Security · PyTorch · Clinical Checking...
Task:
Episode:
Step:0
Reward:0.0000
Done:
📥 Observation
Press Reset to load the first observation...
🏆 Reward
No reward yet
⚡ Build Action
INFODebug panel ready. Select a task and press Reset to start.
🔑 API Configuration
📊 Run History
No runs yet. Configure a model above and run.
ℹ️ Tips

Groq — Fast, free tier, use llama-3.3-70b-versatile

OpenRouter — Many models, free tier has rate limits

HuggingFace — Use your HF token with router.huggingface.co/v1

⚠️ Free tier models may hit rate limits on 9 tasks

📊

Run a benchmark to see results here. Configure your API key and model on the left, then click Run.

Benchmark logs will appear here...