Commit History

Fix campaign IDs: load dynamically from env instead of hardcoded values
e8094c5

Soham Banerjee committed on

Update README path to server/app.py structure and dependencies
c769b2e

Soham Banerjee committed on

Upgrade Gradio UI theme and styling aesthetics
7c416c2

Soham Banerjee committed on

Fix campaign detection reset logic in UI
e177d10

Soham Banerjee committed on

Add permissive trailing slashes for API endpoints
54de384

Soham Banerjee committed on

Fix Dockerfile CMD to use server.app
7bf9d4b

Soham Banerjee committed on

Merge branch 'main' of https://github.com/oki-dokii/Meta into bruh
ef72aeb

Soham Banerjee committed on

Fix validate-submission issues: pyproject.toml setup, server entrypoint, app path, and uv.lock
29ae803

Soham Banerjee committed on

Merge branch 'bruh'
de331a4

Soham Banerjee committed on

fix: Appended the required score=<score> to [END] stdout logs for OpenEnv compatibility
b377684

Soham Banerjee committed on

Merge branch 'bruh'
a054d58

Soham Banerjee committed on

chore: Strict compliance with OpenAI client AST init
494f108

Soham Banerjee committed on

Merge branch 'bruh'
ed47fce

Soham Banerjee committed on

chore: strict compliance with OpenEnv inference env variables ast-validation
de96010

Soham Banerjee committed on

docs: Add architecture and core loop Mermaid diagrams
4f6ae75

Soham Banerjee committed on

docs: Add architecture and core loop Mermaid diagrams
47c3d7b

Soham Banerjee committed on

Update README.md
58cc6be
unverified

Soham Banerjee committed on

docs: Fix penalties header to remove negative reward reference
6e93689

Soham Banerjee committed on

docs: Update reward constraints copy to 0.0-1.0 in UI and README
e486f66

Soham Banerjee committed on

Fix page unresponsiveness by removing demo.load() events
e18dfa2

Jashandeep Singh Copilot committed on

Fix event handlers by moving functions to module level
783ba73

Jashandeep Singh Copilot committed on

Simplify theme/CSS for Gradio 6.0 and improve performance
a617197

Jashandeep Singh Copilot committed on

Fix Gradio theme/css initialization to make buttons work
bb0d19d

Jashandeep Singh Copilot committed on

Fix Gradio 6.0 deprecation warnings and campaign loading
b35b650

Jashandeep Singh committed on

Add comprehensive multi-provider documentation and tests
7f9923b

Jashandeep Singh committed on

Add multi-provider support: Groq (default), OpenAI, Grok
d36acd9

Jashandeep Singh committed on

Add Groq API support alongside OpenAI and Grok
2a479c8

Jashandeep Singh committed on

Fix README color for HF
39f65af

Jashandeep Singh committed on

Deploy to HF Spaces
ff6af76

Jashandeep Singh Copilot committed on

Finalise OpenEnv submission: Clamp rewards to 0.0-1.0, update Gradio UI, and add Groq pipelines
192db9d

Soham Banerjee committed on

app.py: Fix Gradio 6 deprecations and unused imports
04de9f3

Soham Banerjee committed on

inference.py: Groq default + dynamic scenario loading + campaign/adversarial prompt hints
2421327

Soham Banerjee committed on

README: add judge tip for deterministic reset(campaign_id=...) under campaign section
64f2c91

Soham Banerjee committed on

Restore moderation_benchmark.json from d741d4b (128 scenarios, 100/100 checks)
b2860e4

Soham Banerjee committed on

Merge + 3 fixes: README accurate (128 scen / real baselines), is_adversarial in state, reset(campaign_id) (100/100 checks)
fa17b3c

Soham Banerjee committed on

3 fixes: README accurate, is_adversarial in state, reset(campaign_id) (100/100)
10c3c6e

Soham Banerjee committed on

2b 3a
abf8abc

DayalGupta03 committed on

Appeal mechanic: is_adversarial + env.appeal() 2-turn flow (92/92 checks)
d741d4b

Soham Banerjee committed on

Fill easy GT gaps: full label×action coverage (79/79 checks)
a4c538a

Soham Banerjee committed on

Graduated severity penalty: sev-5→-0.30, sev-4→-0.15, sev-3→-0.05 (66/66)
94717ed

Soham Banerjee committed on

Cross-post campaign mechanic: campaign_id in state, +0.15 escalate-all bonus (61/61 checks)
748cef6

Soham Banerjee committed on

10 ambiguous hard scenarios + full valid_actions test suite (53/53 checks)
941d83d

Soham Banerjee committed on

Close all 4 scoring gaps (+~6 pts)
9bc46b3

Soham Banerjee committed on

v2 docs & validation: README rewrite, openenv.yaml v2.0, validator 47/47
68d61d8

Soham Banerjee committed on

Update inference.py: expand TASKS to all 75 scenarios (25 easy / 20 medium / 30 hard)
2426958

Soham Banerjee committed on

phase 1 and phase 2a
78d0a45

DayalGupta03 committed on

ContentModerationEnv v1.0 — complete OpenEnv benchmark
2a39e79

Soham Banerjee committed on