Spaces:

jdsb06
/

content-moderation-env

Sleeping

App Files Files Community

content-moderation-env

Commit History

Fix campaign IDs: load dynamically from env instead of hardcoded values

e8094c5

Soham Banerjee commited on Apr 8

Update README path to server/app.py structure and dependencies

c769b2e

Soham Banerjee commited on Apr 8

Upgrade Gradio UI theme and styling aesthetics

7c416c2

Soham Banerjee commited on Apr 8

Fix campaign detection reset logic in UI

e177d10

Soham Banerjee commited on Apr 8

Add permissive trailing slashes for API endpoints

54de384

Soham Banerjee commited on Apr 8

Fix Dockerfile CMD to use server.app

7bf9d4b

Soham Banerjee commited on Apr 8

Merge branch 'main' of https://github.com/oki-dokii/Meta into bruh

ef72aeb

Soham Banerjee commited on Apr 8

Fix validate-submission issues: pyproject.toml setup, server entrypoint, app path, and uv.lock

29ae803

Soham Banerjee commited on Apr 8

Merge branch 'bruh'

de331a4

Soham Banerjee commited on Apr 8

fix: Appended the required score=<score> to [END] stdout logs for OpenEnv compatibility

b377684

Soham Banerjee commited on Apr 8

Merge branch 'bruh'

a054d58

Soham Banerjee commited on Apr 8

chore: Strict compliance with OpenAI client AST init

494f108

Soham Banerjee commited on Apr 8

Merge branch 'bruh'

ed47fce

Soham Banerjee commited on Apr 8

chore: strict compliance with OpenEnv inference env variables ast-validation

de96010

Soham Banerjee commited on Apr 8

docs: Add architecture and core loop Mermaid diagrams

4f6ae75

Soham Banerjee commited on Apr 8

docs: Add architecture and core loop Mermaid diagrams

47c3d7b

Soham Banerjee commited on Apr 8

Update README.md

58cc6be
unverified

Soham Banerjee commited on Apr 8

docs: Fix penalties header to remove negative reward reference

6e93689

Soham Banerjee commited on Apr 8

docs: Update reward constraints copy to 0.0-1.0 in UI and README

e486f66

Soham Banerjee commited on Apr 8

Fix page unresponsiveness by removing demo.load() events

e18dfa2

Jashandeep Singh Copilot commited on Apr 7

Fix event handlers by moving functions to module level

783ba73

Jashandeep Singh Copilot commited on Apr 7

Simplify theme/CSS for Gradio 6.0 and improve performance

a617197

Jashandeep Singh Copilot commited on Apr 7

Fix Gradio theme/css initialization to make buttons work

bb0d19d

Jashandeep Singh Copilot commited on Apr 7

Fix Gradio 6.0 deprecation warnings and campaign loading

b35b650

Jashandeep Singh commited on Apr 7

Add comprehensive multi-provider documentation and tests

7f9923b

Jashandeep Singh commited on Apr 7

Add multi-provider support: Groq (default), OpenAI, Grok

d36acd9

Jashandeep Singh commited on Apr 7

Add Groq API support alongside OpenAI and Grok

2a479c8

Jashandeep Singh commited on Apr 7

Fix README color for HF

39f65af

Jashandeep Singh commited on Apr 7

Deploy to HF Spaces

ff6af76

Jashandeep Singh Copilot commited on Apr 7

Finalise OpenEnv submission: Clamp rewards to 0.0-1.0, update Gradio UI, and add Groq pipelines

192db9d

Soham Banerjee commited on Apr 7

app.py: Fix Gradio 6 deprecations and unused imports

04de9f3

Soham Banerjee commited on Apr 5

inference.py: Groq default + dynamic scenario loading + campaign/adversarial prompt hints

2421327

Soham Banerjee commited on Apr 5

README: add judge tip for deterministic reset(campaign_id=...) under campaign section

64f2c91

Soham Banerjee commited on Apr 5

Restore moderation_benchmark.json from d741d4b (128 scenarios, 100/100 checks)

b2860e4

Soham Banerjee commited on Apr 5

Merge + 3 fixes: README accurate (128 scen / real baselines), is_adversarial in state, reset(campaign_id) (100/100 checks)

fa17b3c

Soham Banerjee commited on Apr 5

3 fixes: README accurate, is_adversarial in state, reset(campaign_id) (100/100)

10c3c6e

Soham Banerjee commited on Apr 5

2b 3a

abf8abc

DayalGupta03 commited on Apr 4

Appeal mechanic: is_adversarial + env.appeal() 2-turn flow (92/92 checks)

d741d4b

Soham Banerjee commited on Apr 4

Fill easy GT gaps: full label×action coverage (79/79 checks)

a4c538a

Soham Banerjee commited on Apr 4

Graduated severity penalty: sev-5→-0.30, sev-4→-0.15, sev-3→-0.05 (66/66)

94717ed

Soham Banerjee commited on Apr 4

Cross-post campaign mechanic: campaign_id in state, +0.15 escalate-all bonus (61/61 checks)

748cef6

Soham Banerjee commited on Apr 4

10 ambiguous hard scenarios + full valid_actions test suite (53/53 checks)

941d83d

Soham Banerjee commited on Apr 4

Close all 4 scoring gaps (+~6 pts)

9bc46b3

Soham Banerjee commited on Apr 4

v2 docs & validation: README rewrite, openenv.yaml v2.0, validator 47/47

68d61d8

Soham Banerjee commited on Apr 4

Update inference.py: expand TASKS to all 75 scenarios (25 easy / 20 medium / 30 hard)

2426958

Soham Banerjee commited on Apr 4

phase 1 and phase 2a

78d0a45

DayalGupta03 commited on Apr 4

ContentModerationEnv v1.0 — complete OpenEnv benchmark

2a39e79

Soham Banerjee commited on Apr 1

Commit History

Fix campaign IDs: load dynamically from env instead of hardcoded values e8094c5

Update README path to server/app.py structure and dependencies c769b2e

Upgrade Gradio UI theme and styling aesthetics 7c416c2

Fix campaign detection reset logic in UI e177d10

Add permissive trailing slashes for API endpoints 54de384

Fix Dockerfile CMD to use server.app 7bf9d4b

Merge branch 'main' of https://github.com/oki-dokii/Meta into bruh ef72aeb

Fix validate-submission issues: pyproject.toml setup, server entrypoint, app path, and uv.lock 29ae803

Merge branch 'bruh' de331a4

fix: Appended the required score=<score> to [END] stdout logs for OpenEnv compatibility b377684

Merge branch 'bruh' a054d58

chore: Strict compliance with OpenAI client AST init 494f108

Merge branch 'bruh' ed47fce

chore: strict compliance with OpenEnv inference env variables ast-validation de96010

docs: Add architecture and core loop Mermaid diagrams 4f6ae75

docs: Add architecture and core loop Mermaid diagrams 47c3d7b

Update README.md 58cc6be unverified

docs: Fix penalties header to remove negative reward reference 6e93689

docs: Update reward constraints copy to 0.0-1.0 in UI and README e486f66

Fix page unresponsiveness by removing demo.load() events e18dfa2

Fix event handlers by moving functions to module level 783ba73

Simplify theme/CSS for Gradio 6.0 and improve performance a617197

Fix Gradio theme/css initialization to make buttons work bb0d19d

Fix Gradio 6.0 deprecation warnings and campaign loading b35b650

Add comprehensive multi-provider documentation and tests 7f9923b

Add multi-provider support: Groq (default), OpenAI, Grok d36acd9

Add Groq API support alongside OpenAI and Grok 2a479c8

Fix README color for HF 39f65af

Deploy to HF Spaces ff6af76

Finalise OpenEnv submission: Clamp rewards to 0.0-1.0, update Gradio UI, and add Groq pipelines 192db9d

app.py: Fix Gradio 6 deprecations and unused imports 04de9f3

inference.py: Groq default + dynamic scenario loading + campaign/adversarial prompt hints 2421327

README: add judge tip for deterministic reset(campaign_id=...) under campaign section 64f2c91

Restore moderation_benchmark.json from d741d4b (128 scenarios, 100/100 checks) b2860e4

Merge + 3 fixes: README accurate (128 scen / real baselines), is_adversarial in state, reset(campaign_id) (100/100 checks) fa17b3c

3 fixes: README accurate, is_adversarial in state, reset(campaign_id) (100/100) 10c3c6e

2b 3a abf8abc

Appeal mechanic: is_adversarial + env.appeal() 2-turn flow (92/92 checks) d741d4b

Fill easy GT gaps: full label×action coverage (79/79 checks) a4c538a

Graduated severity penalty: sev-5→-0.30, sev-4→-0.15, sev-3→-0.05 (66/66) 94717ed

Cross-post campaign mechanic: campaign_id in state, +0.15 escalate-all bonus (61/61 checks) 748cef6

10 ambiguous hard scenarios + full valid_actions test suite (53/53 checks) 941d83d

Close all 4 scoring gaps (+~6 pts) 9bc46b3

v2 docs & validation: README rewrite, openenv.yaml v2.0, validator 47/47 68d61d8

Update inference.py: expand TASKS to all 75 scenarios (25 easy / 20 medium / 30 hard) 2426958

phase 1 and phase 2a 78d0a45

ContentModerationEnv v1.0 — complete OpenEnv benchmark 2a39e79

Fix campaign IDs: load dynamically from env instead of hardcoded values

e8094c5

Update README path to server/app.py structure and dependencies

c769b2e

Upgrade Gradio UI theme and styling aesthetics

7c416c2

Fix campaign detection reset logic in UI

e177d10

Add permissive trailing slashes for API endpoints

54de384

Fix Dockerfile CMD to use server.app

7bf9d4b

Merge branch 'main' of https://github.com/oki-dokii/Meta into bruh

ef72aeb

Fix validate-submission issues: pyproject.toml setup, server entrypoint, app path, and uv.lock

29ae803

Merge branch 'bruh'

de331a4

fix: Appended the required score=<score> to [END] stdout logs for OpenEnv compatibility

b377684

Merge branch 'bruh'

a054d58

chore: Strict compliance with OpenAI client AST init

494f108

Merge branch 'bruh'

ed47fce

chore: strict compliance with OpenEnv inference env variables ast-validation

de96010

docs: Add architecture and core loop Mermaid diagrams

4f6ae75

docs: Add architecture and core loop Mermaid diagrams

47c3d7b

Update README.md

58cc6be
unverified

docs: Fix penalties header to remove negative reward reference

6e93689

docs: Update reward constraints copy to 0.0-1.0 in UI and README

e486f66

Fix page unresponsiveness by removing demo.load() events

e18dfa2

Fix event handlers by moving functions to module level

783ba73

Simplify theme/CSS for Gradio 6.0 and improve performance

a617197

Fix Gradio theme/css initialization to make buttons work

bb0d19d

Fix Gradio 6.0 deprecation warnings and campaign loading

b35b650

Add comprehensive multi-provider documentation and tests

7f9923b

Add multi-provider support: Groq (default), OpenAI, Grok

d36acd9

Add Groq API support alongside OpenAI and Grok

2a479c8

Fix README color for HF

39f65af

Deploy to HF Spaces

ff6af76

Finalise OpenEnv submission: Clamp rewards to 0.0-1.0, update Gradio UI, and add Groq pipelines

192db9d

app.py: Fix Gradio 6 deprecations and unused imports

04de9f3

inference.py: Groq default + dynamic scenario loading + campaign/adversarial prompt hints

2421327

README: add judge tip for deterministic reset(campaign_id=...) under campaign section

64f2c91

Restore moderation_benchmark.json from d741d4b (128 scenarios, 100/100 checks)

b2860e4

Merge + 3 fixes: README accurate (128 scen / real baselines), is_adversarial in state, reset(campaign_id) (100/100 checks)

fa17b3c

3 fixes: README accurate, is_adversarial in state, reset(campaign_id) (100/100)

10c3c6e

2b 3a

abf8abc

Appeal mechanic: is_adversarial + env.appeal() 2-turn flow (92/92 checks)

d741d4b

Fill easy GT gaps: full label×action coverage (79/79 checks)

a4c538a

Graduated severity penalty: sev-5→-0.30, sev-4→-0.15, sev-3→-0.05 (66/66)

94717ed

Cross-post campaign mechanic: campaign_id in state, +0.15 escalate-all bonus (61/61 checks)

748cef6

10 ambiguous hard scenarios + full valid_actions test suite (53/53 checks)

941d83d

Close all 4 scoring gaps (+~6 pts)

9bc46b3

v2 docs & validation: README rewrite, openenv.yaml v2.0, validator 47/47

68d61d8

Update inference.py: expand TASKS to all 75 scenarios (25 easy / 20 medium / 30 hard)

2426958

phase 1 and phase 2a

78d0a45

ContentModerationEnv v1.0 — complete OpenEnv benchmark

2a39e79