🧠 OpenEnv Hackathon — Judging & Expectations Guide

🚨 TL;DR (What You Actually Need to Do)

Build an environment where an LLM can train and measurably improve at something meaningful, then:

Show actual training
Provide evidence (metrics, reward curves, comparisons)
Tell a clear, compelling story

A messy but ambitious project with real training evidence beats a polished but shallow one.

⚖️ Judging Criteria (Core Evaluation)

1. 🌟 Environment Innovation — 40%

Is your environment novel, creative, or challenging?
Does it meaningfully test agent behavior?
Avoid overused ideas (grid worlds, chess clones, etc.)

2. 🎤 Storytelling & Presentation — 30%

Clearly explain:
- The problem
- The environment
- What the agent learned
Demo should be engaging and easy to follow

3. 📈 Showing Improvement in Rewards — 20%

Must prove learning happened
Evidence:
- Reward curves
- Before vs after behavior
- Baseline comparisons
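As a rough illustration of how this evidence could be prepared, the sketch below smooths a per-episode reward series into a curve and summarizes a baseline-versus-trained comparison. The function names (`moving_average`, `improvement_summary`) and all numbers are hypothetical placeholders, not real results or part of any OpenEnv API.

```python
from statistics import mean


def moving_average(values, window):
    """Trailing moving average; early points average over what is available."""
    smoothed = []
    for i in range(len(values)):
        start = max(0, i - window + 1)
        smoothed.append(mean(values[start:i + 1]))
    return smoothed


def improvement_summary(baseline_rewards, trained_rewards):
    """Compare mean episode reward of a baseline agent and a trained agent."""
    base = mean(baseline_rewards)
    trained = mean(trained_rewards)
    return {"baseline_mean": base, "trained_mean": trained, "delta": trained - base}


# Placeholder numbers for illustration only; replace with logged rewards.
episode_rewards = [0.0, 1.0, 0.0, 2.0, 3.0, 2.0, 4.0, 5.0]
curve = moving_average(episode_rewards, window=4)
summary = improvement_summary(
    baseline_rewards=[0.0, 1.0, 0.0, 1.0],
    trained_rewards=[3.0, 4.0, 5.0, 4.0],
)
```

The smoothed `curve` is what would typically go on a reward plot, while `summary` gives a single number to quote next to a before/after behavior example.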

4. ⚙️ Reward & Training Pipeline — 10%

Reward logic should be coherent and hard to exploit
Training should improve agent behavior

📦 Minimum Submission Requirements

Use OpenEnv (latest release)
Provide a working training script (Unsloth or HuggingFace TRL)
Show training evidence (loss and reward plots)
Submit one of:
- Mini-blog
- Video under 2 minutes
- Slides
Host on Hugging Face Spaces
Provide a README with problem, environment, results, links

Rules:

One submission per team
Submit environment URL
No changes after deadline

🧪 What Judges Look For

🔬 Real Training

Training must run against your environment
Show learning with plots, metrics, comparisons

🧠 Reward Design

Dense and informative rewards
Hard to game
Avoid simple binary rewards
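To make the contrast concrete, here is a minimal sketch of a binary reward next to a denser one. It assumes a hypothetical task that can be broken into countable subgoals; the weights (0.8, 0.2, 0.05) and parameter names are illustrative assumptions, not recommended values.

```python
def binary_reward(solved: bool) -> float:
    """Sparse signal: 1.0 only on full success, nothing in between."""
    return 1.0 if solved else 0.0


def dense_reward(subgoals_done: int, subgoals_total: int,
                 steps_used: int, max_steps: int,
                 invalid_actions: int) -> float:
    """Partial credit for progress, a small efficiency bonus, a penalty for invalid actions."""
    if subgoals_total <= 0 or max_steps <= 0:
        raise ValueError("subgoals_total and max_steps must be positive")
    # Cap progress so over-reporting subgoals cannot inflate the reward.
    progress = min(subgoals_done, subgoals_total) / subgoals_total
    # The bonus is scaled by progress, so finishing fast without progress earns nothing.
    steps_fraction = min(steps_used, max_steps) / max_steps
    efficiency_bonus = 0.2 * progress * (1.0 - steps_fraction)
    penalty = 0.05 * invalid_actions
    raw = 0.8 * progress + efficiency_bonus - penalty
    # Clip to a fixed range so no single term dominates.
    return max(-1.0, min(1.0, raw))
```

An agent that completes two of four subgoals gets a positive signal under `dense_reward` but zero under `binary_reward`, which is the kind of informative gradient the list above asks for. Tying the bonus to progress and capping each term are two simple ways to make the reward harder to game.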

🚀 Ambitious Problems

Solve something LLMs struggle with
Prefer underexplored domains

📊 Clear Results

Label axes properly
Save plots as images
Show comparisons clearly

📖 Tell a Story

Your README should answer:

Problem
Environment
Results
Why it matters

🧹 Clean Engineering

Use OpenEnv properly
Follow the Gym-style API (reset, step, state)
Maintain clean architecture
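The reset/step/state shape can be sketched in plain Python as follows. This is a toy stand-in and not OpenEnv's actual base classes or signatures: `GuessNumberEnv`, `StepResult`, and the reward values are hypothetical names and numbers chosen for illustration, so check the OpenEnv documentation for the real interface.

```python
from dataclasses import dataclass


@dataclass
class StepResult:
    observation: str
    reward: float
    done: bool


class GuessNumberEnv:
    """Toy environment: the agent guesses a hidden integer within a step budget."""

    def __init__(self, target: int, max_steps: int = 5):
        self._target = target
        self._max_steps = max_steps
        self._steps = 0
        self._done = False

    def reset(self) -> str:
        """Start a new episode and return the initial observation."""
        self._steps = 0
        self._done = False
        return "Guess an integer."

    def step(self, action: int) -> StepResult:
        """Apply one guess and return the observation, reward, and done flag."""
        if self._done:
            raise RuntimeError("Episode finished; call reset() first.")
        self._steps += 1
        if action == self._target:
            self._done = True
            return StepResult("correct", 1.0, True)
        self._done = self._steps >= self._max_steps
        hint = "higher" if action < self._target else "lower"
        # Small per-step penalty so wasted guesses are not free.
        return StepResult(hint, -0.1, self._done)

    def state(self) -> dict:
        """Expose episode bookkeeping without revealing the hidden target."""
        return {"steps": self._steps, "done": self._done}
```

Keeping episode bookkeeping inside the environment and returning one small result object per step makes it easier to plug the same environment into different training loops.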

🧭 Problem Selection Guidelines

Reuse your Round 1 idea only if it fits the themes
Build the environment and reward model early
Ensure alignment with judging criteria

🏁 Final Advice

Be ambitious
Show real learning
Communicate clearly

Judges want projects that push the frontier of LLM training.