Commit History

Add 02 / WHAT THE AGENT REVIEWS — three real env scenes
4233bf4

sam25kat Claude Opus 4.7 (1M context) committed on

Expand BLOG.md to full end-to-end submission writeup
5b5a584

sam25kat Claude Opus 4.7 (1M context) committed on

Repoint blog links to BLOG.md file (per hackathon guidance)
c0449da

sam25kat Claude Opus 4.7 (1M context) committed on

Restructure landing for storytelling — proof-first
8602b50

sam25kat Claude Opus 4.7 (1M context) committed on

Add Results + Resources sections to landing page
e47ffb7

sam25kat Claude Opus 4.7 (1M context) committed on

Add hackathon + team credit (Meta × Hugging Face OpenEnv · ~The Cook House)
1ccf723

sam25kat Claude Opus 4.7 (1M context) committed on

Update stats: 16 → 76 scenarios, 72 → 430 findings
76295c0

sam25kat Claude Opus 4.7 (1M context) committed on

Link published HF Community blog from README
40d1711

sam25kat Claude Opus 4.7 (1M context) committed on

Add SFT→GRPO hybrid pipeline, 60+ scenarios, semantic graders, full results
d2a68fa

sam25kat Claude Opus 4.7 (1M context) committed on

Add HF training Space: Gradio UI + GRPO train script
443f900

sam25kat Claude Sonnet 4.6 committed on

Fix GRPOConfig: max_new_tokens → max_completion_length, remove temperature
a28dc6a

sam25kat Claude Sonnet 4.6 committed on

Add adaptive curriculum, GRPO training notebook, and beginner guide
4c557cd

sam25kat Claude Sonnet 4.6 committed on

Refine product positioning: premium landing page + README
6c30cc3

sameerkatte Claude Opus 4.6 (1M context) committed on

Add HTML landing page at GET / for HF Space preview
ab388ea

sameerkatte Claude Opus 4.6 (1M context) committed on

Clamp grader scores strictly within (0, 1) for validator compliance
382a35d

sameerkatte Claude Opus 4.6 (1M context) committed on

Add [START]/[STEP]/[END] structured output markers to inference.py
6dbb8cf

sameerkatte Claude Opus 4.6 (1M context) committed on

Pass openenv validate: add multi-mode deployment + runtime endpoints
8d618ab

sameerkatte Claude Opus 4.6 (1M context) committed on

Fix /reset to accept empty/missing body for validator compatibility
ee2d45e

sameerkatte Claude Opus 4.6 (1M context) committed on

Deploy to HF Spaces: add baseline scores, fix deps and validation
2ee6649

sameerkatte Claude Opus 4.6 (1M context) committed on

Initial commit: SecureReview OpenEnv environment
8b4c1a6

sameerkatte Claude Opus 4.6 (1M context) committed on