Commit History

V4: steep reward slope - full-episode eval + stronger milestones + progress bonus
f885d09

Roopalgn committed on

fixes : rewards and training
d68729f

Coding Ninja committed on

Fix reward consistency and timeout progress
fcc31aa

Coding Ninja committed on

Fix reward-stationary reset controls and notebook training setup
2a5fb9e

Coding Ninja committed on

fix: rebalance reward range + unblock enrollment->monitoring->analysis phases
0825b0e

Roopalgn committed on

fix: remove duplicated curriculum block, dedup FDA table, accumulate total_reward for invalid steps
0aa2f10

Roopalgn committed on

fix(N1-N6): unblock SUBMIT_TO_FDA, harden episode lifecycle, add tests
5ed3409

Roopalgn committed on

V4: fix 5 critical + 8 major + 2 moderate issues
d148dd5

Roopalgn committed on

fix: V2 reward tuning + single-step training script for GRPO slope
17289ed

Roopalgn committed on

fix: rebalance rewards for GRPO slope - remove efficiency baseline, add milestone bonuses
d8168e4

Roopalgn committed on

Merge branch 'Roopalgn:main' into main
c8a460b
unverified

Suyash Kumar committed on

fix: G26 wire trainer.train(), G23 fix episode_phase progression
d5e1bb8

Coding Ninja committed on

fix: populate latent biology and failure telemetry
02ffe8b

Roopalgn committed on

fix: add root route so HF Space doesn't show 404
6b6624d

Roopalgn committed on

Push 7 Phase A: grounding + kaggle notebook + docs
62441e1

Roopalgn committed on

style: fix ruff lint and format errors
264ab32

Coding Ninja committed on

fix: wire shaping_bonus, advance_curriculum, AdversarialDesigner into episode_manager
0628014

Coding Ninja committed on

[Push 6] Suyash: CI workflow, container hardening, /transcripts endpoint, entrypoint validation, lint/format fixes
1684c0c

Coding Ninja committed on

[Push 5] Suyash: adaptive curriculum, dashboard backend, hardening
8859234

Coding Ninja committed on

[Push 4] Suyash: train.py, eval_compare.py, plot_rewards.py, env config, LLM judge
f42c8af

Coding Ninja committed on

Push 3: Curriculum controller, hidden-state pipeline, phase detector, trial judge, and full EpisodeManager wiring
1f2ca34

Coding Ninja committed on

Push 2: Rule-Engine, Reward Components, Episode Logging, Noise Model
22170b0

Coding Ninja committed on

fix(structure): align file layout with ARCHITECTURE.md — rules/ and root models.py
c2ae0e2

Coding Ninja committed on

refactor(env): rename environment/ to server/, add openenv.yaml, fix Dockerfile base image
d4cb143

Coding Ninja committed on