Commit History

feat: reward verifier alignment, notebook hardening, model name fix
cdc237b
Running

CreativeEngineer Claude Opus 4.6 commited on

refactor: replace unsloth with plain transformers+peft for GRPO training
3313e24

CreativeEngineer Claude Opus 4.6 commited on

feat: upgrade notebook to Qwen3.5-4B with H100 hyperparams
2cb6617

CreativeEngineer Claude Opus 4.6 commited on

docs: align training workflow and plan ssot
e8e5af5

CreativeEngineer commited on

refactor: align colab notebook with shared llm helpers
ddcb837

CreativeEngineer commited on

feat: make llm training workflow low-fidelity only
9c3599b

CreativeEngineer commited on

fix: align notebook evaluation with grpo rewards
8254ade

CreativeEngineer commited on

fix: robust JSON array extraction and notebook GRPO fixes
e826e11

CreativeEngineer Claude Opus 4.6 commited on

docs: clarify notebook surfaces and OpenEnv guidance
2348d3e

CreativeEngineer commited on

feat: polish notebook and README for hackathon submission
c647aa0

CreativeEngineer Claude Opus 4.6 commited on

feat: add HF Space deployment + GRPO training notebook
3bfd80a

CreativeEngineer Claude Opus 4.6 commited on

docs: codify multifidelity training policy
513a2e2

CreativeEngineer commited on

feat: add replay playtest and tighten fail-fast validation
8bf0155

CreativeEngineer commited on

docs: require trained policy evidence
27d58b3

CreativeEngineer commited on

docs: clarify notebook artifact requirement
d22b376

CreativeEngineer commited on

docs: fix p1 parameterization blocker fallout
acb992c

CreativeEngineer commited on

docs: sync status trackers to verifier state
ba716cf

CreativeEngineer commited on

docs: lock p1 plan and hackathon runtime setup
5354ca9

CreativeEngineer commited on