fusion-design-lab / training

Commit History

feat: reward verifier alignment, notebook hardening, model name fix
cdc237b
Running

CreativeEngineer Claude Opus 4.6 commited on

refactor: replace unsloth with plain transformers+peft for GRPO training
3313e24

CreativeEngineer Claude Opus 4.6 commited on

feat: upgrade notebook to Qwen3.5-4B with H100 hyperparams
2cb6617

CreativeEngineer Claude Opus 4.6 commited on

feat: add real-time stellarator optimization demo animation
3b185f9

CreativeEngineer Claude Opus 4.6 commited on

docs: align training workflow and plan ssot
e8e5af5

CreativeEngineer commited on

refactor: align colab notebook with shared llm helpers
ddcb837

CreativeEngineer commited on

feat: make llm training workflow low-fidelity only
9c3599b

CreativeEngineer commited on

feat: add model-driven llm reward evaluation
ede4c5c

CreativeEngineer commited on

fix: align notebook evaluation with grpo rewards
8254ade

CreativeEngineer commited on

fix: robust JSON array extraction and notebook GRPO fixes
e826e11

CreativeEngineer Claude Opus 4.6 commited on

feat: add reward and verifier monitoring telemetry
5e0e606

CreativeEngineer commited on

docs: clarify notebook surfaces and OpenEnv guidance
2348d3e

CreativeEngineer commited on

fix: restore ppo smoke early termination
9827b11

CreativeEngineer commited on

fix: harden ppo smoke action handling
40011e5

CreativeEngineer commited on

feat: polish notebook and README for hackathon submission
c647aa0

CreativeEngineer Claude Opus 4.6 commited on

feat: add llm rollout contract and simplify ppo smoke
ebd0ff3

CreativeEngineer commited on

feat: add HF Space deployment + GRPO training notebook
3bfd80a

CreativeEngineer Claude Opus 4.6 commited on

docs: codify multifidelity training policy
513a2e2

CreativeEngineer commited on

feat: add replay playtest and tighten fail-fast validation
8bf0155

CreativeEngineer commited on

docs: require trained policy evidence
27d58b3

CreativeEngineer commited on

docs: clarify notebook artifact requirement
d22b376

CreativeEngineer commited on

docs: fix p1 parameterization blocker fallout
acb992c

CreativeEngineer commited on

docs: sync status trackers to verifier state
ba716cf

CreativeEngineer commited on

docs: lock p1 plan and hackathon runtime setup
5354ca9

CreativeEngineer commited on

chore: add hackathon repo guardrails
98ffb4a

CreativeEngineer commited on

chore: scaffold fusion design lab repo
65b799e

CreativeEngineer commited on