CreativeEngineer commited on
Commit
27d58b3
·
1 Parent(s): 714d655

docs: require trained policy evidence

Browse files
README.md CHANGED
@@ -10,9 +10,9 @@ The repo is organized around one clear submission thesis:
10
  - explicit constraints and feasibility semantics
11
  - a reward function that is iteratively improved through observed behavior
12
 
13
- Training is supporting evidence. The environment is the product.
14
 
15
- A trained model is optional for this repo's submission story. A public Colab notebook artifact is still required by the hackathon, and that notebook can stay evaluation-first if training evidence is weak.
16
 
17
  ## Current Status
18
 
 
10
  - explicit constraints and feasibility semantics
11
  - a reward function that is iteratively improved through observed behavior
12
 
13
+ The environment is the product. A trained policy is still required as evidence that agents can learn and use the environment rather than only manual or scripted play.
14
 
15
+ A trained model is required for this repo's submission story. A public Colab notebook artifact is also required by the hackathon, and that notebook should include a trained-policy demonstration rather than stay purely eval-first.
16
 
17
  ## Current Status
18
 
docs/FUSION_DESIGN_LAB_PLAN_V2.md CHANGED
@@ -7,7 +7,7 @@
7
 
8
  ## 1. Submission Thesis
9
 
10
- Fusion Design Lab is not primarily a "trained model for fusion" submission.
11
 
12
  It is a clear, reproducible environment for one constrained scientific design task:
13
 
@@ -17,7 +17,7 @@ It is a clear, reproducible environment for one constrained scientific design ta
17
  - explicit constraints and failure semantics
18
  - reward logic that can be explained and iterated
19
 
20
- Training is supporting evidence. The environment is the product.
21
 
22
  ## 2. Current State
23
 
@@ -97,7 +97,7 @@ Compute surfaces:
97
 
98
  - Northflank is the main compute workspace for verifier-heavy work
99
  - HF Space is the hosted environment surface
100
- - Colab is the required public artifact and can stay evaluation-first if training evidence is weak
101
 
102
  Evidence order:
103
 
@@ -172,8 +172,8 @@ Gate 7: submission artifacts exist
172
 
173
  If training evidence is weak:
174
 
175
- - keep the notebook evaluation-first
176
- - ship the environment, playtest, and baseline story anyway
177
 
178
  If HF Space deployment is delayed:
179
 
 
7
 
8
  ## 1. Submission Thesis
9
 
10
+ Fusion Design Lab is not only a "trained model for fusion" submission.
11
 
12
  It is a clear, reproducible environment for one constrained scientific design task:
13
 
 
17
  - explicit constraints and failure semantics
18
  - reward logic that can be explained and iterated
19
 
20
+ The environment is the product. A trained policy is required supporting evidence because it demonstrates that the environment is learnable in practice rather than only manually playable.
21
 
22
  ## 2. Current State
23
 
 
97
 
98
  - Northflank is the main compute workspace for verifier-heavy work
99
  - HF Space is the hosted environment surface
100
+ - Colab is the required public artifact and should show trained-policy behavior against the live environment
101
 
102
  Evidence order:
103
 
 
172
 
173
  If training evidence is weak:
174
 
175
+ - keep claims conservative about policy quality
176
+ - still ship a trained-policy demonstration and document its limitations plainly
177
 
178
  If HF Space deployment is delayed:
179
 
training/README.md CHANGED
@@ -1,9 +1,9 @@
1
  Training and evaluation notebooks belong here.
2
 
3
- This repository treats notebooks as supporting evidence for the environment, not the primary product.
4
 
5
  ## Status
6
 
7
  - [ ] Northflank notebook artifacts saved
8
  - [ ] Colab notebook saved
9
- - [ ] training evidence included only if it is persuasive
 
1
  Training and evaluation notebooks belong here.
2
 
3
+ This repository treats notebooks and trained-policy runs as supporting evidence for the environment, not the primary product.
4
 
5
  ## Status
6
 
7
  - [ ] Northflank notebook artifacts saved
8
  - [ ] Colab notebook saved
9
+ - [ ] trained-policy evidence saved
training/notebooks/README.md CHANGED
@@ -11,7 +11,7 @@ Recommended split:
11
 
12
  - Northflank notebook: main compute workspace on the team H100
13
  - Colab notebook: thin public artifact required by the hackathon
14
- - trained model: optional; if training evidence is weak, the required Colab notebook can stay eval-first
15
 
16
  ## Status
17
 
@@ -41,4 +41,4 @@ Runnable repo path:
41
  - note: `training/notebooks/NORTHFLANK_SMOKE_NOTE.md`
42
  - latest passing artifact example: `/home/jovyan/fusion-design-lab/smoke/northflank_smoke_20260308T023646Z.json`
43
 
44
- The notebooks are supporting evidence for the environment, not the primary product. The required artifact is the notebook itself, not a trained model checkpoint.
 
11
 
12
  - Northflank notebook: main compute workspace on the team H100
13
  - Colab notebook: thin public artifact required by the hackathon
14
+ - trained model: required; the Colab notebook should include a trained-policy demonstration even if performance is modest
15
 
16
  ## Status
17
 
 
41
  - note: `training/notebooks/NORTHFLANK_SMOKE_NOTE.md`
42
  - latest passing artifact example: `/home/jovyan/fusion-design-lab/smoke/northflank_smoke_20260308T023646Z.json`
43
 
44
+ The notebooks are supporting evidence for the environment, not the primary product. The required artifact is the notebook plus trained-policy evidence; a standalone checkpoint file is optional only if the notebook can still demonstrate the trained behavior.