Commit ·
27d58b3
1
Parent(s): 714d655
docs: require trained policy evidence
Browse files- README.md +2 -2
- docs/FUSION_DESIGN_LAB_PLAN_V2.md +5 -5
- training/README.md +2 -2
- training/notebooks/README.md +2 -2
README.md
CHANGED
|
@@ -10,9 +10,9 @@ The repo is organized around one clear submission thesis:
|
|
| 10 |
- explicit constraints and feasibility semantics
|
| 11 |
- a reward function that is iteratively improved through observed behavior
|
| 12 |
|
| 13 |
-
|
| 14 |
|
| 15 |
-
A trained model is
|
| 16 |
|
| 17 |
## Current Status
|
| 18 |
|
|
|
|
| 10 |
- explicit constraints and feasibility semantics
|
| 11 |
- a reward function that is iteratively improved through observed behavior
|
| 12 |
|
| 13 |
+
The environment is the product. A trained policy is still required as evidence that agents can learn and use the environment rather than only manual or scripted play.
|
| 14 |
|
| 15 |
+
A trained model is required for this repo's submission story. A public Colab notebook artifact is also required by the hackathon, and that notebook should include a trained-policy demonstration rather than stay purely eval-first.
|
| 16 |
|
| 17 |
## Current Status
|
| 18 |
|
docs/FUSION_DESIGN_LAB_PLAN_V2.md
CHANGED
|
@@ -7,7 +7,7 @@
|
|
| 7 |
|
| 8 |
## 1. Submission Thesis
|
| 9 |
|
| 10 |
-
Fusion Design Lab is not
|
| 11 |
|
| 12 |
It is a clear, reproducible environment for one constrained scientific design task:
|
| 13 |
|
|
@@ -17,7 +17,7 @@ It is a clear, reproducible environment for one constrained scientific design ta
|
|
| 17 |
- explicit constraints and failure semantics
|
| 18 |
- reward logic that can be explained and iterated
|
| 19 |
|
| 20 |
-
|
| 21 |
|
| 22 |
## 2. Current State
|
| 23 |
|
|
@@ -97,7 +97,7 @@ Compute surfaces:
|
|
| 97 |
|
| 98 |
- Northflank is the main compute workspace for verifier-heavy work
|
| 99 |
- HF Space is the hosted environment surface
|
| 100 |
-
- Colab is the required public artifact and
|
| 101 |
|
| 102 |
Evidence order:
|
| 103 |
|
|
@@ -172,8 +172,8 @@ Gate 7: submission artifacts exist
|
|
| 172 |
|
| 173 |
If training evidence is weak:
|
| 174 |
|
| 175 |
-
- keep
|
| 176 |
-
- ship
|
| 177 |
|
| 178 |
If HF Space deployment is delayed:
|
| 179 |
|
|
|
|
| 7 |
|
| 8 |
## 1. Submission Thesis
|
| 9 |
|
| 10 |
+
Fusion Design Lab is not only a "trained model for fusion" submission.
|
| 11 |
|
| 12 |
It is a clear, reproducible environment for one constrained scientific design task:
|
| 13 |
|
|
|
|
| 17 |
- explicit constraints and failure semantics
|
| 18 |
- reward logic that can be explained and iterated
|
| 19 |
|
| 20 |
+
The environment is the product. A trained policy is required supporting evidence because it demonstrates that the environment is learnable in practice rather than only manually playable.
|
| 21 |
|
| 22 |
## 2. Current State
|
| 23 |
|
|
|
|
| 97 |
|
| 98 |
- Northflank is the main compute workspace for verifier-heavy work
|
| 99 |
- HF Space is the hosted environment surface
|
| 100 |
+
- Colab is the required public artifact and should show trained-policy behavior against the live environment
|
| 101 |
|
| 102 |
Evidence order:
|
| 103 |
|
|
|
|
| 172 |
|
| 173 |
If training evidence is weak:
|
| 174 |
|
| 175 |
+
- keep claims conservative about policy quality
|
| 176 |
+
- still ship a trained-policy demonstration and document its limitations plainly
|
| 177 |
|
| 178 |
If HF Space deployment is delayed:
|
| 179 |
|
training/README.md
CHANGED
|
@@ -1,9 +1,9 @@
|
|
| 1 |
Training and evaluation notebooks belong here.
|
| 2 |
|
| 3 |
-
This repository treats notebooks as supporting evidence for the environment, not the primary product.
|
| 4 |
|
| 5 |
## Status
|
| 6 |
|
| 7 |
- [ ] Northflank notebook artifacts saved
|
| 8 |
- [ ] Colab notebook saved
|
| 9 |
-
- [ ]
|
|
|
|
| 1 |
Training and evaluation notebooks belong here.
|
| 2 |
|
| 3 |
+
This repository treats notebooks and trained-policy runs as supporting evidence for the environment, not the primary product.
|
| 4 |
|
| 5 |
## Status
|
| 6 |
|
| 7 |
- [ ] Northflank notebook artifacts saved
|
| 8 |
- [ ] Colab notebook saved
|
| 9 |
+
- [ ] trained-policy evidence saved
|
training/notebooks/README.md
CHANGED
|
@@ -11,7 +11,7 @@ Recommended split:
|
|
| 11 |
|
| 12 |
- Northflank notebook: main compute workspace on the team H100
|
| 13 |
- Colab notebook: thin public artifact required by the hackathon
|
| 14 |
-
- trained model:
|
| 15 |
|
| 16 |
## Status
|
| 17 |
|
|
@@ -41,4 +41,4 @@ Runnable repo path:
|
|
| 41 |
- note: `training/notebooks/NORTHFLANK_SMOKE_NOTE.md`
|
| 42 |
- latest passing artifact example: `/home/jovyan/fusion-design-lab/smoke/northflank_smoke_20260308T023646Z.json`
|
| 43 |
|
| 44 |
-
The notebooks are supporting evidence for the environment, not the primary product. The required artifact is the notebook
|
|
|
|
| 11 |
|
| 12 |
- Northflank notebook: main compute workspace on the team H100
|
| 13 |
- Colab notebook: thin public artifact required by the hackathon
|
| 14 |
+
- trained model: required; the Colab notebook should include a trained-policy demonstration even if performance is modest
|
| 15 |
|
| 16 |
## Status
|
| 17 |
|
|
|
|
| 41 |
- note: `training/notebooks/NORTHFLANK_SMOKE_NOTE.md`
|
| 42 |
- latest passing artifact example: `/home/jovyan/fusion-design-lab/smoke/northflank_smoke_20260308T023646Z.json`
|
| 43 |
|
| 44 |
+
The notebooks are supporting evidence for the environment, not the primary product. The required artifact is the notebook plus trained-policy evidence; a standalone checkpoint file is optional only if the notebook can still demonstrate the trained behavior.
|