CreativeEngineer commited on
Commit
3f7be89
·
1 Parent(s): 5aace7c

docs: mark northflank smoke pass

Browse files
README.md CHANGED
@@ -35,10 +35,10 @@ Implementation status:
35
  - [x] Re-run the baseline comparison on the `constellaration`-backed branch state
36
  - [x] Replace the synthetic evaluator with `constellaration`
37
  - [x] Add a runnable Northflank smoke workflow and note
 
38
  - [ ] Add tracked `P1` fixtures under `server/data/p1/`
39
  - [ ] Run manual playtesting and record the first reward pathology
40
  - [ ] Refresh the heuristic baseline for the real verifier path
41
- - [ ] Pass the Northflank smoke test on the H100 workspace
42
  - [ ] Deploy the real environment to HF Space
43
 
44
  ## Known Gaps
@@ -107,10 +107,7 @@ uv sync --extra notebooks
107
  1. Add tracked `P1` fixtures under `server/data/p1`.
108
  2. Run manual playtest episodes and record the first real reward pathology, if any.
109
  3. Refresh the heuristic baseline using manual playtest evidence, then save one comparison trace.
110
- 4. Pass a Northflank smoke test:
111
- - import `constellaration`
112
- - run one rotating-ellipse generation plus one low-fidelity verifier call
113
- - write an artifact to persistent storage
114
  5. Deploy the environment to HF Space.
115
  6. Add the Colab notebook under `training/notebooks`.
116
 
 
35
  - [x] Re-run the baseline comparison on the `constellaration`-backed branch state
36
  - [x] Replace the synthetic evaluator with `constellaration`
37
  - [x] Add a runnable Northflank smoke workflow and note
38
+ - [x] Pass the Northflank smoke test on the H100 workspace
39
  - [ ] Add tracked `P1` fixtures under `server/data/p1/`
40
  - [ ] Run manual playtesting and record the first reward pathology
41
  - [ ] Refresh the heuristic baseline for the real verifier path
 
42
  - [ ] Deploy the real environment to HF Space
43
 
44
  ## Known Gaps
 
107
  1. Add tracked `P1` fixtures under `server/data/p1`.
108
  2. Run manual playtest episodes and record the first real reward pathology, if any.
109
  3. Refresh the heuristic baseline using manual playtest evidence, then save one comparison trace.
110
+ 4. Use the passing Northflank H100 setup to produce remote traces and comparisons from the real verifier path.
 
 
 
111
  5. Deploy the environment to HF Space.
112
  6. Add the Colab notebook under `training/notebooks`.
113
 
docs/FUSION_DELIVERABLES_MAP.md CHANGED
@@ -12,6 +12,7 @@ Use this map to sequence execution, not to reopen already-locked task choices.
12
  - [x] official `constellaration` verifier loop is wired
13
  - [x] baseline comparison has been rerun on the real verifier path
14
  - [x] Northflank smoke workflow and note are committed
 
15
  - [ ] tracked fixtures are checked in
16
  - [ ] manual playtest evidence exists
17
  - [ ] heuristic baseline has been refreshed for the real verifier path
@@ -101,12 +102,12 @@ flowchart LR
101
 
102
  ## Priority Order
103
 
 
 
104
  1. Add tracked fixtures and run fixture sanity checks.
105
  2. Manual-playtest the environment and record the first real pathology, if any.
106
  3. Refresh the heuristic baseline from that evidence.
107
- 4. Bring up the Northflank H100 workspace with persistent storage.
108
- 5. Pass the Northflank smoke test.
109
- 6. Make one stable OpenEnv `P1` task work remotely with clear, reproducible rules.
110
- 7. Use the notebook to show traces and comparisons; include training only if it adds signal.
111
- 8. Record the demo around environment clarity, verifier fidelity, reward shaping, and one stable trajectory.
112
- 9. Polish the repo only after the artifacts are real.
 
12
  - [x] official `constellaration` verifier loop is wired
13
  - [x] baseline comparison has been rerun on the real verifier path
14
  - [x] Northflank smoke workflow and note are committed
15
+ - [x] Northflank smoke test has passed on the team H100
16
  - [ ] tracked fixtures are checked in
17
  - [ ] manual playtest evidence exists
18
  - [ ] heuristic baseline has been refreshed for the real verifier path
 
102
 
103
  ## Priority Order
104
 
105
+ Northflank compute bring-up and smoke validation are complete.
106
+
107
  1. Add tracked fixtures and run fixture sanity checks.
108
  2. Manual-playtest the environment and record the first real pathology, if any.
109
  3. Refresh the heuristic baseline from that evidence.
110
+ 4. Make one stable OpenEnv `P1` task work remotely with clear, reproducible rules.
111
+ 5. Use the notebook to show traces and comparisons; include training only if it adds signal.
112
+ 6. Record the demo around environment clarity, verifier fidelity, reward shaping, and one stable trajectory.
113
+ 7. Polish the repo only after the artifacts are real.
 
 
docs/FUSION_DESIGN_LAB_PLAN_V2.md CHANGED
@@ -13,6 +13,7 @@
13
  - [x] post-terminal `step()` guard is in place
14
  - [x] baseline comparison has been rerun on the real verifier path
15
  - [x] Northflank smoke workflow and note are committed
 
16
  - [ ] tracked `P1` fixtures are added
17
  - [ ] manual playtest evidence is recorded
18
  - [ ] heuristic baseline is refreshed for the real verifier path
@@ -479,6 +480,7 @@ The repo should make the environment easy to understand:
479
  - notebook starts on the team H100
480
  - persistent storage mount is usable
481
  - smoke test artifact is written successfully
 
482
 
483
  ### Gate 1: Environment Contract Locked
484
 
 
13
  - [x] post-terminal `step()` guard is in place
14
  - [x] baseline comparison has been rerun on the real verifier path
15
  - [x] Northflank smoke workflow and note are committed
16
+ - [x] Northflank smoke test has passed on the team H100
17
  - [ ] tracked `P1` fixtures are added
18
  - [ ] manual playtest evidence is recorded
19
  - [ ] heuristic baseline is refreshed for the real verifier path
 
480
  - notebook starts on the team H100
481
  - persistent storage mount is usable
482
  - smoke test artifact is written successfully
483
+ - latest artifact example: `/home/jovyan/fusion-design-lab/smoke/northflank_smoke_20260308T023646Z.json`
484
 
485
  ### Gate 1: Environment Contract Locked
486
 
docs/FUSION_NEXT_12_HOURS_CHECKLIST.md CHANGED
@@ -15,6 +15,7 @@ Do not expand scope beyond one stable task. Training is supporting evidence, not
15
  - [x] replace the synthetic evaluator with `constellaration`
16
  - [x] re-run baselines on the real verifier path
17
  - [x] commit the Northflank smoke workflow and note
 
18
  - [ ] add tracked fixtures and manual playtest evidence
19
  - [ ] refresh the heuristic baseline after the real-verifier rerun
20
 
@@ -41,11 +42,12 @@ Carry these rules through the whole checklist:
41
 
42
  1. Bring up the Northflank Jupyter Notebook with PyTorch on the team H100.
43
  2. Attach persistent storage before relying on saved models, caches, or fixture downloads.
44
- 3. Pass a concrete smoke test:
45
  - import `constellaration`
46
  - generate one rotating-ellipse boundary
47
  - run one low-fidelity verifier call
48
- - write one artifact to persistent storage
 
49
 
50
  Exit condition: the notebook is not just open; the verifier path works and persistent storage is usable.
51
 
 
15
  - [x] replace the synthetic evaluator with `constellaration`
16
  - [x] re-run baselines on the real verifier path
17
  - [x] commit the Northflank smoke workflow and note
18
+ - [x] pass the Northflank smoke test on the team H100
19
  - [ ] add tracked fixtures and manual playtest evidence
20
  - [ ] refresh the heuristic baseline after the real-verifier rerun
21
 
 
42
 
43
  1. Bring up the Northflank Jupyter Notebook with PyTorch on the team H100.
44
  2. Attach persistent storage before relying on saved models, caches, or fixture downloads.
45
+ 3. Preserve the concrete smoke-test evidence:
46
  - import `constellaration`
47
  - generate one rotating-ellipse boundary
48
  - run one low-fidelity verifier call
49
+ - keep one artifact in persistent storage
50
+ - current artifact: `/home/jovyan/fusion-design-lab/smoke/northflank_smoke_20260308T023646Z.json`
51
 
52
  Exit condition: the notebook is not just open; the verifier path works and persistent storage is usable.
53
 
training/notebooks/README.md CHANGED
@@ -16,6 +16,7 @@ Recommended split:
16
 
17
  - [x] Northflank smoke notebook note saved
18
  - [x] runnable Northflank smoke script saved
 
19
  - [ ] manual-playtest notebook or trace notebook saved
20
  - [ ] thin public Colab notebook saved
21
 
@@ -37,5 +38,6 @@ Runnable repo path:
37
 
38
  - `uv run python training/notebooks/northflank_smoke.py --output-dir <mounted-persistent-storage-path>`
39
  - note: `training/notebooks/NORTHFLANK_SMOKE_NOTE.md`
 
40
 
41
  The notebooks are supporting evidence for the environment, not the primary product.
 
16
 
17
  - [x] Northflank smoke notebook note saved
18
  - [x] runnable Northflank smoke script saved
19
+ - [x] Northflank smoke test passed on the team H100
20
  - [ ] manual-playtest notebook or trace notebook saved
21
  - [ ] thin public Colab notebook saved
22
 
 
38
 
39
  - `uv run python training/notebooks/northflank_smoke.py --output-dir <mounted-persistent-storage-path>`
40
  - note: `training/notebooks/NORTHFLANK_SMOKE_NOTE.md`
41
+ - latest passing artifact example: `/home/jovyan/fusion-design-lab/smoke/northflank_smoke_20260308T023646Z.json`
42
 
43
  The notebooks are supporting evidence for the environment, not the primary product.