Commit 3f7be89
Parent(s): 5aace7c
docs: mark northflank smoke pass
README.md
CHANGED
|
@@ -35,10 +35,10 @@ Implementation status:
|
|
| 35 |
- [x] Re-run the baseline comparison on the `constellaration`-backed branch state
|
| 36 |
- [x] Replace the synthetic evaluator with `constellaration`
|
| 37 |
- [x] Add a runnable Northflank smoke workflow and note
|
|
|
|
| 38 |
- [ ] Add tracked `P1` fixtures under `server/data/p1/`
|
| 39 |
- [ ] Run manual playtesting and record the first reward pathology
|
| 40 |
- [ ] Refresh the heuristic baseline for the real verifier path
|
| 41 |
-
- [ ] Pass the Northflank smoke test on the H100 workspace
|
| 42 |
- [ ] Deploy the real environment to HF Space
|
| 43 |
|
| 44 |
## Known Gaps
|
|
@@ -107,10 +107,7 @@ uv sync --extra notebooks
|
|
| 107 |
1. Add tracked `P1` fixtures under `server/data/p1`.
|
| 108 |
2. Run manual playtest episodes and record the first real reward pathology, if any.
|
| 109 |
3. Refresh the heuristic baseline using manual playtest evidence, then save one comparison trace.
|
| 110 |
-
4.
|
| 111 |
-
- import `constellaration`
|
| 112 |
-
- run one rotating-ellipse generation plus one low-fidelity verifier call
|
| 113 |
-
- write an artifact to persistent storage
|
| 114 |
5. Deploy the environment to HF Space.
|
| 115 |
6. Add the Colab notebook under `training/notebooks`.
|
| 116 |
|
|
|
|
| 35 |
- [x] Re-run the baseline comparison on the `constellaration`-backed branch state
|
| 36 |
- [x] Replace the synthetic evaluator with `constellaration`
|
| 37 |
- [x] Add a runnable Northflank smoke workflow and note
|
| 38 |
+
- [x] Pass the Northflank smoke test on the H100 workspace
|
| 39 |
- [ ] Add tracked `P1` fixtures under `server/data/p1/`
|
| 40 |
- [ ] Run manual playtesting and record the first reward pathology
|
| 41 |
- [ ] Refresh the heuristic baseline for the real verifier path
|
|
|
|
| 42 |
- [ ] Deploy the real environment to HF Space
|
| 43 |
|
| 44 |
## Known Gaps
|
|
|
|
| 107 |
1. Add tracked `P1` fixtures under `server/data/p1`.
|
| 108 |
2. Run manual playtest episodes and record the first real reward pathology, if any.
|
| 109 |
3. Refresh the heuristic baseline using manual playtest evidence, then save one comparison trace.
|
| 110 |
+
4. Use the passing Northflank H100 setup to produce remote traces and comparisons from the real verifier path.
|
|
|
|
|
|
|
|
|
|
| 111 |
5. Deploy the environment to HF Space.
|
| 112 |
6. Add the Colab notebook under `training/notebooks`.
|
| 113 |
|
docs/FUSION_DELIVERABLES_MAP.md
CHANGED
|
@@ -12,6 +12,7 @@ Use this map to sequence execution, not to reopen already-locked task choices.
|
|
| 12 |
- [x] official `constellaration` verifier loop is wired
|
| 13 |
- [x] baseline comparison has been rerun on the real verifier path
|
| 14 |
- [x] Northflank smoke workflow and note are committed
|
|
|
|
| 15 |
- [ ] tracked fixtures are checked in
|
| 16 |
- [ ] manual playtest evidence exists
|
| 17 |
- [ ] heuristic baseline has been refreshed for the real verifier path
|
|
@@ -101,12 +102,12 @@ flowchart LR
|
|
| 101 |
|
| 102 |
## Priority Order
|
| 103 |
|
|
|
|
|
|
|
| 104 |
1. Add tracked fixtures and run fixture sanity checks.
|
| 105 |
2. Manual-playtest the environment and record the first real pathology, if any.
|
| 106 |
3. Refresh the heuristic baseline from that evidence.
|
| 107 |
-
4.
|
| 108 |
-
5.
|
| 109 |
-
6.
|
| 110 |
-
7.
|
| 111 |
-
8. Record the demo around environment clarity, verifier fidelity, reward shaping, and one stable trajectory.
|
| 112 |
-
9. Polish the repo only after the artifacts are real.
|
|
|
|
| 12 |
- [x] official `constellaration` verifier loop is wired
|
| 13 |
- [x] baseline comparison has been rerun on the real verifier path
|
| 14 |
- [x] Northflank smoke workflow and note are committed
|
| 15 |
+
- [x] Northflank smoke test has passed on the team H100
|
| 16 |
- [ ] tracked fixtures are checked in
|
| 17 |
- [ ] manual playtest evidence exists
|
| 18 |
- [ ] heuristic baseline has been refreshed for the real verifier path
|
|
|
|
| 102 |
|
| 103 |
## Priority Order
|
| 104 |
|
| 105 |
+
Northflank compute bring-up and smoke validation are complete.
|
| 106 |
+
|
| 107 |
1. Add tracked fixtures and run fixture sanity checks.
|
| 108 |
2. Manual-playtest the environment and record the first real pathology, if any.
|
| 109 |
3. Refresh the heuristic baseline from that evidence.
|
| 110 |
+
4. Make one stable OpenEnv `P1` task work remotely with clear, reproducible rules.
|
| 111 |
+
5. Use the notebook to show traces and comparisons; include training only if it adds signal.
|
| 112 |
+
6. Record the demo around environment clarity, verifier fidelity, reward shaping, and one stable trajectory.
|
| 113 |
+
7. Polish the repo only after the artifacts are real.
|
|
|
|
|
|
docs/FUSION_DESIGN_LAB_PLAN_V2.md
CHANGED
|
@@ -13,6 +13,7 @@
|
|
| 13 |
- [x] post-terminal `step()` guard is in place
|
| 14 |
- [x] baseline comparison has been rerun on the real verifier path
|
| 15 |
- [x] Northflank smoke workflow and note are committed
|
|
|
|
| 16 |
- [ ] tracked `P1` fixtures are added
|
| 17 |
- [ ] manual playtest evidence is recorded
|
| 18 |
- [ ] heuristic baseline is refreshed for the real verifier path
|
|
@@ -479,6 +480,7 @@ The repo should make the environment easy to understand:
|
|
| 479 |
- notebook starts on the team H100
|
| 480 |
- persistent storage mount is usable
|
| 481 |
- smoke test artifact is written successfully
|
|
|
|
| 482 |
|
| 483 |
### Gate 1: Environment Contract Locked
|
| 484 |
|
|
|
|
| 13 |
- [x] post-terminal `step()` guard is in place
|
| 14 |
- [x] baseline comparison has been rerun on the real verifier path
|
| 15 |
- [x] Northflank smoke workflow and note are committed
|
| 16 |
+
- [x] Northflank smoke test has passed on the team H100
|
| 17 |
- [ ] tracked `P1` fixtures are added
|
| 18 |
- [ ] manual playtest evidence is recorded
|
| 19 |
- [ ] heuristic baseline is refreshed for the real verifier path
|
|
|
|
| 480 |
- notebook starts on the team H100
|
| 481 |
- persistent storage mount is usable
|
| 482 |
- smoke test artifact is written successfully
|
| 483 |
+
- latest artifact example: `/home/jovyan/fusion-design-lab/smoke/northflank_smoke_20260308T023646Z.json`
|
| 484 |
|
| 485 |
### Gate 1: Environment Contract Locked
|
| 486 |
|
docs/FUSION_NEXT_12_HOURS_CHECKLIST.md
CHANGED
|
@@ -15,6 +15,7 @@ Do not expand scope beyond one stable task. Training is supporting evidence, not
|
|
| 15 |
- [x] replace the synthetic evaluator with `constellaration`
|
| 16 |
- [x] re-run baselines on the real verifier path
|
| 17 |
- [x] commit the Northflank smoke workflow and note
|
|
|
|
| 18 |
- [ ] add tracked fixtures and manual playtest evidence
|
| 19 |
- [ ] refresh the heuristic baseline after the real-verifier rerun
|
| 20 |
|
|
@@ -41,11 +42,12 @@ Carry these rules through the whole checklist:
|
|
| 41 |
|
| 42 |
1. Bring up the Northflank Jupyter Notebook with PyTorch on the team H100.
|
| 43 |
2. Attach persistent storage before relying on saved models, caches, or fixture downloads.
|
| 44 |
-
3.
|
| 45 |
- import `constellaration`
|
| 46 |
- generate one rotating-ellipse boundary
|
| 47 |
- run one low-fidelity verifier call
|
| 48 |
-
-
|
|
|
|
| 49 |
|
| 50 |
Exit condition: the notebook is not just open; the verifier path works and persistent storage is usable.
|
| 51 |
|
|
|
|
| 15 |
- [x] replace the synthetic evaluator with `constellaration`
|
| 16 |
- [x] re-run baselines on the real verifier path
|
| 17 |
- [x] commit the Northflank smoke workflow and note
|
| 18 |
+
- [x] pass the Northflank smoke test on the team H100
|
| 19 |
- [ ] add tracked fixtures and manual playtest evidence
|
| 20 |
- [ ] refresh the heuristic baseline after the real-verifier rerun
|
| 21 |
|
|
|
|
| 42 |
|
| 43 |
1. Bring up the Northflank Jupyter Notebook with PyTorch on the team H100.
|
| 44 |
2. Attach persistent storage before relying on saved models, caches, or fixture downloads.
|
| 45 |
+
3. Preserve the concrete smoke-test evidence:
|
| 46 |
- import `constellaration`
|
| 47 |
- generate one rotating-ellipse boundary
|
| 48 |
- run one low-fidelity verifier call
|
| 49 |
+
- keep one artifact in persistent storage
|
| 50 |
+
- current artifact: `/home/jovyan/fusion-design-lab/smoke/northflank_smoke_20260308T023646Z.json`
|
| 51 |
|
| 52 |
Exit condition: the notebook is not just open; the verifier path works and persistent storage is usable.
|
| 53 |
|
training/notebooks/README.md
CHANGED
|
@@ -16,6 +16,7 @@ Recommended split:
|
|
| 16 |
|
| 17 |
- [x] Northflank smoke notebook note saved
|
| 18 |
- [x] runnable Northflank smoke script saved
|
|
|
|
| 19 |
- [ ] manual-playtest notebook or trace notebook saved
|
| 20 |
- [ ] thin public Colab notebook saved
|
| 21 |
|
|
@@ -37,5 +38,6 @@ Runnable repo path:
|
|
| 37 |
|
| 38 |
- `uv run python training/notebooks/northflank_smoke.py --output-dir <mounted-persistent-storage-path>`
|
| 39 |
- note: `training/notebooks/NORTHFLANK_SMOKE_NOTE.md`
|
|
|
|
| 40 |
|
| 41 |
The notebooks are supporting evidence for the environment, not the primary product.
|
|
|
|
| 16 |
|
| 17 |
- [x] Northflank smoke notebook note saved
|
| 18 |
- [x] runnable Northflank smoke script saved
|
| 19 |
+
- [x] Northflank smoke test passed on the team H100
|
| 20 |
- [ ] manual-playtest notebook or trace notebook saved
|
| 21 |
- [ ] thin public Colab notebook saved
|
| 22 |
|
|
|
|
| 38 |
|
| 39 |
- `uv run python training/notebooks/northflank_smoke.py --output-dir <mounted-persistent-storage-path>`
|
| 40 |
- note: `training/notebooks/NORTHFLANK_SMOKE_NOTE.md`
|
| 41 |
+
- latest passing artifact example: `/home/jovyan/fusion-design-lab/smoke/northflank_smoke_20260308T023646Z.json`
|
| 42 |
|
| 43 |
The notebooks are supporting evidence for the environment, not the primary product.
|
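The artifact path recorded in this commit suggests a `northflank_smoke_<UTC timestamp>.json` naming scheme. The following is a minimal sketch for picking the most recent smoke artifact from the persistent-storage directory, assuming that scheme holds for every run (only one file name appears in the commit). The helper names `parse_artifact_timestamp`, `latest_smoke_artifact`, and `load_artifact` are hypothetical and not part of the repo, and the sketch makes no assumption about which fields the JSON contains.

```python
from __future__ import annotations

import json
import re
from datetime import datetime, timezone
from pathlib import Path

# Assumed naming pattern, inferred from the single artifact path quoted in the docs.
ARTIFACT_PATTERN = re.compile(r"^northflank_smoke_(\d{8}T\d{6}Z)\.json$")


def parse_artifact_timestamp(name: str) -> datetime | None:
    """Return the UTC timestamp embedded in an artifact file name, or None."""
    match = ARTIFACT_PATTERN.match(name)
    if match is None:
        return None
    stamp = datetime.strptime(match.group(1), "%Y%m%dT%H%M%SZ")
    return stamp.replace(tzinfo=timezone.utc)


def latest_smoke_artifact(smoke_dir: Path) -> Path | None:
    """Pick the newest artifact in smoke_dir by its file-name timestamp."""
    candidates = []
    for path in smoke_dir.glob("northflank_smoke_*.json"):
        stamp = parse_artifact_timestamp(path.name)
        if stamp is not None:
            candidates.append((stamp, path))
    if not candidates:
        return None
    return max(candidates, key=lambda item: item[0])[1]


def load_artifact(path: Path) -> object:
    """Parse the artifact as JSON; the schema is deliberately not assumed."""
    return json.loads(path.read_text(encoding="utf-8"))
```

A caller would pass the mounted smoke directory, for example `latest_smoke_artifact(Path("/home/jovyan/fusion-design-lab/smoke"))`, and treat a `None` result as "no passing artifact recorded yet".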