card: final 2026-04-20 retrain results (6/8 chars), note skipped puff+ic
Browse files
README.md
CHANGED
|
@@ -39,21 +39,24 @@ Retrained on the post-schema-drop (13 numeric cols), new-transforms
|
|
| 39 |
hitstun) basis. See `research-notes-2026-04-20.md` in the MIMIC repo
|
| 40 |
for methodology + results analysis.
|
| 41 |
|
| 42 |
-
| Character | Run | Train games | Val loss |
|
| 43 |
-
|---|---|---|---|---|
|
| 44 |
-
| **Fox** | `fox-20260420-baseline`
|
| 45 |
-
| **Falco** | `falco-20260420-baseline`
|
| 46 |
-
| **Marth** | `marth-20260420-baseline`
|
| 47 |
-
| **Sheik** | `sheik-20260420-baseline`
|
| 48 |
-
| Captain Falcon
|
| 49 |
-
| Luigi
|
| 50 |
-
|
| 51 |
-
|
| 52 |
-
|
| 53 |
-
|
| 54 |
-
val 0.6322)
|
| 55 |
-
|
| 56 |
-
|
|
|
|
|
|
|
|
|
|
| 57 |
|
| 58 |
## Repo layout
|
| 59 |
|
|
|
|
| 39 |
hitstun) basis. See `research-notes-2026-04-20.md` in the MIMIC repo
|
| 40 |
for methodology + results analysis.
|
| 41 |
|
| 42 |
+
| Character | Run | Train games | Val loss | Step |
|
| 43 |
+
|---|---|---|---|---|
|
| 44 |
+
| **Fox** | `fox-20260420-baseline` | 31,030 | 0.7144 | 32768 |
|
| 45 |
+
| **Falco** | `falco-20260420-baseline` | 20,882 | 0.7487 | 31392 |
|
| 46 |
+
| **Marth** | `marth-20260420-baseline` | 11,759 | 0.6664 | 31065 |
|
| 47 |
+
| **Sheik** | `sheik-20260420-baseline` | 51,751 | 0.6566 | 26160 |
|
| 48 |
+
| **Captain Falcon** | `cptfalcon-20260420-baseline` | 17,557 | 0.7368 | watchdog |
|
| 49 |
+
| **Luigi** | `luigi-20260420-baseline` | 2,290 | 0.7460 | watchdog |
|
| 50 |
+
|
| 51 |
+
**Peach**, **Jigglypuff**, and **Ice Climbers** remain on pre-2026-04-20
|
| 52 |
+
schemas:
|
| 53 |
+
|
| 54 |
+
- `peach-20260420-baseline` (val 0.6322) was trained on the 22-col
|
| 55 |
+
schema before the schema drop β loadable via its pickled config.
|
| 56 |
+
- `puff` and `ice_climbers` missed the 2026-04-20 retrain cycle due
|
| 57 |
+
to a download-script bug; their existing HF checkpoints are on the
|
| 58 |
+
old schema. **These two are incompatible with the current 13-col
|
| 59 |
+
inference code path.** Will be retrained in a follow-on cycle.
|
| 60 |
|
| 61 |
## Repo layout
|
| 62 |
|