# deprecated/ablations_t0.95/ + deprecated/ablations_t0.95_source/

These were the canonical ablation evaluations until 2026-05-05, when the user
re-ran all 6 ablation cells at the **sweep-optimal RL temperature T=1.15**
(rather than T=0.95). The T=1.15 results are now canonical at
`evaluation/eight_prompt/ablations/`.

## What's archived here

- `ablations_t0.95/`: server-side copy of the T=0.95 cell contents that previously
  occupied `evaluation/eight_prompt/ablations/` in this bucket (mostly bucket-staged
  copies of the `McClain/plasmidgpt-rl-*` model evaluations at T=0.95).
- `ablations_t0.95_source/`: the local strict-QC re-run of all 6 ablations at T=0.95
  performed on g6-big (`/opt/dlami/nvme/strict_qc_ablations/`). Each cell holds
  `outputs.csv`, `qc/` artifacts, and `metadata.json`. The manifest at
  `ablations_t0.95_source/manifest.json` carries seed, sha256, and pass counts.

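
Because the manifest records a sha256, an archived cell can be checked for corruption after copying. The following is a minimal sketch only: the manifest layout (`{cell_name: {"sha256": ...}}`) and the assumption that the hash covers each cell's `outputs.csv` are hypothetical and not confirmed by this README, so adjust the key names to the real `manifest.json`.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex sha256 of a file, read in chunks to bound memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_cells(root, manifest):
    """Return the names of cells whose outputs.csv hash differs from the manifest.

    ASSUMPTION: `manifest` maps cell name -> {"sha256": <hex digest>} and the
    digest refers to <root>/<cell>/outputs.csv. The real schema may differ.
    """
    mismatches = []
    for cell, entry in manifest.items():
        actual = sha256_of(Path(root) / cell / "outputs.csv")
        if actual != entry["sha256"]:
            mismatches.append(cell)
    return mismatches
```

With a manifest loaded via `json.load`, `verify_cells("ablations_t0.95_source", manifest)` would return an empty list when every cell matches under these assumptions.
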
## Numerical comparison

| Ablation | T=0.95 | T=1.15 (canonical) |
|---|---:|---:|
| full_reward | 66.88% | 78.35% |
| no_repeat_penalty | 72.17% | 75.15% |
| no_length_prior | 71.38% | 72.15% |
| no_cassette_bonus | 19.80% | 44.52% |
| length_only | 34.73% | 37.90% |
| cds_only | 2.40% | 1.73% |

Cassette-bonus removal remains the largest single-component drop relative to
`full_reward` in both settings (47.1pp at T=0.95, 33.8pp at T=1.15). All
ablations except cds_only improve at the higher temperature, consistent with
the rejection-sampling sweep showing T=1.15 as the GRPO peak.
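
The percentage-point figures quoted above can be re-derived from the table. This short sketch hard-codes the table values; the helper names `drop_vs_full` and `improves_with_temperature` are illustrative and not part of the evaluation code.

```python
# Values (%) copied from the comparison table, as (T=0.95, T=1.15).
scores = {
    "full_reward": (66.88, 78.35),
    "no_repeat_penalty": (72.17, 75.15),
    "no_length_prior": (71.38, 72.15),
    "no_cassette_bonus": (19.80, 44.52),
    "length_only": (34.73, 37.90),
    "cds_only": (2.40, 1.73),
}


def drop_vs_full(ablation):
    """Percentage-point drop relative to full_reward at each temperature."""
    full_low, full_high = scores["full_reward"]
    low, high = scores[ablation]
    return round(full_low - low, 2), round(full_high - high, 2)


def improves_with_temperature(ablation):
    """True if the T=1.15 value is higher than the T=0.95 value."""
    low, high = scores[ablation]
    return high > low


for name in scores:
    if name == "full_reward":
        continue
    d_low, d_high = drop_vs_full(name)
    print(f"{name:18s} drop vs full: {d_low:+.2f}pp (T=0.95), {d_high:+.2f}pp (T=1.15)")

print("no gain at T=1.15:", [n for n in scores if not improves_with_temperature(n)])
```

A negative drop means the ablated cell scored above `full_reward` at that temperature, which is the case for `no_repeat_penalty` and `no_length_prior` at T=0.95.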