Buckets:

McClain's picture
|
download
raw
1.09 kB
# rejection_sampling_v2
Rerun of the v1 baselines/rejection_sampling/ protocol at per-model
optimal temperature (from this session's temperature sweep). Same
two prompts (ATG + cfg.default_query GFP cassette), same 10K samples,
same in-process scorer for the reward column, plus strict QC mirroring
analysis2 thresholds (ORI ≥99% identity, AMR ≥100% identity, no ≥50bp
direct repeats).
Generated: 2026-04-30T19:08:34.790388Z
## Per-cell results
| Cell | Model | T | n | strict-QC pass rate | sha256(outputs.csv) |
|---|---|---:|---:|---:|---|
| Base_t1 | UCL-CSSB/PlasmidGPT | 1.0 | 10000 | 5.82% | `1a326eef8578e653…` |
| GRPO_t1.15 | UCL-CSSB/PlasmidGPT-GRPO | 1.15 | 10000 | 19.55% | `8a8738b1485a2043…` |
| SFT_t1 | UCL-CSSB/PlasmidGPT-SFT | 1.0 | 10000 | 5.75% | `fa586c04962e4552…` |
## v1 cross-check
All v2 outputs.csv SHAs were verified to differ from the v1
`baselines/rejection_sampling/{Base,SFT,GRPO}/outputs.csv` SHAs:
- Base: `363e89d716c87e8a…`
- SFT: `0921fb93c60b2fac…`
- GRPO: `404d3cb55215ad70…`
Generated by `scripts/launch_rejection_v2.sh`.

Xet Storage Details

Size:
1.09 kB
·
Xet hash:
b232552ab19b49e49a33938deaf59bf8a5c042ef174977e6f565aed389706992

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.