Buckets:

UCL-CSSB
/

PlasmidRL-ICML

Files

xet

UCL-CSSB/PlasmidRL-ICML / rejection_sampling_v2 /direct /README.md

McClain

10 days ago

preview code

download

raw

1.09 kB

rejection_sampling_v2

Rerun of the v1 baselines/rejection_sampling/ protocol at per-model optimal temperature (from this session's temperature sweep). Same two prompts (ATG + cfg.default_query GFP cassette), same 10K samples, same in-process scorer for the reward column, plus strict QC mirroring analysis2 thresholds (ORI ≥99% identity, AMR ≥100% identity, no ≥50bp direct repeats).

Generated: 2026-04-30T19:08:34.790388Z

Per-cell results

Cell	Model	T	n	strict-QC pass rate	sha256(outputs.csv)
Base_t1	UCL-CSSB/PlasmidGPT	1.0	10000	5.82%	`1a326eef8578e653…`
GRPO_t1.15	UCL-CSSB/PlasmidGPT-GRPO	1.15	10000	19.55%	`8a8738b1485a2043…`
SFT_t1	UCL-CSSB/PlasmidGPT-SFT	1.0	10000	5.75%	`fa586c04962e4552…`

v1 cross-check

All v2 outputs.csv SHAs were verified to differ from the v1 baselines/rejection_sampling/{Base,SFT,GRPO}/outputs.csv SHAs:

Base: 363e89d716c87e8a…
SFT: 0921fb93c60b2fac…
GRPO: 404d3cb55215ad70…

Generated by scripts/launch_rejection_v2.sh.

Xet Storage Details

Size:: 1.09 kB
Xet hash:: b232552ab19b49e49a33938deaf59bf8a5c042ef174977e6f565aed389706992

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.