Refresh code/ with latest BLT-Reasoner sources (post-campaign) bc7101b verified LauraGG commited on 8 days ago
HANDOFF v3: 77.5% via bottleneck-as-regularizer at inference 572be28 verified LauraGG commited on 8 days ago
BREAKTHROUGH: same GRPO ckpt eval without bottleneck = 77.5% GSM8K AR (vs 52.5% with bottleneck) e4f8490 verified LauraGG commited on 8 days ago
EXP 1+3: AR n=200 K=16 ablation result (51% normal, delta_zero=50.5pp) bbfa759 verified LauraGG commited on 9 days ago
EXP 1+3: TF n=200 K=16 ablation result (delta_zero=23pp) d73b8a1 verified LauraGG commited on 9 days ago
EXP: block_z_to_x=True (leak-closure principled test) fa673e0 verified LauraGG commited on 10 days ago
Add teacher-forced ablation on pilot7b final (baseline for block_z_to_x exp) 70f5507 verified LauraGG commited on 10 days ago
EXP: 7B with block_z_to_x=True (leak-closure principled test) 833435e verified LauraGG commited on 10 days ago
HANDOFF: add GRPO Phase C null result + revised next steps 6e77ac7 verified LauraGG commited on 10 days ago
Add publication-grade HANDOFF: 4-way ablation, mechanism, lineage 3e238b7 verified LauraGG commited on 10 days ago
Add control: no_bottleneck (final ckpt + n=100 ablation) 2b3c49b verified LauraGG commited on 10 days ago
Add control: no_infonce (final ckpt + n=100 ablation) 1f59760 verified LauraGG commited on 10 days ago
BLT-Reasoner pilot 1: ckpts + code + logs + ablations 9477b5c verified LauraGG commited on 11 days ago