docs: finalize R4 documentation β Dueling 84% Holdout, full ablation record acbd4c5 Lee93whut commited on 4 days ago
feat(round3): buffer=80k + target_freq=1500 + shaping=0.5 β 74% holdout, SPL=0.735 c1b9ba8 Lee93whut commited on 4 days ago
feat(round2): extended training, Double DQN 64% holdout, SPL=0.633 ff1b1b8 Lee93whut commited on 4 days ago
feat(round1): baseline DQN variants β Vanilla/Double/Dueling/Double+Dueling bf17b0c Lee93whut commited on 4 days ago