docs: clean up R3/R4 record and consolidate technical narrative 92423f0 Lee93whut Lee93whut commited on 3 days ago
refactor(model): update architecture docs and set dueling as default algorithm 34ad2cc Lee93whut commited on 4 days ago
docs(round4): complete experiment record — A1/A2/A3 full EVAL data and conclusions a91b194 Lee93whut commited on 4 days ago
feat(round1): baseline DQN variants — Vanilla/Double/Dueling/Double+Dueling bf17b0c Lee93whut commited on 4 days ago