CryptoYogi's picture
SFT v4.2 adapter: vanilla Qwen3-0.6B base (DAPT skipped), r=8, q_proj+v_proj, lr=5e-05, 2 epochs, 13083 samples
d81f79e verified