rdxtremity's picture
Stage 2 fine-tuning on human search feedback (CosineDistance TripletLoss, lr=1e-5) — 2026-05-19
81013d1 verified