wilyub
OT-Agents 8B SFT - Qwen3-8B full FT, 5 epochs on 10k sharegpt (4 DCAgent mixes), train_loss 0.1973
d4332b0 verified