evoxtral-rl / training_args.bin

Commit History

RL adapter (RAFT): lr=5e-05, ep=1
1260e39
verified

YongkangZOU commited on