two_agent_rdpo_iter_3 / config.json

Commit History

RDPO-7b-beta0.01-eta0.001
94d7011
verified

YYYYYYibo commited on

Training in progress, step 100
8c90a58
verified

YYYYYYibo commited on