Commit History

Add model card with training details
5473b25
verified

thomasjhuang committed on

RLOO checkpoint at optimizer step 150 - Fixed prompt format, temp=0.1, lr=3e-6
8c36a57
verified

thomasjhuang committed on

RLOO checkpoint at optimizer step 150 - Fixed prompt format, temp=0.1, lr=3e-6
85219cc
verified

thomasjhuang committed on

initial commit
649e82b
verified

thomasjhuang committed on