AntiAtropos / training /train.py

Commit History

show script
85b91e2

div18 commited on

final commit
aaad9f1

div18 commited on

reward etc tuning
67810ba

div18 commited on

fix logging
746de52

div18 commited on

cimmits
c2815cb

div18 commited on

env changes
70cdeae

div18 commited on

training changes
7dbb622

div18 commited on

entropy spread
d23c9c4

div18 commited on

better entropy
c56d720

div18 commited on

grad messup
d6b3052

div18 commited on

softer logits
eae0446

div18 commited on

OOM
0f6141d

div18 commited on

OOM?
a1a33c8

div18 commited on

LORA
8a3d2d7

div18 commited on

OOM
381091a

div18 commited on

Ms
b1e6564

div18 commited on

mm
836c3a5

div18 commited on

smaller logits
1b9be85

div18 commited on

Fix OOM
d2f9b0c

div18 commited on

fix backprop
871c1ae

div18 commited on

changes
d41d25d

div18 commited on

edits
619e74d

div18 commited on

code
e890160

div18 commited on