sweep ft hyperparameters
- th135/olmo-1B-30BT-weightdecay0.1-metamathqa-sweep-lr1.0e-5-bs8-wdft0.0 (1B)
- th135/olmo-1B-30BT-weightdecay0.1-metamathqa-sweep-lr1.0e-5-bs8-wdft0.1 (1B)
- th135/olmo-1B-30BT-weightdecay0.1-metamathqa-sweep-lr1.0e-5-bs8-wdft1.0 (1B)
- th135/olmo-1B-30BT-weightdecay0.1-metamathqa-sweep-lr1.0e-5-bs16-wdft0.1 (1B)
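
The repository names appear to encode the swept fine-tuning settings in their suffix. A minimal sketch for pulling those values out of the names follows. It assumes, without confirmation from the source, that `lr` is the learning rate, `bs` the batch size, and `wdft` the weight decay used during fine-tuning; `parse_sweep_name` is a hypothetical helper, not part of any library.

```python
import re

# Repository names copied from the list above.
REPOS = [
    "th135/olmo-1B-30BT-weightdecay0.1-metamathqa-sweep-lr1.0e-5-bs8-wdft0.0",
    "th135/olmo-1B-30BT-weightdecay0.1-metamathqa-sweep-lr1.0e-5-bs8-wdft0.1",
    "th135/olmo-1B-30BT-weightdecay0.1-metamathqa-sweep-lr1.0e-5-bs8-wdft1.0",
    "th135/olmo-1B-30BT-weightdecay0.1-metamathqa-sweep-lr1.0e-5-bs16-wdft0.1",
]

# Assumed naming scheme for the suffix: -lr<value>-bs<value>-wdft<value>
# (the meaning of each field is an interpretation, not stated in the source).
SUFFIX = re.compile(
    r"-lr(?P<lr>[0-9.]+e-?[0-9]+)-bs(?P<bs>\d+)-wdft(?P<wdft>[0-9.]+)$"
)


def parse_sweep_name(repo: str) -> dict:
    """Extract the swept values from one repository name (hypothetical helper)."""
    match = SUFFIX.search(repo)
    if match is None:
        raise ValueError(f"name does not follow the assumed scheme: {repo}")
    return {
        "lr": float(match["lr"]),
        "bs": int(match["bs"]),
        "wdft": float(match["wdft"]),
    }


if __name__ == "__main__":
    # Print one line per run so the sweep grid is easy to read.
    for repo in REPOS:
        print(repo.split("/")[-1], parse_sweep_name(repo))
```

Read this way, the four runs share the same `lr` value, three of them vary only `wdft` at `bs8`, and the fourth changes `bs` to 16.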