th135/llama-1B-20BT-weightdecay1.0-seed42-simplescaling-sweep-lr6.0e-4-bs32-wdft1.0 1B • Updated Feb 12 • 1
th135/llama-1B-20BT-weightdecay1.0-seed42-simplescaling-sweep-lr6.0e-4-bs32-wdft0.1 1B • Updated Feb 12 • 1
th135/llama-1B-20BT-weightdecay1.0-seed42-simplescaling-sweep-lr6.0e-4-bs32-wdft0.0 1B • Updated Feb 12 • 1
th135/llama-1B-20BT-weightdecay1.0-seed42-simplescaling-sweep-lr6.0e-4-bs16-wdft1.0 1B • Updated Feb 12 • 1
th135/llama-1B-20BT-weightdecay1.0-seed42-simplescaling-sweep-lr6.0e-4-bs16-wdft0.1 1B • Updated Feb 12 • 1