th135/llama-1B-20BT-weightdecay0.01-seed42-metamathqa-ftweightdecay0.01 1B • Updated Dec 27, 2025 • 1
th135/llama-1B-20BT-weightdecay0.001-seed42-metamathqa-ftweightdecay1.0 1B • Updated Dec 27, 2025 • 1
th135/llama-1B-20BT-weightdecay0.001-seed42-metamathqa-ftweightdecay0.01 1B • Updated Dec 27, 2025 • 1
th135/llama-1B-20BT-weightdecay0.001-seed42-metamathqa-ftweightdecay0.1 1B • Updated Dec 27, 2025 • 1
th135/llama-1B-20BT-weightdecay0.0001-seed42-metamathqa-ftweightdecay1.0 1B • Updated Dec 27, 2025 • 1
th135/llama-1B-20BT-weightdecay0.0001-seed42-metamathqa-ftweightdecay0.01 1B • Updated Dec 27, 2025 • 1
th135/llama-1B-20BT-weightdecay0.0001-seed42-metamathqa-ftweightdecay0.1 1B • Updated Dec 27, 2025 • 2
th135/llama-1B-20BT-ffwmysubset20BT-mathematics0.1-weightdecay1.0-seed42-metamathqa 1B • Updated Dec 27, 2025
th135/llama-1B-20BT-ffwmysubset20BT-mathematics0.1-weightdecay0.1-seed42-metamathqa 1B • Updated Dec 27, 2025 • 1
th135/llama-1B-20BT-ffwmysubset20BT-mathematics0.1-weightdecay0.01-seed42-metamathqa 1B • Updated Dec 27, 2025 • 1
th135/llama-1B-20BT-ffwmysubset20BT-mathematics0.1-weightdecay0.001-seed42-metamathqa 1B • Updated Dec 27, 2025 • 1
th135/llama-1B-20BT-ffwmysubset20BT-mathematics0.1-weightdecay0.0001-seed42-metamathqa 1B • Updated Dec 27, 2025 • 1
th135/llama-1B-20BT-ffwmysubset20BT-mathematics0.05-weightdecay1.0-seed42-metamathqa 1B • Updated Dec 27, 2025 • 1
th135/llama-1B-20BT-ffwmysubset20BT-mathematics0.05-weightdecay0.1-seed42-metamathqa 1B • Updated Dec 27, 2025 • 1
th135/llama-1B-20BT-ffwmysubset20BT-mathematics0.05-weightdecay0.01-seed42-metamathqa 1B • Updated Dec 27, 2025 • 1
th135/llama-1B-20BT-ffwmysubset20BT-mathematics0.05-weightdecay0.001-seed42-metamathqa 1B • Updated Dec 27, 2025 • 1
th135/llama-1B-20BT-ffwmysubset20BT-mathematics0.05-weightdecay0.0001-seed42-metamathqa 1B • Updated Dec 27, 2025 • 1
th135/llama-1B-20BT-ffwmysubset20BT-mathematics0.01-weightdecay1.0-seed42-metamathqa 1B • Updated Dec 27, 2025 • 1