Neelectric/Llama-3.1-8B-Instruct_SFT_mathsp_ewc_v00.14 Text Generation • 8B • Updated 15 days ago • 11
Neelectric/Llama-3.1-8B-Instruct_SFT_mathsp_ewc_v00.13 Text Generation • 8B • Updated 16 days ago • 140
Neelectric/Llama-3.2-1B-Instruct_SFT_mathsp_ewc_v00.02 Text Generation • 1B • Updated 19 days ago • 33
Neelectric/Llama-3.1-8B-Instruct_SFT_mathsp_ewc_v00.11.2 Text Generation • 8B • Updated 22 days ago • 142
SCOPE: Self-Play via Co-Evolving Policies for Open-Ended Tasks Paper • 2605.31433 • Published 27 days ago • 28