Naahraf27/npo_llama-3.2-1b-instruct_forget10_ep10_lr5e-5_alpha1.0_beta0.1 • Text Generation • 1B • Updated Apr 16 • 99
Naahraf27/npo_llama-3.2-3b-instruct_forget10_ep5_lr2e-5_alpha2.0_beta0.1 • Text Generation • 3B • Updated about 1 month ago • 219
Naahraf27/npo_llama-3.1-8b-instruct_forget10_ep5_lr5e-5_alpha2.0_beta0.1 • Text Generation • 8B • Updated Apr 16 • 103