clembench-playpen/Mistral-Small-24B-Instruct-0.1k-warmup_playpen_SFT-e3_DFINAL_0.1K-steps
Updated
clembench-playpen/llama3.1-8B_DPO_noSFT
Updated
clembench-playpen/Mistral-Small-24B-Instruc-0.1k-warmup_playpen_SFT-e3_DFINAL_0.6K-steps
Updated
clembench-playpen/meta-llama-3.1-8b_KTO_noSFT
Updated
clembench-playpen/Llama-3.1-70B_KTO_noSFT
Updated
clembench-playpen/llama-3.1-8B-Instruct-multi-step-collator_playpen_SFT-e3_DFINAL_0.7K-steps
clembench-playpen/meta-llama-Meta-Llama-3.1-8B-Instruct_SFT_E1_D40005_Michael_unload_from4bit
Text Generation
• 8B • Updated
• 2
clembench-playpen/meta-llama-3.1-8b-instruct-unsloth-bnb-4bit_KTO_Aborted_best_models_F_KTO_noSFT
Updated
clembench-playpen/llama3.1_DPO_2neg_Aborted_best_models_old_LA_DPO_noSFT
Updated
clembench-playpen/llama-3.1-8B-Instruct-only-warmup_playpen_SFT-e3_warm-up_0.225K-steps
clembench-playpen/llama-3.1-8B-Instruct-only-warmup_playpen_SFT-e3_warm-up_0.1K-steps
clembench-playpen/meta-llama-Meta-Llama-3.1-8B-Instruct_SFT_E1_D40005_merged_4bit
Text Generation
• 8B • Updated
• 2
clembench-playpen/meta-llama-Meta-Llama-3.1-8B-Instruct_SFT_E1_D40005_merged_16bit
Text Generation
• 8B • Updated
• 2
clembench-playpen/meta-llama-Meta-Llama-3.1-8B-Instruct_SFT_E1_D40005_full_precision
Text Generation
• 8B • Updated
• 2
clembench-playpen/llama-3.1-70B-Instruct-rehearsal_playpen_SFT-e3_DABL02_0.82K-steps
clembench-playpen/llama-3.1-70B-Instruct-0.1k-warmup_playpen_SFT-e3_DFINAL_0.6K-steps
clembench-playpen/llama-3.1-8B-Instruct-rehearsal-steps_playpen_SFT-e3_DABL02_0.93K-steps
clembench-playpen/llama-3.1-8B-Instruct-rehearsal-steps_playpen_SFT-e3_DFINAL_0.93K-steps
clembench-playpen/llama-3.1-8B-Instruct-warmup-0.1K-steps-full-prompt_playpen_SFT-e3_DFINAL_0.7K-steps
clembench-playpen/llama-3.1-8B-Instruct-fp_SFT_e1_DFINAL_merged_fp16
Text Generation
• 8B • Updated
clembench-playpen/llama-3.1-8B-Instruct-v1.6-only-full-precision-lora_v3_playpen_SFT-e3_DFINAL_0.7K-steps
clembench-playpen/llama-3.1-8B-Instruct-v1.6-only-full-precision-lora_v2_playpen_SFT-e3_DABL01_1.4K-steps
clembench-playpen/llama-3.1-8B-Instruct-v1.6-only-fp_SFT_e1_DABL01_merged_fp16
Text Generation
• 8B • Updated
• 1
clembench-playpen/llama3.1_D40005_DPO_allneg_Aborted_best_models_old_LA
Updated
clembench-playpen/llama3.1_D40005_DPO_5neg_Aborted_best_models_old_LA
Updated
clembench-playpen/llama3.1_D40005_DPO_3neg_Aborted_best_models_old_LA
Updated
clembench-playpen/llama-3.1-8B-Instruct-v1.6-only-completion-only_playpen_SFT-e3_DABL01
clembench-playpen/llama-3.1-8B-Instruct-v1.6-only-completion-only_playpen_SFT_DABL01
clembench-playpen/llama-3.1-8B-Instruct-v1.6-only_playpen_SFT_DABL01
clembench-playpen/abl_L8B_DPO_1neg_Aborted_best_models_FINAL
Updated