mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.10_dcftv1.2
Text Generation
• 9B • Updated • 9
mlfoundations-dev/hp_ablations_gemma_adambeta2_0.9995_dcftv1.2
Text Generation
• 9B • Updated • 1
mlfoundations-dev/hp_ablations_gemma_adambeta2_0.98_dcftv1.2
Text Generation
• 9B • Updated mlfoundations-dev/hp_ablations_gemma_scheduler_constant_dcftv1.2
Text Generation
• 9B • Updated mlfoundations-dev/hp_ablations_gemma_adambeta2_0.999_dcftv1.2
Text Generation
• 9B • Updated • 1
mlfoundations-dev/hp_ablations_gemma_adambeta1_0.85_dcftv1.2
Text Generation
• 9B • Updated mlfoundations-dev/hp_ablations_gemma_adambeta1_0.95_dcftv1.2
Text Generation
• 9B • Updated mlfoundations-dev/hp_ablations_gemma_lr5e-6_dcftv1.2
Text Generation
• 9B • Updated • 1
mlfoundations-dev/hp_ablations_gemma_lr8e-6_dcftv1.2
Text Generation
• 9B • Updated mlfoundations-dev/hp_ablations_gemma_bsz256_dcftv1.2
Text Generation
• 9B • Updated • 1
mlfoundations-dev/hp_ablations_gemma_bsz512_dcftv1.2
Text Generation
• 9B • Updated • 7
mlfoundations-dev/hp_ablations_gemma_adambeta1_0.9_dcftv1.2
Text Generation
• 9B • Updated • 6
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.05_minlr1e-7_dcftv1.2
Text Generation
• 9B • Updated • 5
mlfoundations-dev/hp_ablations_gemma_adambeta1_0.92_dcftv1.2
Text Generation
• 9B • Updated mlfoundations-dev/hp_ablations_gemma_adambeta2_0.95_dcftv1.2
Text Generation
• 9B • Updated mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.05_minlr1e-6_dcftv1.2
Text Generation
• 9B • Updated • 3
mlfoundations-dev/hp_ablations_gemma_adambeta2_0.995_dcftv1.2
Text Generation
• 9B • Updated • 2
mlfoundations-dev/hp_ablations_gemma_lr1e-6_dcftv1.2
Text Generation
• 9B • Updated • 2
mlfoundations-dev/hp_ablations_gemma_lr1e-5_dcftv1.2
Text Generation
• 9B • Updated mlfoundations-dev/llama3-1_8b_webinstruct_750k
Text Generation
• 8B • Updated mlfoundations-dev/oh_v1.2_opengpt_x2
mlfoundations-dev/oh_v1.2_opengpt_x.125
mlfoundations-dev/oh_v1.2_opengpt_x.5
mlfoundations-dev/oh_v1.2_opengpt_x.25
mlfoundations-dev/oh_v1.2_alpaca_x4
Updated
mlfoundations-dev/oh_v1.2_alpaca_x.5
Updated
mlfoundations-dev/oh_v1.2_alpaca_x2
Updated
mlfoundations-dev/oh_v1.2_alpaca_x.25
Updated
mlfoundations-dev/hp_ablations_qwen_scheduler_linear_warmup0.10_dcftv1.2
Text Generation
• 8B • Updated • 2
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.10_minlr5e-7_dcftv1.2
Text Generation
• 8B • Updated • 3