mlfoundations-dev/oh_v1.3_opengpt_x.125
Text Generation
• 8B • Updated mlfoundations-dev/oh_v1.3_opengpt_x.25
Text Generation
• 8B • Updated mlfoundations-dev/oh_v1.3_opengpt_x4
Text Generation
• 8B • Updated mlfoundations-dev/oh_v1.3_opengpt_x2
Text Generation
• 8B • Updated mlfoundations-dev/oh_v1.3_slim_orca_x.125
Text Generation
• 8B • Updated mlfoundations-dev/hp_ablations_llama3_epoch3_dcftv1.2
Text Generation
• 8B • Updated • 1
mlfoundations-dev/hp_ablations_mistral_epoch4_dcftv1.2
Text Generation
• 7B • Updated • 2
mlfoundations-dev/hp_ablations_mistral_epoch2_dcftv1.2
Text Generation
• 7B • Updated • 2
mlfoundations-dev/hp_ablations_llama3_epoch4_dcftv1.2
Text Generation
• 8B • Updated mlfoundations-dev/hp_ablations_llama3_epoch2_dcftv1.2
Text Generation
• 8B • Updated • 2
mlfoundations-dev/hp_ablations_qwen_epoch4_dcftv1.2
Text Generation
• 8B • Updated • 1
mlfoundations-dev/hp_ablations_qwen_epoch2_dcftv1.2
Text Generation
• 8B • Updated • 1
mlfoundations-dev/hp_ablations_mistral_epoch3_dcftv1.2
Text Generation
• 7B • Updated • 2
mlfoundations-dev/hp_ablations_qwen_epoch3_dcftv1.2
Text Generation
• 8B • Updated • 2
mlfoundations-dev/hp_ablations_mistral_epoch5_dcftv1.2
Text Generation
• 7B • Updated • 3
mlfoundations-dev/hp_ablations_qwen_epoch5_dcftv1.2
Text Generation
• 8B • Updated mlfoundations-dev/hp_ablations_mistral_epoch1_dcftv1.2
Text Generation
• 7B • Updated mlfoundations-dev/hp_ablations_llama3_epoch1_dcftv1.2
Text Generation
• 8B • Updated • 2
mlfoundations-dev/hp_ablations_qwen_epoch1_dcftv1.2
Text Generation
• 8B • Updated • 1
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.10_minlr1e-7_dcftv1.2
Text Generation
• 9B • Updated • 1
mlfoundations-dev/hp_ablations_gemma_lr2e-6_dcftv1.2
Text Generation
• 9B • Updated mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.15_dcftv1.2
Text Generation
• 9B • Updated • 3
mlfoundations-dev/hp_ablations_gemma_scheduler_linear_warmup0.10_dcftv1.2
Text Generation
• 9B • Updated • 1
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.10_minlr5e-7_dcftv1.2
Text Generation
• 9B • Updated • 1
mlfoundations-dev/hp_ablations_gemma_adambeta2_0.99_dcftv1.2
Text Generation
• 9B • Updated mlfoundations-dev/hp_ablations_gemma_scheduler_linear_warmup0.05_dcftv1.2
Text Generation
• 9B • Updated • 2
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.10_minlr1e-6_dcftv1.2
Text Generation
• 9B • Updated • 4
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.05_dcftv1.2
Text Generation
• 9B • Updated • 2
mlfoundations-dev/hp_ablations_gemma_scheduler_inverse_sqrt_dcftv1.2
Text Generation
• 9B • Updated mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.05_minlr5e-7_dcftv1.2
Text Generation
• 9B • Updated • 1