mlfoundations-dev/llama3-1_8b_webinstruct_original_700k
Text Generation
• 8B • Updated mlfoundations-dev/hp_ablations_grid_mistral_bsz2048_lr2e-6_scheduler-cosine-warmup0.15
Text Generation
• 7B • Updated • 2
mlfoundations-dev/hp_ablations_grid_mistral_bsz4096_lr2e-6_scheduler-cosine-warmup0.15-minlr5e-7
Text Generation
• 7B • Updated • 2
mlfoundations-dev/hp_ablations_grid_mistral_bsz2048_lr2e-6_scheduler-cosine-warmup0.05-minlr5e-7
Text Generation
• 7B • Updated • 1
mlfoundations-dev/hp_ablations_grid_mistral_bsz4096_lr2e-6_scheduler-cosine-warmup0.05-minlr5e-7
Text Generation
• 7B • Updated mlfoundations-dev/hp_ablations_grid_mistral_bsz1024_lr2e-6_scheduler-cosine-warmup0.15
Text Generation
• 7B • Updated • 2
mlfoundations-dev/hp_ablations_grid_mistral_bsz1024_lr5e-6_scheduler-cosine-warmup0.15-minlr5e-7
Text Generation
• 7B • Updated • 1
mlfoundations-dev/hp_ablations_grid_mistral_bsz1024_lr5e-6_scheduler-cosine-warmup0.15
Text Generation
• 7B • Updated • 1
mlfoundations-dev/hp_ablations_grid_mistral_bsz1024_lr5e-6_scheduler-cosine-warmup0.05-minlr5e-7
Text Generation
• 7B • Updated mlfoundations-dev/oh_v1.3_slim_orca_x4
Text Generation
• 8B • Updated • 1
mlfoundations-dev/original_tiger_dataset_small
Text Generation
• 8B • Updated • 2
mlfoundations-dev/hp_ablations_gemma_epoch4
Text Generation
• 9B • Updated • 2
mlfoundations-dev/hp_ablations_gemma_epoch2
Text Generation
• 9B • Updated • 2
mlfoundations-dev/oh-dcft-v3.1-llama-3.1-8b
Text Generation
• 8B • Updated • 1
• 1
mlfoundations-dev/oh-dcft-v3.1-claude-3-5-haiku-20241022
Text Generation
• 8B • Updated • 13
• 5
mlfoundations-dev/oh_v1.3_metamath_x8
Text Generation
• 8B • Updated mlfoundations-dev/hp_ablations_gemma_epoch3
Text Generation
• 9B • Updated • 4
mlfoundations-dev/hp_ablations_gemma_epoch4_dcftv1.2
Text Generation
• 9B • Updated • 2
mlfoundations-dev/hp_ablations_gemma_bsz1024
Text Generation
• 9B • Updated mlfoundations-dev/hp_ablations_gemma_epoch2_dcftv1.2
Text Generation
• 9B • Updated • 2
mlfoundations-dev/oh_v1.3_metamath_x2
Text Generation
• 8B • Updated mlfoundations-dev/hp_ablations_gemma_epoch3_dcftv1.2
Text Generation
• 9B • Updated mlfoundations-dev/oh_v1.3_metamath_x.5
Text Generation
• 8B • Updated mlfoundations-dev/oh_v1.3_metamath_x.25
Text Generation
• 8B • Updated mlfoundations-dev/oh-dcft-v3.1-gpt-4o-2024-11-20
Text Generation
• 8B • Updated • 3
• mlfoundations-dev/oh_v1.3_metamath_x.125
Text Generation
• 8B • Updated mlfoundations-dev/hp_ablations_gemma_epoch1
Text Generation
• 9B • Updated mlfoundations-dev/hp_ablations_gemma_bsz2048
Text Generation
• 9B • Updated • 5
mlfoundations-dev/hp_ablations_gemma_epoch1_dcftv1.2
Text Generation
• 9B • Updated • 4
mlfoundations-dev/oh-dcft-v3.1-llama-3.2-1b
Text Generation
• 8B • Updated • 1