Active filters: full
mlfoundations-dev/hp_ablations_mistral_adambeta2_0.95
Text Generation
• 7B • Updated
• 1
mlfoundations-dev/hp_ablations_mistral_adambeta2_0.995
Text Generation
• 7B • Updated
• 3
mlfoundations-dev/hp_ablations_mistral_adambeta2_0.9995
Text Generation
• 7B • Updated
• 2
mlfoundations-dev/hp_ablations_mistral_bsz256
Text Generation
• 7B • Updated
• 1
mlfoundations-dev/hp_ablations_mistral_adambeta2_0.999
Text Generation
• 7B • Updated
• 3
mlfoundations-dev/hp_ablations_mistral_adambeta2_0.99
Text Generation
• 7B • Updated
mlfoundations-dev/hp_ablations_mistral_bsz512
Text Generation
• 7B • Updated
• 4
mlfoundations-dev/hp_ablations_mistral_bsz1024
Text Generation
• 7B • Updated
• 7
mlfoundations-dev/hp_ablations_mistral_lr1e-6
Text Generation
• 7B • Updated
• 2
mlfoundations-dev/hp_ablations_mistral_lr1e-5
Text Generation
• 7B • Updated
• 1
mlfoundations-dev/hp_ablations_mistral_lr8e-6
Text Generation
• 7B • Updated
• 1
mlfoundations-dev/hp_ablations_mistral_lr2e-6
Text Generation
• 7B • Updated
• 4
mlfoundations-dev/hp_ablations_mistral_lr5e-6
Text Generation
• 7B • Updated
• 6
mlfoundations-dev/hp_ablations_mistral_bsz2048
Text Generation
• 7B • Updated
• 4
mlfoundations-dev/hp_ablations_mistral_scheduler_constant
Text Generation
• 7B • Updated
• 4
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.05
Text Generation
• 7B • Updated
• 5
mlfoundations-dev/hp_ablations_mistral_scheduler_linear_warmup0.10
Text Generation
• 7B • Updated
• 8
mlfoundations-dev/hp_ablations_mistral_scheduler_inverse_sqrt
Text Generation
• 7B • Updated
• 6
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.15
Text Generation
• 7B • Updated
• 4
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10
Text Generation
• 7B • Updated
• 6
mlfoundations-dev/hp_ablations_mistral_scheduler_linear_warmup0.05
Text Generation
• 7B • Updated
• 4
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.05_minlr1e-7
Text Generation
• 7B • Updated
• 11
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.05_minlr1e-6
Text Generation
• 7B • Updated
• 8
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.05_minlr5e-7
Text Generation
• 7B • Updated
• 11
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10_minlr5e-7
Text Generation
• 7B • Updated
• 5
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10_minlr1e-7
Text Generation
• 7B • Updated
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10_minlr1e-6
Text Generation
• 7B • Updated
• 8
DongfuJiang/prm_gsm_2k_with_full_sol_mix_ref_redistribution_hf
Text Generation
• 8B • Updated
• 5
mlfoundations-dev/hp_ablations_qwen_adambeta1_0.95
Text Generation
• 8B • Updated
• 5
mlfoundations-dev/hp_ablations_qwen_adambeta1_0.9
Text Generation
• 8B • Updated
• 7