Active filters: full
mlfoundations-dev/hp_ablations_grid_mistral_bsz1024_lr2e-6_scheduler-cosine-warmup0.15-minlr5e-7
Text Generation
• 7B • Updated
• 2
mlfoundations-dev/hp_ablations_grid_qwen_bsz128_lr5e-6
Text Generation
• 8B • Updated
• 2
mlfoundations-dev/hp_ablations_grid_qwen_bsz64_lr8e-6
Text Generation
• 8B • Updated
• 8
mlfoundations-dev/hp_ablations_grid_qwen_bsz512_lr8e-6
Text Generation
• 8B • Updated
mlfoundations-dev/hp_ablations_grid_qwen_bsz256_lr5e-6
Text Generation
• 8B • Updated
mlfoundations-dev/hp_ablations_grid_qwen_bsz512_lr5e-6
Text Generation
• 8B • Updated
• 7
mlfoundations-dev/hp_ablations_grid_qwen_bsz256_lr8e-6
Text Generation
• 8B • Updated
• 1
mlfoundations-dev/hp_ablations_grid_mistral_bsz1024_lr5e-6_scheduler-cosine-warmup0.15-minlr5e-7
Text Generation
• 7B • Updated
• 8
mlfoundations-dev/hp_ablations_grid_mistral_bsz1024_lr2e-6_scheduler-cosine-warmup0.15
Text Generation
• 7B • Updated
• 5
mlfoundations-dev/hp_ablations_grid_mistral_bsz1024_lr5e-6_scheduler-cosine-warmup0.15
Text Generation
• 7B • Updated
• 2
mlfoundations-dev/hp_ablations_grid_mistral_bsz1024_lr5e-6_scheduler-cosine-warmup0.05-minlr5e-7
Text Generation
• 7B • Updated
• 1
mlfoundations-dev/hp_ablations_grid_mistral_bsz4096_lr2e-6_scheduler-cosine-warmup0.05-minlr5e-7
Text Generation
• 7B • Updated
mlfoundations-dev/hp_ablations_grid_mistral_bsz2048_lr2e-6_scheduler-cosine-warmup0.05-minlr5e-7
Text Generation
• 7B • Updated
• 9
mlfoundations-dev/hp_ablations_grid_mistral_bsz4096_lr2e-6_scheduler-cosine-warmup0.15-minlr5e-7
Text Generation
• 7B • Updated
• 5
mlfoundations-dev/hp_ablations_grid_mistral_bsz2048_lr2e-6_scheduler-cosine-warmup0.15
Text Generation
• 7B • Updated
• 7
mlfoundations-dev/hp_ablations_grid_mistral_bsz4096_lr2e-6_scheduler-cosine-warmup0.15
Text Generation
• 7B • Updated
• 6
mlfoundations-dev/hp_ablations_grid_mistral_bsz4096_lr5e-6_scheduler-cosine-warmup0.05-minlr5e-7
Text Generation
• 7B • Updated
• 1
mlfoundations-dev/hp_ablations_grid_mistral_bsz2048_lr5e-6_scheduler-cosine-warmup0.05-minlr5e-7
Text Generation
• 7B • Updated
• 1
mlfoundations-dev/llama3-1_8b_webinstruct_original_700k
Text Generation
• 8B • Updated
• 1
mlfoundations-dev/hp_ablations_grid_mistral_bsz4096_lr5e-6_scheduler-cosine-warmup0.15
Text Generation
• 7B • Updated
• 3
mlfoundations-dev/hp_ablations_grid_mistral_bsz2048_lr5e-6_scheduler-cosine-warmup0.15
Text Generation
• 7B • Updated
• 8
mlfoundations-dev/hp_ablations_grid_mistral_bsz4096_lr5e-6_scheduler-cosine-warmup0.15-minlr5e-7
Text Generation
• 7B • Updated
• 2
mlfoundations-dev/hp_ablations_grid_mistral_bsz2048_lr5e-6_scheduler-cosine-warmup0.15-minlr5e-7
Text Generation
• 7B • Updated
• 4
mlfoundations-dev/oh_v1.3_evol_instruct_x8
Text Generation
• 8B • Updated
cutelemonlili/saves_llama3.2_3b_origianl_MATH_training_rewrite_common_shorter
Text Generation
• 4B • Updated
• 2
cutelemonlili/saves_llama3.2_3b_origianl_MATH_training_rewrite_common_normal
Text Generation
• 4B • Updated
• 1
cutelemonlili/saves_llama3.2_3b_origianl_MATH_training_rewrite_common_shorter_8b
Text Generation
• 4B • Updated
• 1
mlfoundations-dev/llama3-1_8b_webinstruct_original_750k_uniform
Text Generation
• 8B • Updated
• 1
mlfoundations-dev/hp_ablations_grid_mistral_bsz512_lr5e-6_scheduler-cosine-warmup0.05-minlr5e-7-mistralv0.3
Text Generation
• 7B • Updated
• 1
mlfoundations-dev/hp_ablations_grid_mistral_bsz512_lr5e-6_scheduler-cosine-warmup0.15-minlr5e-7-mistralv0.3
Text Generation
• 7B • Updated