-
-
-
-
-
-
Inference Providers
Active filters: full
mlfoundations-dev/hp_ablations_qwen_scheduler_inverse_sqrt_dcftv1.2
Text Generation
• 8B • Updated
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.05_minlr5e-7_dcftv1.2
Text Generation
• 8B • Updated
• 4
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.05_dcftv1.2
Text Generation
• 8B • Updated
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.15_dcftv1.2
Text Generation
• 8B • Updated
• 2
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.05_minlr1e-7_dcftv1.2
Text Generation
• 8B • Updated
• 5
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.10_minlr1e-6_dcftv1.2
Text Generation
• 8B • Updated
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.10_minlr1e-7_dcftv1.2
Text Generation
• 8B • Updated
• 6
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.10_minlr5e-7_dcftv1.2
Text Generation
• 8B • Updated
• 4
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.10_dcftv1.2
Text Generation
• 8B • Updated
• 3
mlfoundations-dev/hp_ablations_qwen_scheduler_linear_warmup0.10_dcftv1.2
Text Generation
• 8B • Updated
• 6
mlfoundations-dev/hp_ablations_qwen_scheduler_linear_warmup0.05_dcftv1.2
Text Generation
• 8B • Updated
• 10
lightblue/qwen2.5-7B-instruct-simpo
Text Generation
• 8B • Updated
ahmedheakl/qwen2.5_0.5b_llamafac_500k
Text Generation
• 0.6B • Updated
ahmedheakl/qwen2.5_0.5b_llamafac_130k-stack
Text Generation
• 0.6B • Updated
mlfoundations-dev/hp_ablations_gemma_adambeta1_0.85_dcftv1.2
Text Generation
• 9B • Updated
mlfoundations-dev/hp_ablations_gemma_adambeta1_0.95_dcftv1.2
Text Generation
• 9B • Updated
mlfoundations-dev/hp_ablations_gemma_adambeta1_0.92_dcftv1.2
Text Generation
• 9B • Updated
mlfoundations-dev/hp_ablations_gemma_adambeta2_0.995_dcftv1.2
Text Generation
• 9B • Updated
• 3
mlfoundations-dev/hp_ablations_gemma_adambeta1_0.9_dcftv1.2
Text Generation
• 9B • Updated
• 8
mlfoundations-dev/hp_ablations_gemma_adambeta2_0.999_dcftv1.2
Text Generation
• 9B • Updated
• 1
mlfoundations-dev/hp_ablations_gemma_bsz256_dcftv1.2
Text Generation
• 9B • Updated
• 1
mlfoundations-dev/hp_ablations_gemma_adambeta2_0.99_dcftv1.2
Text Generation
• 9B • Updated
mlfoundations-dev/hp_ablations_gemma_adambeta2_0.95_dcftv1.2
Text Generation
• 9B • Updated
mlfoundations-dev/hp_ablations_gemma_lr8e-6_dcftv1.2
Text Generation
• 9B • Updated
mlfoundations-dev/hp_ablations_gemma_adambeta2_0.98_dcftv1.2
Text Generation
• 9B • Updated
mlfoundations-dev/hp_ablations_gemma_bsz512_dcftv1.2
Text Generation
• 9B • Updated
• 3
mlfoundations-dev/hp_ablations_gemma_scheduler_constant_dcftv1.2
Text Generation
• 9B • Updated
mlfoundations-dev/hp_ablations_gemma_adambeta2_0.9995_dcftv1.2
Text Generation
• 9B • Updated
mlfoundations-dev/hp_ablations_gemma_lr5e-6_dcftv1.2
Text Generation
• 9B • Updated
• 1
mlfoundations-dev/hp_ablations_gemma_lr2e-6_dcftv1.2
Text Generation
• 9B • Updated