Active filters: full
mradermacher/llama3-1_8b_mlfoundations-dev-stackexchange_sports-GGUF
8B • Updated • 12 • 1
mradermacher/stackexchange_devops-GGUF
8B • Updated • 57 • 2
mlfoundations-dev/deepspeed_no_offload
Text Generation
• 8B • Updated • 5
mlfoundations-dev/training_baseline
Text Generation
• 8B • Updated • 4
mlfoundations-dev/deepspeed_no_offload_liger
Text Generation
• 8B • Updated • 3
mlfoundations-dev/deepspeed_no_offload_liger_torchcompile
Text Generation
• 8B • Updated • 3
secmlr/VD-DS-Clean-8k_VD-QWQ-Clean-8k_VD-QWQ-Noisy-Small-8k_Qwen2.5-7B-Instruct_full_sft_1e-5
Text Generation
• 8B • Updated • 2
secmlr/VD-DS-Clean-8k_VD-QWQ-Clean-8k_VD-QWQ-Noisy-Small-16k_Qwen2.5-7B-Instruct_full_sft_1e-5
Text Generation
• 8B • Updated • 3
mlfoundations-dev/2k_chunk_general-thought-feb-25
Text Generation
• 8B • Updated • 3
Text Generation
• 8B • Updated • 3
Text Generation
• 8B • Updated • 3
secmlr/VD-DS-Clean-8k_VD-DS-Clean-16k_Qwen2.5-7B-Instruct_full_sft_1e-5
Text Generation
• 8B • Updated • 6
secmlr/VD-QWQ-Clean-8k_VD-QWQ-Clean-16k_Qwen2.5-7B-Instruct_full_sft_1e-5
Text Generation
• 8B • Updated • 4
mlfoundations-dev/global_batchsize_512_lradjusted8
Text Generation
• 8B • Updated • 3
mradermacher/diffullama-GGUF
7B • Updated • 129
mlfoundations-dev/global_batchsize_512_lradjusted32_warmup05
Text Generation
• 8B • Updated • 4
secmlr/rz_simplier_reasoning_VD-DS-Clean-8k_VD-DS-Clean-16k_Qwen2.5-7B-Instruct_full_sft_1e-5_sft
Text Generation
• 8B • Updated • 10
mradermacher/diffullama-i1-GGUF
7B • Updated • 230
secmlr/SWE-BENCH-500-train-set-claude-reasoning-localization_qwen_code_7B_test_swe_localization
Text Generation
• 8B • Updated • 5
mlfoundations-dev/global_batchsize_512_lradjusted64
Text Generation
• 8B • Updated • 3
mradermacher/oh_v1.3_unnatural_instructions_x8-GGUF
8B • Updated • 11 • 1
mlfoundations-dev/global_batchsize_512_lradjusted32
Text Generation
• 8B • Updated • 4
mlfoundations-dev/global_batchsize_512_lradjusted32_constant
Text Generation
• 8B • Updated • 3
mlfoundations-dev/global_batchsize_512_lradjusted16
Text Generation
• 8B • Updated • 6
secmlr/ruizhe_simplier_VD-QWQ-Clean-8k_VD-QWQ-Clean-16k_Qwen2.5-7B-Instruct_full_sft_1e-5_QwQ8k16k
Text Generation
• 8B • Updated • 3
mradermacher/llama3-1_8b_mlfoundations-dev-stackexchange_sports-i1-GGUF
8B • Updated • 37 • 2
cutelemonlili/Qwen2.5-7B-Instruct_Lean_Code_no_nl
Text Generation
• 8B • Updated • 2
cutelemonlili/Qwen2.5-3B-Instruct_Lean_Code_no_nl
Text Generation
• 3B • Updated • 4
cutelemonlili/Qwen2.5-1.5B-Instruct_Lean_Code_no_nl
Text Generation
• 2B • Updated • 2
cutelemonlili/Qwen2.5-0.5B-Instruct_Lean_Code_no_nl
Text Generation
• 0.5B • Updated • 3