Models (6,308)

Active filters: full

mradermacher/llama3-1_8b_mlfoundations-dev-stackexchange_sports-GGUF

8B • Updated Jul 31, 2025 • 12 • 1

mradermacher/stackexchange_devops-GGUF

8B • Updated Jul 31, 2025 • 57 • 2

mlfoundations-dev/deepspeed_no_offload

Text Generation • 8B • Updated Mar 11, 2025 • 5

mlfoundations-dev/training_baseline

Text Generation • 8B • Updated Mar 11, 2025 • 4

mlfoundations-dev/deepspeed_no_offload_liger

Text Generation • 8B • Updated Mar 11, 2025 • 3

mlfoundations-dev/deepspeed_no_offload_liger_torchcompile

Text Generation • 8B • Updated Mar 11, 2025 • 3

secmlr/VD-DS-Clean-8k_VD-QWQ-Clean-8k_VD-QWQ-Noisy-Small-8k_Qwen2.5-7B-Instruct_full_sft_1e-5

Text Generation • 8B • Updated Mar 12, 2025 • 2

secmlr/VD-DS-Clean-8k_VD-QWQ-Clean-8k_VD-QWQ-Noisy-Small-16k_Qwen2.5-7B-Instruct_full_sft_1e-5

Text Generation • 8B • Updated Mar 13, 2025 • 3

mlfoundations-dev/2k_chunk_general-thought-feb-25

Text Generation • 8B • Updated Mar 13, 2025 • 3

Leon97ZJU/llama_answer

Text Generation • 8B • Updated Mar 12, 2025 • 3

Leon97ZJU/qwen_answer

Text Generation • 8B • Updated Mar 12, 2025 • 3

secmlr/VD-DS-Clean-8k_VD-DS-Clean-16k_Qwen2.5-7B-Instruct_full_sft_1e-5

Text Generation • 8B • Updated Mar 12, 2025 • 6

secmlr/VD-QWQ-Clean-8k_VD-QWQ-Clean-16k_Qwen2.5-7B-Instruct_full_sft_1e-5

Text Generation • 8B • Updated Mar 12, 2025 • 4

mlfoundations-dev/global_batchsize_512_lradjusted8

Text Generation • 8B • Updated Mar 14, 2025 • 3

mradermacher/diffullama-GGUF

7B • Updated Jul 11, 2025 • 129

mlfoundations-dev/global_batchsize_512_lradjusted32_warmup05

Text Generation • 8B • Updated Mar 14, 2025 • 4

secmlr/rz_simplier_reasoning_VD-DS-Clean-8k_VD-DS-Clean-16k_Qwen2.5-7B-Instruct_full_sft_1e-5_sft

Text Generation • 8B • Updated Mar 13, 2025 • 10

mradermacher/diffullama-i1-GGUF

7B • Updated Jul 11, 2025 • 230

secmlr/SWE-BENCH-500-train-set-claude-reasoning-localization_qwen_code_7B_test_swe_localization

Text Generation • 8B • Updated Mar 13, 2025 • 5

mlfoundations-dev/global_batchsize_512_lradjusted64

Text Generation • 8B • Updated Mar 14, 2025 • 3

mradermacher/oh_v1.3_unnatural_instructions_x8-GGUF

8B • Updated Jul 11, 2025 • 11 • 1

mlfoundations-dev/global_batchsize_512_lradjusted32

Text Generation • 8B • Updated Mar 13, 2025 • 4

mlfoundations-dev/global_batchsize_512_lradjusted32_constant

Text Generation • 8B • Updated Mar 14, 2025 • 3

mlfoundations-dev/global_batchsize_512_lradjusted16

Text Generation • 8B • Updated Mar 14, 2025 • 6

secmlr/ruizhe_simplier_VD-QWQ-Clean-8k_VD-QWQ-Clean-16k_Qwen2.5-7B-Instruct_full_sft_1e-5_QwQ8k16k

Text Generation • 8B • Updated Mar 13, 2025 • 3

mradermacher/llama3-1_8b_mlfoundations-dev-stackexchange_sports-i1-GGUF

8B • Updated Jul 11, 2025 • 37 • 2

cutelemonlili/Qwen2.5-7B-Instruct_Lean_Code_no_nl

Text Generation • 8B • Updated Mar 13, 2025 • 2

cutelemonlili/Qwen2.5-3B-Instruct_Lean_Code_no_nl

Text Generation • 3B • Updated Mar 13, 2025 • 4

cutelemonlili/Qwen2.5-1.5B-Instruct_Lean_Code_no_nl

Text Generation • 2B • Updated Mar 13, 2025 • 2

cutelemonlili/Qwen2.5-0.5B-Instruct_Lean_Code_no_nl

Text Generation • 0.5B • Updated Mar 13, 2025 • 3