Models: 23

Active filters: open-r1/DAPO-Math-17k-Processed

ldqvinh/qwen3-0.6b-base-grpo-v2-2048

Updated Jul 17, 2025

CaptainHPY/Qwen2.5-7B-R1

Text Generation • 5B • Updated Sep 17, 2025 • 2

CaptainHPY/Qwen2.5-7B-R1-GGUF

Text Generation • 8B • Updated Sep 18, 2025 • 40

hdong0/Qwen3-1.7B-base-Open-R1-GRPO_dapo_acc_4096_nokl

Text Generation • 2B • Updated Oct 7, 2025 • 3

hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_16384_nokl

Text Generation • 8B • Updated Oct 10, 2025 • 41

hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_4096_nokl

Text Generation • 8B • Updated Oct 9, 2025 • 13

hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_2048_nokl

Text Generation • 8B • Updated Oct 8, 2025 • 7

hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_2048_to_16384_nokl

Text Generation • 8B • Updated Oct 12, 2025 • 7

hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_4096_to_16384_nokl

Text Generation • 8B • Updated Oct 14, 2025 • 2

hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_8192_to_16384_nokl

Text Generation • 8B • Updated Oct 15, 2025 • 16

AmirhoseinGH/Gnosis-Qwen3-1.7B-Hybrid

Text Classification • 2B • Updated Jan 7 • 47

AmirhoseinGH/Gnosis-Qwen3-4B-Instruct-2507

Text Classification • 4B • Updated Jan 7 • 31

AmirhoseinGH/Gnosis-Qwen3-4B-Thinking-2507

Text Classification • 4B • Updated Jan 7 • 16

AmirhoseinGH/Gnosis-Qwen3-8B

Text Classification • 8B • Updated Jan 7 • 5

Jackrong/Llama3.1-8B-Thinking-R1

Text Generation • 8B • Updated Dec 21, 2025 • 32

Jackrong/Llama3.1-8B-Thinking-R1-GGUF

Text Generation • 8B • Updated Dec 21, 2025 • 4

mradermacher/Llama3.1-8B-Thinking-R1-GGUF

8B • Updated Dec 22, 2025 • 396

mradermacher/Llama3.1-8B-Thinking-R1-i1-GGUF

8B • Updated Dec 22, 2025 • 258

Stormtrooperaim/Small-Math

Text Generation • 0.6B • Updated Dec 23, 2025 • 10

Stormtrooperaim/Small-Math-Q8_0-GGUF

0.6B • Updated Dec 23, 2025 • 6

Swekerr/llama1b_dapo_math_grpo-v1

Text Generation • 1B • Updated Dec 25, 2025 • 10

Swekerr/llama1b_dapo_math_grpo-v1.1

Text Generation • 1B • Updated Dec 25, 2025 • 6

bingyy/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-DAPO-Math-17k

Text Generation • Updated Jan 8 • 3