Inference Providers
Active filters: open-r1
smolagents/SmolVLM2-2.2B-Instruct-Agentic-GUI
Image-Text-to-Text
• 2B • Updated • 141
• 61
HectorHe/DeepSeek-V2-Lite-aux-free-sft-math7k-1epoch-1e-4-gamma
Text Generation
• 16B • Updated • 38
• 1
Neelectric/Llama-3.2-1B-Instruct_SFT_Math-220kv00.04
Text Generation
• 1B • Updated • 6
• 1
Neelectric/Llama-3.2-1B-Instruct_SFT_sciencev00.01
Text Generation
• 1B • Updated • 90
• 1
Neelectric/Llama-3.2-1B-Instruct_SFT_sciencefisher_v00.06
Text Generation
• 1B • Updated • 89
• 1
edbeeching/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 7
yucaiwen/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO
Text Generation
• 8B • Updated • 8
• 1
JinnP/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 2
bangan/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated liusq19/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 2
stepyoun/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 1
howey/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 3
wxnfifth/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 2
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math
Text Generation
• 8B • Updated • 5
Text Generation
• 8B • Updated • 1
Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math
Text Generation
• 2B • Updated • 2
Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math
Text Generation
• 2B • Updated • 6
skzxjus/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 2
mradermacher/DeepSeek-R1-Distill-Qwen-7B-GRPO-GGUF
8B • Updated • 119
skzxjus/Qwen2.5-7B-1m-Open-R1-Distill
Text Generation
• 8B • Updated • 4
• 4
skzxjus/Qwen2.5-7B-Open-R1-GRPO
Text Generation
• 8B • Updated ununtrium/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated • 1
mradermacher/DeepSeek-R1-Distill-Qwen-7B-GRPO-i1-GGUF
8B • Updated • 358
yeshsurya/Qwen2.5-7B-Math-with_50stepGRPO
Text Generation
• 8B • Updated • 2
mradermacher/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math-GGUF
2B • Updated • 74
mradermacher/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-GGUF
8B • Updated • 58
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math_lowlr
Text Generation
• 8B • Updated Dongwei/Qwen-2.5-7B_Math_smalllr
Text Generation
• 8B • Updated • 1
Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math_smalllr
Text Generation
• 2B • Updated • 2