Active filters: GRPO
Text Generation
• 0.1B • Updated
• 1
kavanmevada/SmolGRPO-135M
Text Generation
• 0.6B • Updated
• 1
kavanmevada/SmolGRPO-135M-adapter
Updated
Text Generation
• 0.1B • Updated
• 4
harikrushna2272/SmolGRPO-135M
Text Generation
• 0.1B • Updated
Text Generation
• 0.1B • Updated
Text Generation
• 0.1B • Updated
• 2
Text Generation
• 0.1B • Updated
Text Generation
• 0.1B • Updated
Text Generation
• 11B • Updated
• 8
• 4
Delta-Vector/Nanuq-R1-14B
Text Generation
• 14B • Updated
• 11
• 2
Koitenshin/Nanuq-R1-14B-GGUF
14B • Updated
• 115
mradermacher/Nanuq-R1-9B-GGUF
11B • Updated
• 47
• 1
mradermacher/Nanuq-R1-14B-GGUF
14B • Updated
• 86
mradermacher/Nanuq-R1-14B-i1-GGUF
14B • Updated
• 195
mradermacher/Nanuq-R1-9B-i1-GGUF
11B • Updated
• 82
amritansh/merged_model_audio_gemma
Image-Text-to-Text
• 5B • Updated
• 1
Text Generation
• 0.1B • Updated
• 1
KRadim/custom-SmolGRPO-135M
0.1B • Updated
Text Generation
• 0.1B • Updated
• 1
Text Generation
• 2B • Updated
• 4
mradermacher/GCPO-R1-1.5B-GGUF
2B • Updated
• 31
mradermacher/GCPO-R1-1.5B-i1-GGUF
2B • Updated
• 21
ahmedelhefnawy/SmolGRPO-135M
Updated
Text Generation
• 0.1B • Updated
• 1
MarouaneSanhaji/SmolGRPO-135M
Text Generation
• 0.1B • Updated
• 1
Text Generation
• 0.1B • Updated
• 1
Text Classification
• 0.1B • Updated
• 4
zeniftw/SmolLM2_135M_Grpo_Gsm8k
Text Generation
• 0.1B • Updated
• 2
Text Generation
• 0.1B • Updated
• 1