Active filters: QuestionAnswering
JamieAi33/Phi-2-QLora
JamieAi33/Phi-2_PEFT
KakashiH/BashExplainer_Gemma
2KKLabs/Kaleidoscope_small_v1
2KKLabs/Kaleidoscope_large_v1
SEGAgentRL/LLDS-A-GRPO-Qwen2.5-7B-Ins • Reinforcement Learning • 8B • Updated • 91 • 2
SEGAgentRL/LLDS-A-GRPO-Qwen2.5-7B-Base • Reinforcement Learning • 8B • Updated • 69 • 2
SEGAgentRL/LLDS-A-GSPO-Qwen2.5-3B-Ins • Reinforcement Learning • 3B • Updated • 32
SEGAgentRL/LLDS-R-GSPO-Qwen2.5-3B-Ins • Reinforcement Learning • 3B • Updated • 28 • 1
SEGAgentRL/LLDS-R-GRPO-Qwen2.5-3B-Base • Reinforcement Learning • 3B • Updated • 23 • 1
SEGAgentRL/LLDS-A-GRPO-Qwen2.5-3B-Base-MA • Reinforcement Learning • 3B • Updated • 35 • 1
SEGAgentRL/LLDS-A-GRPO-Qwen2.5-3B-Base • Reinforcement Learning • 3B • Updated • 22
SEGAgentRL/LLDS-R-GRPO-Qwen2.5-3B-Ins • Reinforcement Learning • 3B • Updated • 30 • 1
mradermacher/LLDS-A-GSPO-Qwen2.5-3B-Ins-GGUF • 3B • Updated • 438
mradermacher/LLDS-A-GRPO-Qwen2.5-7B-Base-GGUF • 8B • Updated • 670 • 1
SEGAgentRL/LLDS-A-GRPO-Qwen2.5-3B-Ins • Reinforcement Learning • 3B • Updated • 31
mradermacher/LLDS-A-GRPO-Qwen2.5-7B-Base-i1-GGUF • 8B • Updated • 7k • 1
mradermacher/LLDS-A-GRPO-Qwen2.5-7B-Ins-GGUF • 8B • Updated • 679 • 1
mradermacher/LLDS-A-GRPO-Qwen2.5-3B-Base-GGUF • 3B • Updated • 782
mradermacher/LLDS-A-GRPO-Qwen2.5-7B-Ins-i1-GGUF • 8B • Updated • 4.86k • 1
mradermacher/LLDS-A-GRPO-Qwen2.5-3B-Ins-GGUF • 3B • Updated • 828
mradermacher/LLDS-R-GRPO-Qwen2.5-3B-Ins-GGUF • 3B • Updated • 849 • 1
mradermacher/LLDS-R-GRPO-Qwen2.5-3B-Base-GGUF • 3B • Updated • 760 • 1
mradermacher/LLDS-A-GRPO-Qwen2.5-3B-Base-MA-GGUF • 3B • Updated • 834 • 1
mradermacher/LLDS-R-GSPO-Qwen2.5-3B-Ins-GGUF • 3B • Updated • 628 • 1
mradermacher/LLDS-R-GRPO-Qwen2.5-3B-Base-i1-GGUF • 3B • Updated • 2.66k • 1
mradermacher/LLDS-A-GRPO-Qwen2.5-3B-Base-i1-GGUF • 3B • Updated • 2.58k
mradermacher/LLDS-A-GRPO-Qwen2.5-3B-Ins-i1-GGUF • Updated • 2.5k
mradermacher/LLDS-R-GRPO-Qwen2.5-3B-Ins-i1-GGUF • Updated • 2.72k • 1