Inference Providers
Active filters: rl
Text Generation
• 8B • Updated • 3
Text Generation
• 8B • Updated • 3
Text Generation
• 8B • Updated • 4
Text Generation
• 8B • Updated • 5
Text Generation
• 8B • Updated • 3
Text Generation
• 8B • Updated • 3
Text Generation
• 8B • Updated • 7
• 1
Text Generation
• 8B • Updated • 2
Text Generation
• 8B • Updated • 2
McClain/naive-dna-llama-6mer
Text Generation
• 0.2B • Updated abaryan/CyberXP_Agent_Llama_3.2_1B
Text Generation
• 1B • Updated • 152
• mradermacher/CyberXP_Agent_Llama_3.2_1B-GGUF
1B • Updated • 190
• 1
PokeeAI/pokee_research_7b
Text Generation
• 8B • Updated • 12
• • 100
ArtusDev/PokeeAI_pokee_research_7b-EXL3
Updated • 17
Anonymouslolol/qwen3-8B-hanabi-step110
Reinforcement Learning
• Updated • 19
Mungert/pokee_research_7b-GGUF
Text Generation
• 8B • Updated • 3.09k
• 1
HarleyCooper/Qwen3-0.6B-Dakota-Grammar-RL
Text Generation
• 0.8B • Updated • 4
mradermacher/Qwen3-0.6B-Dakota-Grammar-RL-GGUF
Reinforcement Learning
• 0.8B • Updated • 172
HarleyCooper/Qwen3-0.6B-Dakota-Grammar-RL-400
Text Generation
• Updated Text Generation
• 8B • Updated • 1
Text Generation
• 8B • Updated • 2
Text Generation
• 8B • Updated • 2
Text Generation
• 8B • Updated • 1
Text Generation
• 8B • Updated • 4
Text Generation
• 8B • Updated • 1
Text Generation
• 8B • Updated • 1
Text Generation
• 8B • Updated • 4
Text Generation
• 8B • Updated • 4
Text Generation
• 8B • Updated • 1