Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
SLoonker
's Collections
CoT Distillation
Multi-Agent Coding
Small RL Datasets For Training
Small RL Datasets For Training
updated
Mar 5
Upvote
-
Sort: Collection
SLoonker/RL-Claude-Reasoning-SFT
Viewer
•
Updated
Mar 5
•
2.21k
•
21
•
1
SLoonker/RL-Claude-Reasoning-GRPO-Prompts
Viewer
•
Updated
Mar 5
•
2.21k
•
19
SLoonker/RL-Ling-Coding-DPO
Viewer
•
Updated
Mar 5
•
2.84k
•
15
SLoonker/RL-Claude-Creative-Writing-SFT
Viewer
•
Updated
Mar 5
•
818
•
101
•
1
SLoonker/RL-STEM-DPO
Viewer
•
Updated
Mar 5
•
1.51k
•
11
SLoonker/RL-OpenCodeReasoning-DPO
Viewer
•
Updated
Mar 5
•
2.75k
•
28
Upvote
-
Sort: Collection
Share collection
View history
Collection guide
Browse collections