lewtun's Collections
Awesome RL datasets
Long-context post-training
H4
Awesome RLHF
Mistral 7B + UltraChat + Arithmo checkpoints
Hub tools
Gemma RLAIF
Awesome RL datasets
Updated Sep 23, 2025
Upvotes: 1
ScaleAI/SWE-bench_Pro • Benchmark • Updated Feb 23 • 731 • 40.2k • 110
agentica-org/DeepScaleR-Preview-Dataset • Viewer • Updated Feb 10, 2025 • 40.3k • 14.2k • 199
open-r1/DAPO-Math-17k-Processed • Viewer • Updated Nov 10, 2025 • 34.8k • 7.79k • 69