BounharAbdelaziz's Collections

RL Code

updated 1 day ago

  • Skywork/Skywork-OR1-RL-Data

    Updated May 29, 2025 • 119k rows • 5.19k downloads • 67 likes

    Note: Code split contains 14.1k samples


  • a-m-team/AM-Thinking-v1-RL-Dataset

    Updated May 21, 2025 • 54.8k rows • 299 downloads • 18 likes

    Note: 22k code samples


  • PRIME-RL/Eurus-2-RL-Data

    Updated Feb 19, 2025 • 483k rows • 2.14k downloads • 56 likes

  • PrimeIntellect/Multi-SWE-RL

    Updated Nov 21, 2025 • 4.7k rows • 385 downloads • 1 like

  • PrimeIntellect/SYNTHETIC-2-RL

    Updated Jul 10, 2025 • 156k rows • 82 downloads • 3 likes

  • likaixin/TACO-verified

    Updated Apr 17, 2025 • 12.9k rows • 341 downloads • 19 likes

  • agentica-org/DeepCoder-Preview-Dataset

    Updated Apr 9, 2025 • 25k rows • 2.4k downloads • 105 likes

    Note: Used by NousCoder 14B.
