Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
yujunzhou 's Collections
EVOL-RL

EVOL-RL

updated Oct 3, 2025

The models trained with EVOL-RL

Upvote
1

  • yujunzhou/EVOL-RL-MATH-Train-Qwen3-4B-Base

    4B • Updated Sep 13, 2025 • 1

  • yujunzhou/EVOL-RL-MATH-500-Qwen3-4B-Base

    4B • Updated Sep 13, 2025 • 5

  • yujunzhou/EVOL-RL-AIME24-Qwen3-4B-Base

    4B • Updated Aug 17, 2025

  • yujunzhou/EVOL-RL-MATH-Train-Qwen3-8B-Base

    8B • Updated Sep 18, 2025 • 2

  • yujunzhou/EVOL-RL-MATH-500-Qwen3-8B-Base

    8B • Updated Aug 29, 2025

  • yujunzhou/EVOL-RL-AIME24-Qwen3-8B-Base

    8B • Updated Aug 26, 2025 • 1

  • Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation

    Paper • 2509.15194 • Published Sep 18, 2025 • 33
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs