EVOL-RL - a yujunzhou Collection

yujunzhou 's Collections

EVOL-RL

updated 27 days ago

The models trained with EVOL-RL

yujunzhou/EVOL-RL-MATH-Train-Qwen3-4B-Base

4B • Updated Sep 13, 2025 • 3
yujunzhou/EVOL-RL-AIME24-Qwen3-4B-Base

4B • Updated Aug 17, 2025 • 2
yujunzhou/EVOL-RL-MATH-Train-Qwen3-8B-Base

8B • Updated Sep 18, 2025
yujunzhou/EVOL-RL-MATH-500-Qwen3-8B-Base

8B • Updated Aug 29, 2025 • 1
yujunzhou/EVOL-RL-AIME24-Qwen3-8B-Base

8B • Updated Aug 26, 2025 • 2
Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation

Paper • 2509.15194 • Published Sep 18, 2025 • 33