The models trained with EVOL-RL
Yujun Zhou
yujunzhou
AI & ML interests
None yet
Recent Activity
new activity
about 21 hours ago
yujunzhou/AIME-TTT-OctoThinker-8B-Hybrid-Base-TTRL: Running in MSTY Studio
submitted a paper
3 months ago
Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning
Organizations
None yet