Qi
Wandou72
AI & ML interests
None yet
Recent Activity
published
a model
9 days ago
Wandou72/GDPO_test
upvoted
a
paper
4 months ago
VCRL: Variance-based Curriculum Reinforcement Learning for Large
Language Models
updated
a model
8 months ago
Wandou72/Aicrowd_v4_rag_grpo_v2
Organizations
None yet