GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published 18 days ago • 208
HeartMuLa: A Family of Open Sourced Music Foundation Models Paper • 2601.10547 • Published 11 days ago • 37 • 4
view post Post 2571 I am very excited to see the release of nyuuzyou/gitee-code. This is exactly what I have been looking for. Thank you to @nyuuzyou for his hard work on this. See translation 3 replies · 🤗 6 6 + Reply