Collections
Discover the best community collections!
Collections trending this week
-
lyf07/LLaMAX3-8B-Alpaca-WALAR
Translation • 8B • Updated • 59 -
lyf07/Qwen3-8B-WALAR
Translation • 8B • Updated • 71 -
lyf07/Translategemma-4B-it-WALAR
Translation • 769k • Updated • 64 -
Mending the Holes: Mitigating Reward Hacking in Reinforcement Learning for Multilingual Translation
Paper • 2603.13045 • Published • 1
-
lyf07/LLaMAX3-8B-Alpaca-WALAR
Translation • 8B • Updated • 59 -
lyf07/Qwen3-8B-WALAR
Translation • 8B • Updated • 71 -
lyf07/Translategemma-4B-it-WALAR
Translation • 769k • Updated • 64 -
Mending the Holes: Mitigating Reward Hacking in Reinforcement Learning for Multilingual Translation
Paper • 2603.13045 • Published • 1