habilisl/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-rugged_stocky_magpie Text Generation • 0.5B • Updated Oct 1, 2025
habilisl/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-rugged_stocky_magpie Text Generation • 0.5B • Updated Oct 1, 2025
habilisl/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-soaring_placid_crow Text Generation • 0.5B • Updated Sep 25, 2025
habilisl/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-soaring_placid_crow Text Generation • 0.5B • Updated Sep 25, 2025
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing Paper • 2509.08721 • Published Sep 10, 2025 • 662