Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing Paper • 2509.08721 • Published Sep 10 • 660
garos/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-squinting_strong_duck Text Generation • 0.5B • Updated Apr 30 • 9
garos/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-noisy_gentle_alpaca Text Generation • 0.5B • Updated Apr 30 • 6
garos/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-leaping_territorial_skunk Text Generation • 0.5B • Updated Apr 30 • 7
garos/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-beaked_deadly_crab Text Generation • 0.5B • Updated Apr 30 • 18