The ultimate guide to RL environments: building and scaling them in the LLM era 📝 Building and scaling RL environments for LLM training • Running • 183
david-thrower/smol-smoltalk-plus-reasoning-synthetic-data Viewer • Updated Jan 31, 2025 • 3k • 157 • 5
NousResearch/DeepHermes-3-Llama-3-3B-Preview Text Generation • 3B • Updated Mar 13, 2025 • 593 • 39
NousResearch/DeepHermes-3-Llama-3-8B-Preview Text Generation • 8B • Updated Apr 10, 2025 • 1.14k • 362