ByteDance-Seed/Seed-X-RM-7B
Translation • Updated • 33 • 30
Understanding by Reconstruction: Reversing the Software Development Process for LLM Pretraining
Learn Hard Problems During RL with Reference Guided Fine-tuning