openai/gsm8k
Post-trained LoRA models for the GSM8K task, based on LLaDA-8B-Instruct, released for the paper "Principled RL for Diffusion LLMs Emerges from a Sequence-Level Perspective".
Base model
GSAI-ML/LLaDA-8B-Instruct
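
A minimal sketch of how a LoRA adapter from this repository might be attached to the base model. It assumes the adapter is stored in PEFT format and that `transformers`, `peft`, and `torch` are installed. `ADAPTER_PATH` and `load_lora_model` are hypothetical names introduced for illustration; the actual adapter location is not stated on this card.

```python
BASE_MODEL_ID = "GSAI-ML/LLaDA-8B-Instruct"  # base model named on this card

# Hypothetical placeholder: replace with the real adapter repo ID or local directory.
ADAPTER_PATH = "path/to/gsm8k-lora-adapter"


def load_lora_model(adapter_path: str = ADAPTER_PATH,
                    base_model_id: str = BASE_MODEL_ID):
    """Load the base model and attach a LoRA adapter (assumed PEFT format)."""
    # Heavy dependencies are imported lazily so that defining this function
    # does not require them to be installed.
    import torch
    from peft import PeftModel
    from transformers import AutoModel, AutoTokenizer

    # LLaDA ships custom modeling code, hence trust_remote_code=True.
    tokenizer = AutoTokenizer.from_pretrained(base_model_id, trust_remote_code=True)
    base = AutoModel.from_pretrained(
        base_model_id,
        trust_remote_code=True,
        torch_dtype=torch.bfloat16,
    )

    # Wrap the base model with the LoRA weights; the base weights stay unchanged.
    model = PeftModel.from_pretrained(base, adapter_path)
    model.eval()
    return tokenizer, model
```

Calling `tokenizer, model = load_lora_model("<adapter directory>")` downloads the 8B base model on first use, so a GPU with enough memory for bfloat16 weights is assumed. LLaDA is a diffusion language model, so sampling is expected to go through the generation routine provided with the base model rather than a standard autoregressive loop.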