| license: mit | |
| datasets: | |
| - TIGER-Lab/AceCode-87K | |
| base_model: | |
| - GSAI-ML/LLaDA-8B-Instruct | |
| Post-Training Full models on code task based on LLaDA-8B-Instruct for the paper Principled RL for Diffusion LLMs Emerges from a Sequence-Level Perspective |
| license: mit | |
| datasets: | |
| - TIGER-Lab/AceCode-87K | |
| base_model: | |
| - GSAI-ML/LLaDA-8B-Instruct | |
| Post-Training Full models on code task based on LLaDA-8B-Instruct for the paper Principled RL for Diffusion LLMs Emerges from a Sequence-Level Perspective |