ESPO-Code / README.md
JingyangOu's picture
Update README.md
9fb6869 verified
---
license: mit
datasets:
- TIGER-Lab/AceCode-87K
base_model:
- GSAI-ML/LLaDA-8B-Instruct
---
Post-Training Full models on code task based on LLaDA-8B-Instruct for the paper Principled RL for Diffusion LLMs Emerges from a Sequence-Level Perspective