GSAI-ML
/

ESPO-Code

Model card Files Files and versions

ESPO-Code / README.md

JingyangOu's picture

Update README.md

9fb6869 verified 14 days ago

|

history blame contribute delete

250 Bytes

	---
	license: mit
	datasets:
	- TIGER-Lab/AceCode-87K
	base_model:
	- GSAI-ML/LLaDA-8B-Instruct
	---

	Post-Training Full models on code task based on LLaDA-8B-Instruct for the paper Principled RL for Diffusion LLMs Emerges from a Sequence-Level Perspective