Yihe Deng PRO
ydeng9
AI & ML interests
LLM post-training
Recent Activity
Published a dataset 4 days ago: DuoGuard/duoguard-iter1-data
Published a dataset 4 days ago: DuoGuard/duoguard-seed-data
Updated a dataset about 2 months ago: ydeng9/OpenVLThinker-grpo-hard