arxiv:2602.05494
SHILONG DENG
zczlsde
AI & ML interests
RL, NLP
Recent Activity
liked a dataset about 1 month ago
fviffe/PHMsheffield upvoted a paper about 1 month ago
From Natural Language to Extensive-Form Game Representations authored a paper 5 months ago
A Unified Framework for Rethinking Policy Divergence Measures in GRPO