Yun Qu's picture

Yun Qu

yunqu

·

https://scholar.google.com/citations?user=l9Ky9goAAAAJ&hl=zh-CN&oi=ao

AI & ML interests

None yet

Recent Activity

authored a paper about 7 hours ago

Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex

upvoted a paper about 12 hours ago

Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex

submitted a paper about 12 hours ago

Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex

View all activity

Organizations

None yet

Papers 9

arxiv:2605.06139

arxiv:2602.01970

arxiv:2510.16882

arxiv:2507.04632

models 0

None public yet

datasets 0

None public yet