qi tao's picture

1

qi tao

qxzsybzd

AI & ML interests

None yet

Recent Activity

upvoted a paper about 15 hours ago

Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex

View all activity

Organizations

None yet

qxzsybzd 's datasets

None public yet