arxiv:2506.06395
Li Pengyi
LiPengyi29
AI & ML interests
None yet
Recent Activity
upvoted a paper 1 day ago
How Far Can Unsupervised RLVR Scale LLM Training? upvoted a paper about 1 month ago
Back to Basics: Revisiting Exploration in Reinforcement Learning for LLM Reasoning via Generative Probabilities submitted
a paper
about 1 month ago
Back to Basics: Revisiting Exploration in Reinforcement Learning for LLM Reasoning via Generative Probabilities