arxiv:2512.15687
Zhenwen Liang
invokerliang
AI & ML interests
Mathematical Reasoning.
Recent Activity
authored a paper about 1 month ago
Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning
upvoted a paper about 1 month ago
Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning