_'s picture

9

_

sijyy

AI & ML interests

None yet

Recent Activity

upvoted a paper 16 days ago

Kwai Keye-VL-2.0 Technical Report

upvoted a paper 5 months ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

upvoted a paper 5 months ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

View all activity

Organizations

None yet

upvoted a paper 16 days ago

Kwai Keye-VL-2.0 Technical Report

Paper • 2606.10651 • Published 18 days ago • 189

upvoted 2 papers 5 months ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published Jan 14 • 92

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published Jan 13 • 150

upvoted a paper 8 months ago

Agentic Entropy-Balanced Policy Optimization

Paper • 2510.14545 • Published Oct 16, 2025 • 109

upvoted 4 papers 11 months ago

We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning

Paper • 2508.10433 • Published Aug 14, 2025 • 146

Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization

Paper • 2508.07629 • Published Aug 11, 2025 • 43

ReasonRank: Empowering Passage Ranking with Strong Reasoning Ability

Paper • 2508.07050 • Published Aug 9, 2025 • 117

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26, 2025 • 161

upvoted a paper about 1 year ago

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published May 15, 2025 • 119