kennyKK

kennykkk25

3

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 22 days ago

Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning

upvoted an article 5 months ago

Forge: Scalable Agent RL Framework and Algorithm

upvoted a paper over 1 year ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

View all activity

Organizations

upvoted a paper 22 days ago

Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning

Paper • 2606.10968 • Published 23 days ago • 42

upvoted an article 5 months ago

Article

Forge: Scalable Agent RL Framework and Algorithm

MiniMax-AI

•

Feb 13

• 156

upvoted a paper over 1 year ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10, 2025 • 153