Kaiyan Zhang's picture

Kaiyan Zhang

iseesaw

·

https://iseesaw.github.io/

AI & ML interests

Large Reasoning Models, Reinforcement Learning, Agent

Recent Activity

authored a paper about 2 hours ago

Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models

authored a paper about 2 hours ago

From Perception to Cognition: A Survey of Vision-Language Interactive Reasoning in Multimodal Large Language Models

authored a paper about 2 hours ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

View all activity

Organizations

iseesaw 's datasets

None public yet