Kaiyan Zhang
iseesaw
AI & ML interests
Large Reasoning Models, Reinforcement Learning, Agent
Recent Activity
authored a paper about 2 hours ago
Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models
authored a paper about 2 hours ago
From Perception to Cognition: A Survey of Vision-Language Interactive Reasoning in Multimodal Large Language Models
authored a paper about 2 hours ago
JustRL: Scaling a 1.5B LLM with a Simple RL Recipe