kk

zhangyikai

yumyikai@gmail.com

upvoted a paper 4 months ago

V_{0.5}: Generalist Value Model as a Prior for Sparse RL Rollouts

Paper • 2603.10848 • Published Mar 11 • 16

upvoted 4 papers 5 months ago

Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception

Paper • 2602.11858 • Published Feb 12 • 63

ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training

Paper • 2602.06820 • Published Feb 6 • 15

CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs

Paper • 2602.03048 • Published Feb 3 • 32

V_0: A Generalist Value Model for Any Policy at State Zero

Paper • 2602.03584 • Published Feb 3 • 22

upvoted 3 papers about 1 year ago

Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start

Paper • 2505.22334 • Published May 28, 2025 • 36

Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO

Paper • 2505.22453 • Published May 28, 2025 • 46

Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

Paper • 2505.02567 • Published May 5, 2025 • 82

upvoted a paper about 2 years ago

Ovis: Structural Embedding Alignment for Multimodal Large Language Model

Paper • 2405.20797 • Published May 31, 2024 • 34