9 10

kk

zhangyikai

yumyikai@gmail.com

AI & ML interests

None yet

Recent Activity

upvoted a paper 21 days ago

V_{0.5}: Generalist Value Model as a Prior for Sparse RL Rollouts

updated a Space about 2 months ago

Now-Join-Us/Generalist-Value-Model-V0

upvoted a paper about 2 months ago

Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception

View all activity

Organizations

upvoted a paper 21 days ago

V_{0.5}: Generalist Value Model as a Prior for Sparse RL Rollouts

Paper • 2603.10848 • Published 22 days ago • 13

updated a Space about 2 months ago

Generalist Value Model V0

😻

Predict model performance on new instructions instantly

upvoted 3 papers about 2 months ago

Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception

Paper • 2602.11858 • Published Feb 12 • 62

ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training

Paper • 2602.06820 • Published Feb 6 • 13

CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs

Paper • 2602.03048 • Published Feb 3 • 32

liked a Space about 2 months ago

Generalist Value Model V0

😻

Predict model performance on new instructions instantly

upvoted a paper about 2 months ago

V_0: A Generalist Value Model for Any Policy at State Zero

Paper • 2602.03584 • Published Feb 3 • 22

updated a model about 2 months ago

Now-Join-Us/Generalist-Value-Model-V0

Updated Feb 4

published a Space about 2 months ago

Generalist Value Model V0

😻

Predict model performance on new instructions instantly

published a model about 2 months ago

Now-Join-Us/Generalist-Value-Model-V0

Updated Feb 4

liked a model 9 months ago

AIDC-AI/Ovis-U1-3B

Any-to-Any • Updated Jul 3, 2025 • 1.54k • 215

upvoted 2 papers 10 months ago

Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start

Paper • 2505.22334 • Published May 28, 2025 • 36

Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO

Paper • 2505.22453 • Published May 28, 2025 • 46

upvoted a paper 11 months ago

Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

Paper • 2505.02567 • Published May 5, 2025 • 82

updated 6 models about 1 year ago

kk

AI & ML interests

Recent Activity

Organizations

zhangyikai's activity

Generalist Value Model V0

Generalist Value Model V0

Generalist Value Model V0