Wenhan Ma

CuteNPC

1 6 14

https://github.com/CuteNPC

CuteNPC

AI & ML interests

Large Language Model

Recent Activity

authored a paper about 13 hours ago

MOPD: Multi-Teacher On-Policy Distillation for Capability Integration in LLM Post-Training

upvoted a paper 5 months ago

HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing

upvoted a paper 6 months ago

GroundingME: Exposing the Visual Grounding Gap in MLLMs through Multi-Dimensional Evaluation

View all activity

Organizations

None yet

authored a paper about 13 hours ago

MOPD: Multi-Teacher On-Policy Distillation for Capability Integration in LLM Post-Training

Paper • 2606.30406 • Published 3 days ago • 4

upvoted a paper 5 months ago

HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing

Paper • 2602.03560 • Published Feb 3 • 49

upvoted a paper 6 months ago

GroundingME: Exposing the Visual Grounding Gap in MLLMs through Multi-Dimensional Evaluation

Paper • 2512.17495 • Published Dec 19, 2025 • 20

upvoted a paper 7 months ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 107

liked a model 8 months ago

Lansechen/deepseek-v2-lite-16b-chat-R1-Distill-bs17k-batch32

Text Generation • 16B • Updated Feb 22, 2025 • 6 • 1

authored a paper 8 months ago

Stabilizing MoE Reinforcement Learning by Aligning Training and Inference Routers

Paper • 2510.11370 • Published Oct 13, 2025 • 4

authored a paper about 1 year ago

MiMo-VL Technical Report

Paper • 2506.03569 • Published Jun 4, 2025 • 81

upvoted a paper about 1 year ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265

authored a paper about 1 year ago

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12, 2025 • 86