2 24 6

张康宁

zhangkangning

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Latent Reasoning in LLMs as a Vocabulary-Space Superposition

upvoted a paper 11 days ago

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

upvoted a paper 11 days ago

Agents' Last Exam

View all activity

Organizations

None yet

upvoted a paper 4 days ago

Latent Reasoning in LLMs as a Vocabulary-Space Superposition

Paper • 2510.15522 • Published Oct 17, 2025 • 5

upvoted 2 papers 11 days ago

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Paper • 2605.23904 • Published May 22 • 247

Agents' Last Exam

Paper • 2606.05405 • Published 24 days ago • 365

updated a dataset 26 days ago

zhangkangning/mmskills

Viewer • Updated 26 days ago • 515 • 5.55k • 3

upvoted a paper 26 days ago

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

Paper • 2605.31264 • Published 29 days ago • 119

upvoted a paper about 1 month ago

MMSkills: Towards Multimodal Skills for General Visual Agents

Paper • 2605.13527 • Published May 14 • 121

liked a dataset about 1 month ago

zhangkangning/mmskills

Viewer • Updated 26 days ago • 515 • 5.55k • 3

upvoted a paper about 1 month ago

ACE-LoRA: Adaptive Orthogonal Decoupling for Continual Image Editing

Paper • 2605.14948 • Published May 14 • 2

submitted a paper to Daily Papers about 1 month ago

MMSkills: Towards Multimodal Skills for General Visual Agents

Paper • 2605.13527 • Published May 14 • 121

published a dataset about 2 months ago

zhangkangning/mmskills

Viewer • Updated 26 days ago • 515 • 5.55k • 3

upvoted 3 papers 2 months ago

upvoted a paper 3 months ago

MuSEAgent: A Multimodal Reasoning Agent with Stateful Experiences

Paper • 2603.27813 • Published Mar 29 • 23

upvoted 2 papers 4 months ago

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Paper • 2602.08354 • Published Feb 9 • 266

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published Feb 9 • 290

upvoted a paper 5 months ago

Your Group-Relative Advantage Is Biased

Paper • 2601.08521 • Published Jan 13 • 158

liked a dataset 5 months ago

zwhe99/DeepMath-103K

Viewer • Updated May 29, 2025 • 103k • 6.97k • 367

upvoted 2 papers 6 months ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 233

MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

Paper • 2509.21268 • Published Sep 25, 2025 • 104

张康宁

AI & ML interests

Recent Activity

Organizations

zhangkangning's activity