MHPO: Modulated Hazard-aware Policy Optimization for Stable Reinforcement Learning Paper • 2603.16929 • Published 18 days ago • 13
On Token's Dilemma: Dynamic MoE with Drift-Aware Token Assignment for Continual Learning of Large Vision Language Models Paper • 2603.27481 • Published 3 days ago • 30
HyperAlign: Hypernetwork for Efficient Test-Time Alignment of Diffusion Models Paper • 2601.15968 • Published Jan 22 • 9
The Smol Training Playbook 📚 • The secrets to building world-class LLMs • 3.07k likes
The Art of Scaling Reinforcement Learning Compute for LLMs Paper • 2510.13786 • Published Oct 15, 2025 • 33
RLAD: Training LLMs to Discover Abstractions for Solving Reasoning Problems Paper • 2510.02263 • Published Oct 2, 2025 • 9
MultiEdit: Advancing Instruction-based Image Editing on Diverse and Challenging Tasks Paper • 2509.14638 • Published Sep 18, 2025 • 14
Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation Paper • 2509.15194 • Published Sep 18, 2025 • 33
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing Paper • 2509.08721 • Published Sep 10, 2025 • 664
MoBE: Mixture-of-Basis-Experts for Compressing MoE-based LLMs Paper • 2508.05257 • Published Aug 7, 2025 • 13
Grove MoE: Towards Efficient and Superior MoE LLMs with Adjugate Experts Paper • 2508.07785 • Published Aug 11, 2025 • 29