Sungmin Jo's picture

Sungmin Jo

jsm0424

·

jsm0424

AI & ML interests

RLVR, LLM Reasoning

Recent Activity

upvoted a paper 25 days ago

Decentralized Instruction Tuning: Conflict-Aware Splitting and Weight Merging

upvoted a paper about 1 month ago

Grounding World Simulation Models in a Real-World Metropolis

upvoted a collection 3 months ago

View all activity

Organizations

None yet

upvoted a paper 25 days ago

Decentralized Instruction Tuning: Conflict-Aware Splitting and Weight Merging

Paper • 2606.01717 • Published 28 days ago • 21

upvoted a paper about 1 month ago

Grounding World Simulation Models in a Real-World Metropolis

Paper • 2603.15583 • Published Mar 16 • 155

upvoted 2 collections 3 months ago

Agent

130 items • Updated 3 days ago • 13

Data

19 items • Updated 5 days ago • 1

upvoted a paper 4 months ago

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published Mar 10 • 158

upvoted 8 papers 5 months ago

Towards Autonomous Mathematics Research

Paper • 2602.10177 • Published Feb 10 • 36

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 113

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published Feb 9 • 290

The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models

Paper • 2601.15165 • Published Jan 21 • 75

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 233

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 329

K-EXAONE Technical Report

Paper • 2601.01739 • Published Jan 5 • 95

Solar Open Technical Report

Paper • 2601.07022 • Published Jan 11 • 67