Hyungyu seo

hgseo

omnipede@naver.com

AI & ML interests

None yet

Recent Activity

liked a model 14 days ago

CohereLabs/North-Mini-Code-1.0

liked a model 20 days ago

nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16

liked a dataset 20 days ago

nvidia/Nemotron-Pretraining-Code-v3

upvoted a collection 3 months ago

Nemotron-Cascade 2

Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • 4 items • Updated 17 days ago • 50

upvoted 2 papers 5 months ago

SWE-World: Building Software Engineering Agents in Docker-Free Environments

Paper • 2602.03419 • Published Feb 3 • 41

SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training

Paper • 2602.03411 • Published Feb 3 • 39

upvoted a collection 6 months ago

Falcon-H1R

5 items • Updated Mar 2 • 27

upvoted a paper 9 months ago

Reinforcement Learning on Pre-Training Data

Paper • 2509.19249 • Published Sep 23, 2025 • 67

upvoted a collection 9 months ago

[Dataset] FineWeb2 Edu Korean

4 items • Updated Mar 2 • 2

upvoted a collection 10 months ago

[Dataset] K-Corpus

11 items • Updated Jul 24, 2025 • 1

upvoted a paper 10 months ago

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

Paper • 2508.14460 • Published Aug 20, 2025 • 86

upvoted a paper 11 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 190

upvoted a paper about 1 year ago

Trillion 7B Technical Report

Paper • 2504.15431 • Published Apr 21, 2025 • 38