QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published Oct 13 • 176
KORMo pretraining datasets Collection The pretraining datasets for KORMo-10B were collected from diverse, publicly available sources. • 14 items • Updated Oct 13 • 19
Tri Series Collection Introducing our new series of models: Tri-7B, Tri-21B, and Tri-70B-preview-SFT • 10 items • Updated Sep 10 • 8
Article Efficient LLM Pretraining: Packed Sequences and Masked Attention • Published Oct 7, 2024 • 61
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training Paper • 2504.13161 • Published Apr 17 • 93
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published Jan 28 • 123
MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code Paper • 2410.08196 • Published Oct 10, 2024 • 47
VLR-Bench: Multilingual Benchmark Dataset for Vision-Language Retrieval Augmented Generation Paper • 2412.10151 • Published Dec 13, 2024 • 7
Transformer Explainer: Interactive Learning of Text-Generative Models Paper • 2408.04619 • Published Aug 8, 2024 • 172
Understanding Reference Policies in Direct Preference Optimization Paper • 2407.13709 • Published Jul 18, 2024 • 17
X-LLaVA: Optimizing Bilingual Large Vision-Language Alignment Paper • 2403.11399 • Published Mar 18, 2024 • 6
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding Paper • 2404.16710 • Published Apr 25, 2024 • 80