---
title: README
emoji: 📚
colorFrom: green
colorTo: blue
sdk: static
pinned: false
---
An open-source hub for Korean language data and model research
---

## 🧠 Open Models

- **KORMo-Team/KORMo-tokenizer** — A tokenizer optimized for bilingual (Korean–English) language representation
- **KORMo-Team/KORMo-10B-base** — The KORMo-10B pretrained model, trained on large-scale Korean and English corpora
- **KORMo-Team/KORMo-10B-sft** — A fine-tuned model enhanced with long-context reasoning and instruction-following data
- **KORMo-Team/KORMo-10B-inst** — The final instruction-tuned model with reasoning enhancement and RL (coming soon; currently awaiting GPU availability)

> 💡 You can explore the full training history and checkpoints in each model's **`Revisions` tab** on Hugging Face.

---

## 🔗 Links

- **Technical Report** — https://arxiv.org/pdf/2510.09426
- **Technical Report (Slides, Korean)** — https://github.com/MLP-Lab/KORMo-tutorial/blob/main/20251009_MLP_KORMo(Korean).pdf
- **Tutorial on GitHub** — https://github.com/MLP-Lab/KORMo-tutorial
- **Tutorial on YouTube** — https://www.youtube.com/@MLPLab

---

### 📖 About KORMo

KORMo is an open research initiative dedicated to advancing Korean language understanding and generation through large-scale, fully open-source models and datasets. We aim to make Korean NLP research transparent, reproducible, and accessible to the global community.
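As an illustration, the checkpoints listed above can be loaded with the standard Hugging Face `transformers` Auto classes. This is a minimal sketch, not an official recipe: the repo IDs come from the model list above, but the loading arguments (dtype, `trust_remote_code`, device placement) are assumptions — check each model card for the recommended settings.

```python
# Minimal sketch of loading a KORMo checkpoint with Hugging Face transformers.
# Repo IDs are taken from the model list above; all loading kwargs are
# assumptions — consult each model card before use.

TOKENIZER_REPO = "KORMo-Team/KORMo-tokenizer"
MODEL_REPO = "KORMo-Team/KORMo-10B-sft"


def load_kormo(model_repo: str = MODEL_REPO, tokenizer_repo: str = TOKENIZER_REPO):
    """Download (or load from the local cache) the tokenizer and model."""
    # Imported lazily so this module can be inspected without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(tokenizer_repo)
    # ~10B parameters: loading the full model requires substantial GPU memory.
    model = AutoModelForCausalLM.from_pretrained(model_repo)
    return tokenizer, model


if __name__ == "__main__":
    tokenizer, model = load_kormo()
    prompt = "한국어로 자기소개를 해줘."  # "Introduce yourself in Korean."
    inputs = tokenizer(prompt, return_tensors="pt")
    outputs = model.generate(**inputs, max_new_tokens=64)
    print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

The full training history of each repo is browsable via its `Revisions` tab, so a specific intermediate checkpoint can also be loaded by passing that revision name to `from_pretrained(..., revision=...)`.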