sswoo123 committed (verified) · Commit 654930c · Parent(s): 99a8255

Update README.md

Files changed (1): README.md (+54, -7)
## 📦 Installation

### 1. Clone the repository
```bash
git clone https://github.com/MLP-Lab/KORMo-tutorial.git
cd KORMo-tutorial
```

### 2. Create and activate a virtual environment (optional but recommended)
```bash
uv venv
source .venv/bin/activate
```

### 3. Install KORMo
```bash
uv pip install -e .
```
 
---
## 🚀 Inference Example

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

model_name = "KORMo-Team/KORMo-10B-sft"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype=torch.bfloat16,
    device_map="auto",
    trust_remote_code=True,
)

messages = [
    {"role": "user", "content": "What happens inside a black hole?"}
]

# Build the chat-formatted prompt (thinking mode disabled)
chat_prompt = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True,
    enable_thinking=False,
)

inputs = tokenizer(chat_prompt, return_tensors="pt").to(model.device)

with torch.no_grad():
    output_ids = model.generate(
        **inputs,
        max_new_tokens=1024,
    )

# Decode only the newly generated tokens, skipping the prompt
response = tokenizer.decode(output_ids[0][inputs['input_ids'].shape[1]:], skip_special_tokens=True)
print("Assistant:", response)
```
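The slicing step above (`output_ids[0][inputs['input_ids'].shape[1]:]`) works because `generate()` returns the prompt tokens followed by the newly generated tokens; cutting at the prompt length keeps only the model's reply. The same idea with toy, illustrative token ids:

```python
# generate() output = prompt tokens + new tokens;
# slicing at the prompt length keeps only the generated part.
prompt_ids = [101, 2054, 2003]          # toy "prompt" token ids (illustrative only)
full_output = prompt_ids + [7592, 102]  # toy generate() result: prompt + reply
new_tokens = full_output[len(prompt_ids):]
print(new_tokens)  # → [7592, 102]
```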

---

## 🧠 Enabling Thinking Mode

If you want to enable **thinking** mode, simply set `enable_thinking=True`:

```python
chat_prompt = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True,
    enable_thinking=True,
)
```
---
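With thinking mode on, reasoning-style chat models typically emit their trace before the final answer, delimited by a marker such as `</think>` (the exact delimiter KORMo uses is an assumption here; check the tokenizer's chat template). A minimal sketch for separating trace from answer, under that assumption:

```python
def split_thinking(text, end_marker="</think>"):
    """Split a response into (thinking_trace, final_answer).

    Assumes the model closes its reasoning with `end_marker`;
    if the marker is absent, the whole text is treated as the answer.
    """
    marker_pos = text.find(end_marker)
    if marker_pos == -1:
        return "", text.strip()
    thinking = text[:marker_pos].strip()
    answer = text[marker_pos + len(end_marker):].strip()
    return thinking, answer

trace, answer = split_thinking("Let me reason step by step...</think>The answer is 42.")
print(answer)  # → The answer is 42.
```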
## Contact
- KyungTae Lim, Professor at KAIST. `ktlim@kaist.ac.kr`