UEC-InabaLab
/

Llama-3.1-KokoroChat-High

Text Generation

dialogue-system

Model card Files Files and versions

ZhiyangQi97 commited on Jun 1, 2025

Commit

53dfd76

·

verified ·

1 Parent(s): 4518ac1

Update README.md

Files changed (1) hide show

README.md +8 -3

README.md CHANGED Viewed

@@ -40,19 +40,23 @@ The base model is [tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.3](https://hug
 ---
-## ⚙️ Usage Instructions
-This repository contains **only the LoRA adapter**. You must load the original base model and then apply this adapter:
 ```python
 from transformers import AutoTokenizer, AutoModelForCausalLM, BitsAndBytesConfig
 from peft import PeftModel
 base_model_id = "tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.3"
-adapter_id = "your-username/kokorochat-high-lora"
 tokenizer = AutoTokenizer.from_pretrained(base_model_id)
 base_model = AutoModelForCausalLM.from_pretrained(
     base_model_id,
     device_map="auto",
@@ -60,6 +64,7 @@ base_model = AutoModelForCausalLM.from_pretrained(
     quantization_config=BitsAndBytesConfig(load_in_4bit=True)
 )
 model = PeftModel.from_pretrained(base_model, adapter_id)
 model = model.merge_and_unload()
 ```

 ---
+## ⚙️ Usage Instructions (LoRA Adapter)
+This repository only contains the **adapter weights**.
+You must load the original base model and then apply this adapter.
 ```python
 from transformers import AutoTokenizer, AutoModelForCausalLM, BitsAndBytesConfig
 from peft import PeftModel
+# === Base + Adapter Paths ===
 base_model_id = "tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.3"
+adapter_id = "your-username/kokorochat-lora"
+# === Load Tokenizer ===
 tokenizer = AutoTokenizer.from_pretrained(base_model_id)
+# === Load Base Model ===
 base_model = AutoModelForCausalLM.from_pretrained(
     base_model_id,
     device_map="auto",
     quantization_config=BitsAndBytesConfig(load_in_4bit=True)
 )
+# === Load & Merge LoRA ===
 model = PeftModel.from_pretrained(base_model, adapter_id)
 model = model.merge_and_unload()
 ```