# 🧠 KokoroChat-High (LoRA Adapter for Japanese Counseling Dialogue)
This repository contains the **LoRA adapter weights** for KokoroChat-High, a version of the KokoroChat model fine-tuned on **high-feedback counseling dialogues** (client feedback score ≥ 70 and ≤ 98) from the [KokoroChat dataset](https://huggingface.co/datasets/UEC-InabaLab/KokoroChat).
The base model is [tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.3](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.3), and this adapter enhances it for generating **high-quality, empathetic Japanese counseling responses**.

```python
from transformers import AutoTokenizer
from peft import PeftModel

# === Base + Adapter Paths ===
base_model_id = "tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.3"
adapter_id = "UEC-InabaLab/KokoroChat-High"

# === Load Tokenizer ===
tokenizer = AutoTokenizer.from_pretrained(base_model_id)
```

- 📁 **Dataset**: [KokoroChat Dataset on Hugging Face](https://huggingface.co/datasets/UEC-InabaLab/KokoroChat)
- 🧠 **Base Model**: [tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.3](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.3)
- 📄 **Paper**: [KokoroChat: A Japanese Psychological Counseling Dialogue Dataset (ACL 2025)](https://drive.google.com/file/d/1T6XgvZii8rZ1kKLgOUGqm3BMvqQAvxEM/view?usp=sharing)
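
The diff above shows only fragments of the usage snippet (the paths, the tokenizer, and the final `print` of the decoded output). A minimal end-to-end sketch of the remaining steps, assuming `torch`, `transformers`, and `peft` are installed and the checkpoints are accessible; the generation settings and the `generate_reply` helper name are illustrative, not part of this repository:

```python
base_model_id = "tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.3"
adapter_id = "UEC-InabaLab/KokoroChat-High"


def generate_reply(user_message: str, max_new_tokens: int = 256) -> str:
    """Load the base model, attach the LoRA adapter, and generate one counseling reply."""
    # Imports are local so the sketch can be read without the libraries installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from peft import PeftModel

    tokenizer = AutoTokenizer.from_pretrained(base_model_id)
    base_model = AutoModelForCausalLM.from_pretrained(
        base_model_id, torch_dtype=torch.bfloat16, device_map="auto"
    )
    # Attach the KokoroChat-High LoRA adapter on top of the base model.
    model = PeftModel.from_pretrained(base_model, adapter_id)

    # The base model is instruction-tuned, so format input with its chat template.
    messages = [{"role": "user", "content": user_message}]
    input_ids = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)

    output = model.generate(input_ids, max_new_tokens=max_new_tokens)
    # Decode only the newly generated tokens, as in the README's final print call.
    return tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True)


if __name__ == "__main__":
    # Sample client utterance: "I've been struggling to sleep lately."
    print(generate_reply("最近眠れなくて悩んでいます。"))
```

The helper decodes from `input_ids.shape[-1]` onward so only the model's reply is returned, matching the slicing in the README's own snippet.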

## 📄 Citation

If you use this dataset, please cite the following paper:

```bibtex
@inproceedings{qi2025kokorochat,
    title     = {KokoroChat: A Japanese Psychological Counseling Dialogue Dataset Collected via Role-Playing by Trained Counselors},
    author    = {Zhiyang Qi and Takumasa Kaneko and Keiko Takamizo and Mariko Ukiyo and Michimasa Inaba},
    booktitle = {Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics},
    year      = {2025},
    url       = {https://github.com/UEC-InabaLab/KokoroChat}
}
```