devmeta/maple-npc-llama3.2-3B-lora

Text Generation


devmeta committed on Mar 14

Commit d3317b4 · verified

1 Parent(s): d3cc5e1

Update README.md

Files changed (1)

README.md +84 -16


@@ -1,22 +1,90 @@
 ---
-base_model: unsloth/llama-3.2-3b-instruct-bnb-4bit
-tags:
-- text-generation-inference
-- transformers
-- unsloth
-- llama
-- trl
-license: apache-2.0
-language:
-- en
 ---
-# Uploaded  model
-- **Developed by:** devmeta
-- **License:** apache-2.0
-- **Finetuned from model :** unsloth/llama-3.2-3b-instruct-bnb-4bit
-This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth)
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

+# 🍁 maple-npc-llama3.2-3B-lora
+**A LoRA fine-tuned model specialized for MapleStory NPC dialogue**
+> Based on Llama 3.2 3B Instruct | Unsloth LoRA | Specialized for Korean-language game NPCs
+---
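+## 📌 Model Overview
+This model generates NPC dialogue specialized for the MapleStory universe.
+It is steered through prompts based on the **Big Five personality model**,
+giving each NPC a distinctive speech style and personality.
+- Base model: `unsloth/Llama-3.2-3B-Instruct`
+- Training method: LoRA (r=16, alpha=32)
+- Training data: 134 MapleStory NPC dialogue samples
+- Training time: 33 seconds (A100, with Unsloth's 2x speedup)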
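
For reference, a LoRA adapter in its standard formulation keeps the base weight $W_0$ frozen and learns a low-rank update. Assuming the conventional $\alpha / r$ scaling (the card does not say whether a variant such as rank-stabilized scaling was used), the listed settings r=16 and alpha=32 give:

$$
W = W_0 + \frac{\alpha}{r} B A, \qquad B \in \mathbb{R}^{d_{\text{out}} \times r},\quad A \in \mathbb{R}^{r \times d_{\text{in}}}, \qquad \frac{\alpha}{r} = \frac{32}{16} = 2
$$

Here $d_{\text{in}}$ and $d_{\text{out}}$ are the dimensions of whichever layer the adapter is attached to. The card does not list the target modules, so they are left symbolic.
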
 ---
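+## 🎮 Key Features
+- Reflects the MapleStory universe (Henesys, Ellinia, Ereve, and more)
+- NPC personality control based on the Big Five personality model
+- Distinct speech styles per class (Arch Mage, Phantom, Mercedes, and others)
+- Supports injecting regional context via RAG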
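
The card does not show how Big Five scores or retrieved region text are turned into a prompt. As a minimal sketch, assuming 0-1 trait scores and a prompt shaped like the one in the usage section, a prompt builder could look like the following. The helper names, the thresholds, and the Korean wording are all hypothetical and are not taken from the model's training data.

```python
# Hypothetical helper: trait names, thresholds, and prompt wording are
# illustrative assumptions, not the author's actual prompt format.
BIG_FIVE_KO = {
    "openness": "개방성",
    "conscientiousness": "성실성",
    "extraversion": "외향성",
    "agreeableness": "친화성",
    "neuroticism": "신경성",
}


def describe_traits(scores, high=0.7, low=0.3):
    """Turn 0-1 trait scores into short Korean persona sentences."""
    lines = []
    for key, name in BIG_FIVE_KO.items():
        score = scores.get(key)
        if score is None:
            continue
        if score >= high:
            lines.append(f"{name}이 높은 편이야.")  # "<trait> is on the high side."
        elif score <= low:
            lines.append(f"{name}이 낮은 편이야.")  # "<trait> is on the low side."
        # Mid-range scores are left out so the prompt stays short.
    return lines


def build_npc_prompt(role, scores, user_message, region_context=None):
    """Assemble a persona prompt that ends with an open "NPC:" turn."""
    parts = [f"너는 메이플스토리 {role} NPC야."]
    parts.extend(describe_traits(scores))
    if region_context:
        # Retrieved region text (e.g. from a RAG lookup) goes before the dialogue.
        parts.append(f"지역 정보: {region_context}")
    parts.append(f"사용자: {user_message}")
    parts.append("NPC:")
    return "\n".join(parts)


prompt = build_npc_prompt(
    role="헤네시스 마을의 신관",
    scores={"agreeableness": 0.9, "neuroticism": 0.1},
    user_message="안녕하세요!",
)
print(prompt)
```

Keeping the persona as plain sentences means the same builder works whether the scores come from a slider UI or a config file. Whether the fine-tuned model responds well to this exact phrasing would need to be checked against its training prompts.
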
 ---
+## 🚀 Usage
+```python
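+from transformers import AutoTokenizer, AutoModelForCausalLM
+
+# Load the model and tokenizer from the Hub.
+model = AutoModelForCausalLM.from_pretrained("devmeta/maple-npc-llama3.2-3B-lora")
+tokenizer = AutoTokenizer.from_pretrained("devmeta/maple-npc-llama3.2-3B-lora")
+
+# The prompt stays in Korean because the model targets Korean NPC dialogue.
+# In English: "You are a priest NPC in the MapleStory town of Henesys.
+# You are highly agreeable and have a warm personality. User: Hello! NPC:"
+prompt = """너는 메이플스토리 헤네시스 마을의 신관 NPC야.
+친화성이 높고 따뜻한 성격이야.
+사용자: 안녕하세요!
+NPC:"""
+
+# Tokenize, generate up to 100 new tokens, and decode the result.
+inputs = tokenizer(prompt, return_tensors="pt")
+outputs = model.generate(**inputs, max_new_tokens=100)
+print(tokenizer.decode(outputs[0], skip_special_tokens=True))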
+```
+---
+## 📊 Training Results
+| Item | Value |
+|------|-----|
+| Base model | Llama 3.2 3B Instruct |
+| Training samples | 134 |
+| Epochs | 2 |
+| Initial loss | 3.25 |
+| Final loss | 2.41 |
+| Training time | 33 seconds (A100) |
+| Trainable parameters | 0.57% (LoRA) |
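
To put the two loss figures in context, the short calculation below derives the absolute and relative drop from the table values. The perplexity conversion is only a sketch: it assumes the reported loss is mean token-level cross-entropy in nats, which the card does not state.

```python
import math

# Values copied from the results table.
initial_loss = 3.25
final_loss = 2.41

absolute_drop = initial_loss - final_loss
relative_drop = absolute_drop / initial_loss

# Assumption: the loss is mean token-level cross-entropy in nats,
# so exp(loss) can be read as perplexity.
initial_ppl = math.exp(initial_loss)
final_ppl = math.exp(final_loss)

print(f"absolute drop: {absolute_drop:.2f}")
print(f"relative drop: {relative_drop:.1%}")
print(f"perplexity: {initial_ppl:.1f} -> {final_ppl:.1f}")
```

The loss falls by about 0.84, roughly 26% of its starting value. With only 134 samples and 2 epochs, these are training-loss figures and say nothing about held-out quality.
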
+---
+## 🔗 Related Projects
+- **MaplePersona**: a web app for controlling NPC personality with Big Five sliders
+  - Real-time control with Gemini Flash + RAG + Big Five
+  - [Live Demo](#) | [Screenshots](#)
+---
+## 👤 Developer
+**Taewan Kim** | Level Design Part Lead, Nexon KartRider (20 years)
+- Ph.D. program, Sogang University Graduate School of Metaverse
+- GitHub: [Taewan627](https://github.com/Taewan627)
+```
+---
+## Additional To-Do
+Add model tags as well. Settings → Tags:
+```
+korean
+game-npc
+maplestory
+llama
+lora
+big-five