aripos1 committed on
Commit 86de414 · verified · 1 Parent(s): 51a54f9

Update README.md

Files changed (1)
  1. README.md +32 -57
README.md CHANGED
@@ -4,65 +4,40 @@ datasets:
  - aripos1/gorani_dataset
  language:
  - ko
  base_model:
  - unsloth/Llama-3.2-3B-Instruct-bnb-4bit
  pipeline_tag: text-generation
  library_name: transformers
  ---
12
- # Gorani LoRA Merged (based on Llama 3.2-3B)
-
- ## 📌 Model Overview
- - **Model Name**: Gorani LoRA Merged
- - **Base Model**: `unsloth/Llama-3.2-3B-Instruct-bnb-4bit`
- - **Adapter Type**: LoRA
- - **Task**: Text Generation (translation, chatbot)
- - **License**: Apache 2.0
- - **Author**: aripos1
- - **Fine-Tuned Dataset**: [Custom Korean-English Dataset]
-
- ## 🛠 How to Use
24
- ```python
- from transformers import AutoModelForCausalLM, AutoTokenizer
-
- model_name = "aripos1/gorani-lora-merged"
- model = AutoModelForCausalLM.from_pretrained(model_name)
- tokenizer = AutoTokenizer.from_pretrained(model_name)
-
- text = "Translate this sentence from Korean to English: 안녕하세요."
- inputs = tokenizer(text, return_tensors="pt")
- outputs = model.generate(**inputs)
- print(tokenizer.decode(outputs[0], skip_special_tokens=True))
- ```
35
- ## 📊 Model Performance
-
- | Version | Comet Score | BERT Score |
- |---------|-------------|------------|
- | v1      | 0.78        | 0.85       |
- | v2      | 0.82        | 0.88       |
- | v3      | 0.85        | 0.90       |
-
- ## 📂 Training Details
- - LoRA applied: ✅
- - Dataset: Custom dataset (Korean-English parallel corpus)
- - Frameworks: Hugging Face transformers, peft
- - Training Hardware: NVIDIA A100 40GB
- - Training Time: 24 hours
-
- ## 📌 License
- This model is licensed under the Apache 2.0 License.
48
-
- ---
-
- ## ✅ **2️⃣ How to Upload the Model Card**
- The Model Card takes effect automatically once the `README.md` file is uploaded to the Hugging Face repository.
-
- ### **🔹 Method 1: Upload with Python code**
- ```python
- from huggingface_hub import HfApi
-
- api = HfApi()
- repo_id = "aripos1/gorani-lora-merged"
-
- # Upload the README.md file
- api.upload_file(
-     path_or_fileobj="README.md",
-     path_in_repo="README.md",
-     repo_id=repo_id
- )
-
- print("✅ Model Card (README.md) upload complete!")
- ```
 
  - aripos1/gorani_dataset
  language:
  - ko
+ - en
+ - ja
  base_model:
  - unsloth/Llama-3.2-3B-Instruct-bnb-4bit
  pipeline_tag: text-generation
  library_name: transformers
  ---
14
+ # Gorani Model Card
+
+ ## Introduction
+ This model is built for translation. To produce accurate translations of native Korean vocabulary, **unsloth/Llama-3.2-1B-Instruct-bnb-4bit** was trained on a mix of Korean, English, and Japanese language data, producing **gorani-1B**.
+ gorani currently supports translation between **Korean, English, and Japanese** only.
+
+ ### Model Information
+ - **Developer**: haeun0420
+ - **Model type**: **gorani-1B**, a **1B**-parameter model based on **llama**
+ - **Supported languages**: Korean, English, Japanese
+ - **License**: **llama**
+
26
+ ## Training Hyperparameters
+ - **per_device_train_batch_size**: 8
+ - **gradient_accumulation_steps**: 1
+ - **warmup_steps**: 5
+ - **learning_rate**: 2e-4
+ - **fp16**: `not is_bfloat16_supported()`
+ - **num_train_epochs**: 3
+ - **weight_decay**: 0.01
+ - **lr_scheduler_type**: "linear"
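The hyperparameter list above maps naturally onto a `transformers` `TrainingArguments` config. This is a sketch, not the author's actual training script: the output directory name is hypothetical, and the README's `not is_bfloat16_supported()` (an unsloth helper) is approximated here with PyTorch's own capability check.

```python
import torch
from transformers import TrainingArguments

# Prefer bf16 on GPUs that support it, otherwise fall back to fp16,
# mirroring the README's `fp16 = not is_bfloat16_supported()`.
bf16_ok = torch.cuda.is_available() and torch.cuda.is_bf16_supported()

args = TrainingArguments(
    output_dir="gorani-1b-finetune",  # hypothetical path
    per_device_train_batch_size=8,
    gradient_accumulation_steps=1,
    warmup_steps=5,
    learning_rate=2e-4,
    fp16=not bf16_ok,
    bf16=bf16_ok,
    num_train_epochs=3,
    weight_decay=0.01,
    lr_scheduler_type="linear",
)
```

These arguments would then be passed to a `Trainer` (or unsloth/TRL `SFTTrainer`) along with the model and dataset.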
35
+
+ ## Training Data
+ [Dataset link](https://huggingface.co/datasets/aripos1/gorani_dataset)
+
+ ## Training Performance Comparison
+ ![Score comparison graph]
+
+ ## Training Results
+ ![Training loss graph: steps vs. training metrics]
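Since this commit drops the old usage example, a note on prompting may help: Llama 3.2 Instruct derivatives expect a chat-formatted prompt rather than raw text, which `tokenizer.apply_chat_template` normally builds. The sketch below spells out that standard Llama 3 special-token layout by hand; the default system message is made up for illustration.

```python
def build_llama3_prompt(user_message: str,
                        system: str = "You are a helpful translator.") -> str:
    """Assemble a Llama 3 Instruct chat prompt by hand.

    Normally tokenizer.apply_chat_template() produces this string;
    writing it out shows which special tokens frame each turn.
    """
    return (
        "<|begin_of_text|>"
        "<|start_header_id|>system<|end_header_id|>\n\n"
        f"{system}<|eot_id|>"
        "<|start_header_id|>user<|end_header_id|>\n\n"
        f"{user_message}<|eot_id|>"
        "<|start_header_id|>assistant<|end_header_id|>\n\n"
    )


prompt = build_llama3_prompt(
    "Translate this sentence from Korean to English: 안녕하세요."
)
```

The model then generates the assistant turn after the trailing assistant header.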