myanmar-llm-train / README.md
amkyawdev's picture
Upload README.md with huggingface_hub
76918e4 verified
# 🧠 Myanmar LLM Training
Fine-tune **Qwen2.5-0.5B-Instruct** with Myanmar language dataset.
## ⚑ No License Required!
This model is fully open. No Llama license needed!
## πŸ“‹ Requirements
- Python 3.8+
- GPU with 6GB+ VRAM
- HuggingFace Account
## πŸš€ Quick Start
### 1. Install dependencies
```bash
pip install -r requirements.txt
```
### 2. Login to HuggingFace
```bash
huggingface-cli login
```
### 3. Run training
```bash
python train.py
```
## βš™οΈ Configuration
| Parameter | Default | Description |
|-----------|---------|-------------|
| MODEL_NAME | Qwen/Qwen2.5-0.5B-Instruct | Base model (fully open!) |
| num_train_epochs | 3 | Training iterations |
| per_device_train_batch_size | 4 | Batch size |
| gradient_accumulation_steps | 4 | Effective batch = 16 |
| learning_rate | 2e-5 | Learning rate |
## πŸ“Š Features
- βœ… Fully open model - α€œα€­α€―α€„α€Ία€…α€„α€Ία€™α€œα€­α€―α€•α€«α€žα€Šα€Ία‹
- βœ… FP16 precision - α€•α€­α€―α€™α€­α€―α€™α€Όα€”α€Ία€•α€«α€žα€Šα€Ία‹
- βœ… Gradient checkpointing - Memory α€α€»α€½α€±α€α€¬α€•α€«α€žα€Šα€Ία‹
- βœ… Test/Validation evaluation - α€”α€Ύα€…α€Ία€α€―α€œα€―α€Άα€Έα€‘α€α€½α€€α€Ί α€…α€™α€Ία€Έα€žα€•α€Ία€•α€«α€žα€Šα€Ία‹
## πŸ“Š Training Data
Dataset: [amkyawdev/AmkyawDev-Dataset](https://huggingface.co/datasets/amkyawdev/AmkyawDev-Dataset)
| Split | Samples |
|-------|---------|
| Train | ~29,100 |
| Validation | ~29,100 |
| Test | ~29,100 |
> **Note:** Each file (train.jsonl, test.jsonl, validation.jsonl) has ~29,100 conversations!
## πŸ’Ύ Output
Trained model saved to `./myanmar-qwen-output/`
## πŸ“€ Upload to HuggingFace
```bash
cd myanmar-qwen-output
huggingface-cli upload amkyawdev/my-myanmar-qwen . --repo-type model
```
## πŸ–₯️ Google Colab
```python
# Install
!pip install transformers datasets torch accelerate
# Login
from huggingface_hub import login
login("YOUR_TOKEN")
# Run
%run train.py
```
---
Built by amkyawdev