# 🧠 Myanmar LLM Training Fine-tune **Qwen2.5-0.5B-Instruct** with Myanmar language dataset. ## ⚑ No License Required! This model is fully open. No Llama license needed! ## πŸ“‹ Requirements - Python 3.8+ - GPU with 6GB+ VRAM - HuggingFace Account ## πŸš€ Quick Start ### 1. Install dependencies ```bash pip install -r requirements.txt ``` ### 2. Login to HuggingFace ```bash huggingface-cli login ``` ### 3. Run training ```bash python train.py ``` ## βš™οΈ Configuration | Parameter | Default | Description | |-----------|---------|-------------| | MODEL_NAME | Qwen/Qwen2.5-0.5B-Instruct | Base model (fully open!) | | num_train_epochs | 3 | Training iterations | | per_device_train_batch_size | 4 | Batch size | | gradient_accumulation_steps | 4 | Effective batch = 16 | | learning_rate | 2e-5 | Learning rate | ## πŸ“Š Features - βœ… Fully open model - α€œα€­α€―α€„α€Ία€…α€„α€Ία€™α€œα€­α€―α€•α€«α€žα€Šα€Ία‹ - βœ… FP16 precision - α€•α€­α€―α€™α€­α€―α€™α€Όα€”α€Ία€•α€«α€žα€Šα€Ία‹ - βœ… Gradient checkpointing - Memory α€α€»α€½α€±α€α€¬α€•α€«α€žα€Šα€Ία‹ - βœ… Test/Validation evaluation - α€”α€Ύα€…α€Ία€α€―α€œα€―α€Άα€Έα€‘α€α€½α€€α€Ί α€…α€™α€Ία€Έα€žα€•α€Ία€•α€«α€žα€Šα€Ία‹ ## πŸ“Š Training Data Dataset: [amkyawdev/AmkyawDev-Dataset](https://huggingface.co/datasets/amkyawdev/AmkyawDev-Dataset) | Split | Samples | |-------|---------| | Train | ~29,100 | | Validation | ~29,100 | | Test | ~29,100 | > **Note:** Each file (train.jsonl, test.jsonl, validation.jsonl) has ~29,100 conversations! ## πŸ’Ύ Output Trained model saved to `./myanmar-qwen-output/` ## πŸ“€ Upload to HuggingFace ```bash cd myanmar-qwen-output huggingface-cli upload amkyawdev/my-myanmar-qwen . --repo-type model ``` ## πŸ–₯️ Google Colab ```python # Install !pip install transformers datasets torch accelerate # Login from huggingface_hub import login login("YOUR_TOKEN") # Run %run train.py ``` --- Built by amkyawdev