amkyawdev
/

mm-llm-coder-lite-v1

phi

custom_code

Model card Files Files and versions

xet

Community

amkyawdev commited on Apr 27

Commit

f463bcd

verified ·

1 Parent(s): 98d2bbe

Upload README.md with huggingface_hub

Browse files

Files changed (1) hide show

README.md +213 -69

README.md CHANGED Viewed

@@ -1,36 +1,37 @@
----
-license: mit
-tags:
-- burmese
-- myanmar
-- llm
-- code-generation
-- fine-tuned
-- lora
-- phi-2
-datasets:
-- amkyawdev/myanmar-llm-data
----
 # mm-llm-coder-lite-v1
-Myanmar Lightweight LLM for Code Generation and Conversation
 ## 📌 Overview
-This is a fine-tuned lightweight LLM model for Myanmar (Burmese) language understanding, code generation, and conversational tasks. The model is based on [microsoft/phi-2](https://huggingface.co/microsoft/phi-2) and fine-tuned using LoRA technique.
 ## 🏗️ Architecture
-- **Base Model**: microsoft/phi-2
-- **Fine-tuning Method**: LoRA (Low-Rank Adaptation)
-- **Training Framework**: Hugging Face Transformers + PEFT + TRL
-- **Language**: Burmese (Myanmar)
 ## 📊 Training Details
 | Parameter | Value |
-|-----------|-------|
 | Base Model | microsoft/phi-2 |
 | Training Epochs | 3 |
 | Learning Rate | 2e-4 |
@@ -45,52 +46,66 @@ This is a fine-tuned lightweight LLM model for Myanmar (Burmese) language unders
 Trained on [amkyawdev/myanmar-llm-data](https://huggingface.co/datasets/amkyawdev/myanmar-llm-data):
-| Tag | Description |
-|-----|-------------|
-| coding | Programming conversations (90%) |
-| translation | English-Myanmar translation (1%) |
-| general | General knowledge Q&A (1%) |
-| greeting | Burmese greetings (1%) |
 ### Dataset Statistics
-- Train: ~20,327 samples
-- Test: ~17,155 samples
-- Validation: ~17,071 samples
 ## 🚀 Quick Start
 ### Installation
 ```bash
-pip install -r requirements.txt
 ```
-### Fine-tuning
-```bash
-python finetune_mm_llm.py
-```
-### Inference
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
-# Load model
 model_name = "amkyawdev/mm-llm-coder-lite-v1"
 tokenizer = AutoTokenizer.from_pretrained(model_name)
-model = AutoModelForCausalLM.from_pretrained(model_name)
 # Generate response
-input_text = "System: သင်သည် မြန်မာစာကျွမ်းကျင်သော AI အကူအညီပေးသူဖြစ်သည်။\n\nUser: မင်္ဂလာပါ။"
-inputs = tokenizer(input_text, return_tensors="pt")
-outputs = model.generate(**inputs, max_new_tokens=100)
 response = tokenizer.decode(outputs[0], skip_special_tokens=True)
 print(response)
 ```
-### Using with Transformers Pipeline
 ```python
 from transformers import pipeline
@@ -98,13 +113,137 @@ from transformers import pipeline
 pipe = pipeline(
     "text-generation",
     model="amkyawdev/mm-llm-coder-lite-v1",
-    tokenizer="amkyawdev/mm-llm-coder-lite-v1"
 )
-result = pipe("User: ဟိုင်း၊ နေကောင်းလား။")
 print(result[0]['generated_text'])
 ```
 ## 📋 Requirements
 ```
@@ -118,36 +257,41 @@ datasets>=2.14.0
 ## 🔧 Configuration
-Edit `Config` class in `finetune_mm_llm.py` to customize:
 ```python
-class Config:
-    MODEL_NAME = "microsoft/phi-2"  # Change base model
-    OUTPUT_DIR = "./mm-llm-coder-lite-v1"
-    TRAIN_PATH = "/workspace/train.jsonl"
-    # ... more settings
 ```
-## 📝 Output
-After training, the model will be saved to `./mm-llm-coder-lite-v1/` with:
-- `adapter_config.json` - LoRA config
-- `adapter_model.safetensors` - LoRA weights
-- `tokenizer.json` - Tokenizer
-- `tokenizer_config.json` - Tokenizer config
-- `training_config.json` - Training config
-## 🏷️ Tags
-- `burmese` - Myanmar language
-- `llm` - Large Language Model
-- `code-generation` - Code generation
-- `fine-tuned` - Fine-tuned model
-## 📜 License
-MIT License
-## 👤 Author
-[amkyawdev](https://huggingface.co/amkyawdev)

 # mm-llm-coder-lite-v1
+<p align="center">
+  <img src="https://img.shields.io/badge/Myanmar-LLM-blue?style=for-the-badge&logo=huggingface" alt="License">
+  <img src="https://img.shields.io/badge/License-MIT-green?style=for-the-badge" alt="License">
+  <img src="https://img.shields.io/badge/Model-phi--2-orange?style=for-the-badge" alt="Base Model">
+  <img src="https://img.shields.io/badge/Fine--tuned-LoRA-red?style=for-the-badge" alt="Method">
+</p>
 ## 📌 Overview
+**mm-llm-coder-lite-v1** is a specialized Large Language Model (LLM) fine-tuned for Myanmar (Burmese) language understanding, code generation, and conversational tasks. The model is based on Microsoft's `phi-2` and fine-tuned using Low-Rank Adaptation (LoRA) technique.
+### Key Features
+- 🌍 **Myanmar Language Support**: Specialized in Burmese/Myanmar language processing
+- 💻 **Code Generation**: Supports Python, JavaScript, and other programming languages
+- 💬 **Conversational AI**: Can engage in natural dialogue in Myanmar language
+- ⚡ **Lightweight**: Optimized for efficient inference with LoRA
 ## 🏗️ Architecture
+| Component | Details |
+|----------|---------|
+| **Base Model** | microsoft/phi-2 |
+| **Fine-tuning Method** | LoRA (Low-Rank Adaptation) |
+| **Training Framework** | Hugging Face Transformers + PEFT + TRL |
+| **Language** | Burmese (Myanmar) |
+| **Parameters** | ~2.7B total (trainable: ~2.6M) |
 ## 📊 Training Details
 | Parameter | Value |
+|----------|-------|
 | Base Model | microsoft/phi-2 |
 | Training Epochs | 3 |
 | Learning Rate | 2e-4 |
 Trained on [amkyawdev/myanmar-llm-data](https://huggingface.co/datasets/amkyawdev/myanmar-llm-data):
+| Tag | Description | Percentage |
+|-----|-------------|------------|
+| coding | Programming conversations | 90% |
+| translation | English-Myanmar translation | 1% |
+| general | General knowledge Q&A | 1% |
+| greeting | Burmese greetings | 1% |
 ### Dataset Statistics
+- **Train**: ~20,327 samples
+- **Test**: ~17,155 samples
+- **Validation**: ~17,071 samples
 ## 🚀 Quick Start
 ### Installation
 ```bash
+pip install torch transformers peft accelerate datasets
 ```
+### Basic Inference
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
+# Load model and tokenizer
 model_name = "amkyawdev/mm-llm-coder-lite-v1"
 tokenizer = AutoTokenizer.from_pretrained(model_name)
+model = AutoModelForCausalLM.from_pretrained(
+    model_name,
+    torch_dtype=torch.float16,
+    device_map="auto"
+)
+# Set pad token
+tokenizer.pad_token = tokenizer.eos_token
 # Generate response
+input_text = """System: သင်သည် မြန်မာစာကျွမ်းကျင်သော AI အကူအညီပေးသူဖြစ်သည်။
+User: Python နဲ့ Fibonacci စီးရီးထုတ်တဲ့ function ရေးပေးပါ။
+Assistant:"""
+inputs = tokenizer(input_text, return_tensors="pt", truncation=True, max_length=512)
+inputs = {k: v.to(model.device) for k, v in inputs.items()}
+outputs = model.generate(
+    **inputs,
+    max_new_tokens=256,
+    temperature=0.7,
+    top_p=0.95,
+    do_sample=True
+)
 response = tokenizer.decode(outputs[0], skip_special_tokens=True)
 print(response)
 ```
+### Using Pipeline
 ```python
 from transformers import pipeline
 pipe = pipeline(
     "text-generation",
     model="amkyawdev/mm-llm-coder-lite-v1",
+    tokenizer="amkyawdev/mm-llm-coder-lite-v1",
+    device_map="auto",
+    torch_dtype=torch.float16
 )
+prompt = """System: သင်သည် မြန်မာစာကျွမ်းကျင်သော AI အကူအညီပေးသူဖြစ်သည်။
+User: ဟိုင်း၊ နေကောင်းလား။
+Assistant:"""
+result = pipe(prompt, max_new_tokens=128, temperature=0.7)
 print(result[0]['generated_text'])
 ```
+## 📝 Prompt Template
+This model uses the following prompt format:
+```
+System: <system_prompt>
+User: <user_message>
+Assistant: <assistant_response><eos>
+```
+### Example Prompt
+```
+System: သင်သည် မြန်မာစာကျွမ်းကျင်သော AI အကူအညီပေးသူဖြစ်သည်။
+User: မင်္ဂလာပါ။
+Assistant: မင်္ဂလာပါရှင်း။ သင့်အား ကူညီပါသည်။<eos>
+```
+## 🖥️ Deployment
+### GGUF Conversion (for LM Studio / Ollama)
+```python
+# Install required packages
+# pip install transformers peft accelerate sentencepiece
+from transformers import AutoModelForCausalLM, AutoTokenizer
+import torch
+# Load model
+model_name = "amkyawdev/mm-llm-coder-lite-v1"
+model = AutoModelForCausalLM.from_pretrained(
+    model_name,
+    torch_dtype=torch.float16,
+    device_map="cpu",
+    low_cpu_mem_usage=True
+)
+tokenizer = AutoTokenizer.from_pretrained(model_name)
+# Merge LoRA weights (if using PEFT)
+# Note: This model uses LoRA adapters
+# Save merged model
+output_dir = "./mm-llm-merged"
+model.save_merged(output_dir)
+tokenizer.save_pretrained(output_dir)
+# Convert to GGUF using llama.cpp
+# Follow: https://github.com/ggerganov/llama.cpp/tree/master/convert
+```
+### Ollama Deployment
+```bash
+# Create Modelfile
+FROM ./mm-llm-coder-lite-v1
+PARAMETER temperature 0.7
+PARAMETER top_p 0.95
+PARAMETER top_k 40
+TEMPLATE """System: {{ .System }}
+User: {{ .Prompt }}
+Assistant: {{ .Response }}<eos>"""
+# Create model in Ollama
+ollama create mm-llm-coder -f Modelfile
+# Run
+ollama run mm-llm-coder
+```
+## 📈 Evaluation
+### Myanmar Code Evaluation
+```python
+# Example evaluation for Myanmar code generation
+myanmar_prompts = [
+    "Python နဲ့ list ကို sort လုပ်နည်းရေးပါ။",
+    "JavaScript နဲ့ function ရေးပေးပါ။",
+    "မြန်မာ Unicode ကို Zawgyi ပြောင်းတဲ့ code ရေးပါ။",
+]
+# Run generation and evaluate
+def evaluate_model(prompts):
+    results = []
+    for prompt in prompts:
+        # Generate code
+        output = generate(prompt)
+        results.append({
+            "prompt": prompt,
+            "generated": output,
+            "success": check_syntax(output)
+        })
+    return results
+# Calculate pass rate
+success_rate = sum(1 for r in results if r["success"]) / len(results)
+print(f"Success Rate: {success_rate * 100:.2f}%")
+```
+### Benchmark Adaptation
+For Myanmar-specific evaluation, consider:
+1. Translating MBPP/MathEval prompts to Myanmar
+2. Creating Myanmar coding benchmarks
+3. Using BLEU/ROUGE for translation quality
 ## 📋 Requirements
 ```
 ## 🔧 Configuration
 ```python
+from transformers import TrainingArguments
+training_args = TrainingArguments(
+    output_dir="./mm-llm-output",
+    num_train_epochs=3,
+    per_device_train_batch_size=4,
+    learning_rate=2e-4,
+    fp16=True,
+    save_steps=500,
+    eval_steps=500,
+    save_total_limit=2,
+)
 ```
+## 📜 License
+This project is licensed under the **MIT License**.
+See [LICENSE](LICENSE) for details.
+## 👤 Author
+**Amkyaw Dev**
+- GitHub: [@amkyawdev](https://github.com/amkyawdev)
+- Hugging Face: [amkyawdev](https://huggingface.co/amkyawdev)
+## 🙏 Acknowledgments
+- Microsoft for the phi-2 model
+- Hugging Face for Transformers and PEFT
+- The Myanmar NLP community
+---
+<p align="center">
+  Made with ❤️ for Myanmar AI Community
+</p>