amk-coder-v2 / README.md
amkyawdev's picture
Update README.md with model details
df2fd27 verified
---
license: apache-2.0
pipeline_tag: text-generation
tags:
- code
- qwen
- generated_from_trainer
- myanmar-nlp
- ai-agent
library_name: transformers
base_model: Qwen/Qwen2.5-Coder-1.5B
datasets:
- amkyawdev/mm-llm-coder-agent-dataset
language:
- my
- en
metrics:
- accuracy
---
# Model Card for amk-coder-v2
## Model Details
### Model Description
Myanmar-localized coding agent model fine-tuned from Qwen/Qwen2.5-Coder-1.5B using LoRA (PEFT). Designed for code generation and coding assistance in Myanmar language context.
- **Developed by:** amkyawdev
- **Model type:** Language Model (LLM)
- **Language(s) (NLP):** Myanmar (my), English (en)
- **License:** Apache-2.0
- **Finetuned from model:** Qwen/Qwen2.5-Coder-1.5B
### Model Sources
- **Repository:** [amkyawdev/amk-coder-v2](https://huggingface.co/amkyawdev/amk-coder-v2)
- **Dataset:** [amkyawdev/mm-llm-coder-agent-dataset](https://huggingface.co/datasets/amkyawdev/mm-llm-coder-agent-dataset)
## Model Configuration
| Parameter | Value |
|-----------|-------|
| Base Model | Qwen/Qwen2.5-Coder-1.5B |
| Fine-tuning Method | LoRA (PEFT) |
| Target Modules | q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj |
| Optimizer | paged_adamw_8bit |
| Precision | FP16 Mixed Precision |
| Learning Rate | 3e-5 |
| Training Infrastructure | Kaggle Cloud (Dual NVIDIA T4 GPUs) |
## Chat Template
This model uses the ChatML structure:
```xml
<|im_start|>system
You are an expert Myanmar AI coding agent with tool access.<|im_end|>
<|im_start|>user
{Instruction}
Tools available: {Tools}<|im_end|>
<|im_start|>assistant
Thought & Code:
```
## Training Details
### Training Data
- **Dataset:** amkyawdev/mm-llm-coder-agent-dataset
- **Description:** Myanmar localized coding agent dataset for instruction-tuned code generation
### Training Hyperparameters
| Parameter | Value |
|-----------|-------|
| Precision | FP16 Mixed Precision |
| Optimizer | paged_adamw_8bit |
| Learning Rate | 3e-5 |
| Hardware | Kaggle Cloud (Dual NVIDIA T4 GPUs) |
## How to Get Started with the Model
```python
from transformers import AutoModelForCausalLM, AutoTokenizer
model_name = "amkyawdev/amk-coder-v2"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
# Chat prompt format
prompt = """<|im_start|>system
You are an expert Myanmar AI coding agent with tool access.<|im_end|>
<|im_start|>user
Write a Python function to add two numbers
Tools available: python<|im_end|>
<|im_start|>assistant
Thought & Code:
"""
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=512)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
## Uses
### Direct Use
This model can be used for code generation tasks with Myanmar language instructions. Suitable for building coding assistants that understand Burmese/Myanmar language prompts.
### Out-of-Scope Use
- Not intended for production deployment without fine-tuning
- Not tested for safety-critical applications
- May generate incorrect code; always verify outputs
## Bias, Risks, and Limitations
- Model may generate syntactically incorrect code
- May not follow security best practices
- Training data quality affects output quality
- Myanmar language support may be limited compared to English
## Environmental Impact
- **Hardware Type:** NVIDIA T4 GPUs (Dual)
- **Cloud Provider:** Kaggle
- **Training Time:** ~3-5 hours
## Citation
If you use this model, please cite:
```
@misc{amk-coder-v2,
author = {amkyawdev},
title = {amk-coder-v2: Myanmar Coding Agent Model},
year = {2025},
publisher = {HuggingFace},
url = {https://huggingface.co/amkyawdev/amk-coder-v2}
}
```
## More Information
- Dataset: [amkyawdev/mm-llm-coder-agent-dataset](https://huggingface.co/datasets/amkyawdev/mm-llm-coder-agent-dataset)
- Base Model: [Qwen/Qwen2.5-Coder-1.5B](https://huggingface.co/Qwen/Qwen2.5-Coder-1.5B)