File size: 3,942 Bytes
e7c2c3c
6b1f941
 
 
 
 
 
e7c9932
 
e7c2c3c
e7c9932
 
 
 
 
 
 
 
e7c2c3c
df2fd27
e7c2c3c
 
 
 
df2fd27
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
e7c2c3c
 
 
 
df2fd27
 
e7c2c3c
df2fd27
 
 
 
 
 
 
e7c2c3c
df2fd27
e7c2c3c
df2fd27
 
e7c2c3c
df2fd27
 
 
e7c2c3c
df2fd27
 
 
 
 
 
 
 
 
e7c2c3c
df2fd27
 
 
 
e7c2c3c
df2fd27
e7c2c3c
df2fd27
 
e7c2c3c
df2fd27
 
 
 
e7c2c3c
df2fd27
 
 
 
 
e7c2c3c
 
df2fd27
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
---
license: apache-2.0
pipeline_tag: text-generation
tags:
- code
- qwen
- generated_from_trainer
- myanmar-nlp
- ai-agent
library_name: transformers
base_model: Qwen/Qwen2.5-Coder-1.5B
datasets:
- amkyawdev/mm-llm-coder-agent-dataset
language:
- my
- en
metrics:
- accuracy
---
# Model Card for amk-coder-v2

## Model Details

### Model Description
Myanmar-localized coding agent model fine-tuned from Qwen/Qwen2.5-Coder-1.5B using LoRA (PEFT). Designed for code generation and coding assistance in Myanmar language context.

- **Developed by:** amkyawdev
- **Model type:** Language Model (LLM)
- **Language(s) (NLP):** Myanmar (my), English (en)
- **License:** Apache-2.0
- **Finetuned from model:** Qwen/Qwen2.5-Coder-1.5B

### Model Sources
- **Repository:** [amkyawdev/amk-coder-v2](https://huggingface.co/amkyawdev/amk-coder-v2)
- **Dataset:** [amkyawdev/mm-llm-coder-agent-dataset](https://huggingface.co/datasets/amkyawdev/mm-llm-coder-agent-dataset)

## Model Configuration

| Parameter | Value |
|-----------|-------|
| Base Model | Qwen/Qwen2.5-Coder-1.5B |
| Fine-tuning Method | LoRA (PEFT) |
| Target Modules | q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj |
| Optimizer | paged_adamw_8bit |
| Precision | FP16 Mixed Precision |
| Learning Rate | 3e-5 |
| Training Infrastructure | Kaggle Cloud (Dual NVIDIA T4 GPUs) |

## Chat Template

This model uses the ChatML structure:

```xml
<|im_start|>system
You are an expert Myanmar AI coding agent with tool access.<|im_end|>
<|im_start|>user
{Instruction}
Tools available: {Tools}<|im_end|>
<|im_start|>assistant
Thought & Code:
```

## Training Details

### Training Data
- **Dataset:** amkyawdev/mm-llm-coder-agent-dataset
- **Description:** Myanmar localized coding agent dataset for instruction-tuned code generation

### Training Hyperparameters
| Parameter | Value |
|-----------|-------|
| Precision | FP16 Mixed Precision |
| Optimizer | paged_adamw_8bit |
| Learning Rate | 3e-5 |
| Hardware | Kaggle Cloud (Dual NVIDIA T4 GPUs) |

## How to Get Started with the Model

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "amkyawdev/amk-coder-v2"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# Chat prompt format
prompt = """<|im_start|>system
You are an expert Myanmar AI coding agent with tool access.<|im_end|>
<|im_start|>user
Write a Python function to add two numbers
Tools available: python<|im_end|>
<|im_start|>assistant
Thought & Code:
"""

inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=512)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

## Uses

### Direct Use
This model can be used for code generation tasks with Myanmar language instructions. Suitable for building coding assistants that understand Burmese/Myanmar language prompts.

### Out-of-Scope Use
- Not intended for production deployment without fine-tuning
- Not tested for safety-critical applications
- May generate incorrect code; always verify outputs

## Bias, Risks, and Limitations
- Model may generate syntactically incorrect code
- May not follow security best practices
- Training data quality affects output quality
- Myanmar language support may be limited compared to English

## Environmental Impact
- **Hardware Type:** NVIDIA T4 GPUs (Dual)
- **Cloud Provider:** Kaggle
- **Training Time:** ~3-5 hours

## Citation

If you use this model, please cite:

```
@misc{amk-coder-v2,
  author = {amkyawdev},
  title = {amk-coder-v2: Myanmar Coding Agent Model},
  year = {2025},
  publisher = {HuggingFace},
  url = {https://huggingface.co/amkyawdev/amk-coder-v2}
}
```

## More Information
- Dataset: [amkyawdev/mm-llm-coder-agent-dataset](https://huggingface.co/datasets/amkyawdev/mm-llm-coder-agent-dataset)
- Base Model: [Qwen/Qwen2.5-Coder-1.5B](https://huggingface.co/Qwen/Qwen2.5-Coder-1.5B)