amkyawdev's picture
Upload README.md with huggingface_hub
65ac950 verified
---
library_name: transformers
tags:
- code
- coding
- software-development
- programming
- llm
- python
- qwen
- transformers
- peft
- lora
- finetuned
license: apache-2.0
---
# 🤖 MM Coder Agent v1
A professional AI coding assistant model fine-tuned from Qwen2.5-1.5B-Instruct for software development tasks.
## Model Overview
| Property | Value |
|----------|-------|
| **Base Model** | Qwen/Qwen2.5-1.5B-Instruct |
| **Architecture** | LoRA (PEFT Adapter) |
| **Parameters** | 1.5B (base) + 37MB (adapter) |
| **Task** | Code Generation / Software Development |
| **Framework** | Transformers, Safetensors |
## Model Description
MM Coder Agent v1 is a specialized coding assistant built on Qwen2.5-1.5B-Instruct. This model is optimized for:
- **Code Generation** - Generate clean, efficient code in multiple languages
- **Bug Detection** - Identify and fix common programming errors
- **Algorithm Implementation** - Implement sorting, searching, and data structures
- **Code Review** - Assist with code review and best practices
### Architecture Details
```json
{
"peft_type": "LORA",
"base_model_name_or_path": "Qwen/Qwen2.5-1.5B-Instruct",
"r": 16,
"lora_alpha": 32,
"lora_dropout": 0.0,
"task_type": "CAUSAL_LM",
"target_modules": ["q_proj", "k_proj", "v_proj", "o_proj", "gate_proj", "up_proj", "down_proj"]
}
```
## Live Demo
Try the model live at: [mm-coder-v1-space](https://huggingface.co/spaces/amkyawdev/mm-coder-v1-space)
## Quick Start
```python
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch
from peft import PeftModel, PeftConfig
# Load adapter config
peft_config = PeftConfig.from_pretrained("amkyawdev/mm-coder-agent-v1-combined")
# Load base model
base_model = AutoModelForCausalLM.from_pretrained(
peft_config.base_model_name_or_path,
torch_dtype=torch.float16,
device_map="auto",
trust_remote_code=True
).eval()
# Load tokenizer
tokenizer = AutoTokenizer.from_pretrained(
peft_config.base_model_name_or_path,
trust_remote_code=True
)
# Load LoRA adapter
model = PeftModel.from_pretrained(base_model, "amkyawdev/mm-coder-agent-v1-combined")
# Generate code
prompt = "Write a Python function to calculate fibonacci numbers"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
outputs = model.generate(**inputs, max_new_tokens=512, temperature=0.7)
response = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(response)
```
## Example Outputs
| Prompt | Output |
|--------|--------|
| `python hello world` | `print("Hello, World!")` |
| `reverse string python` | `s[::-1]` |
| `fibonacci function python` | Full fibonacci implementation |
| `bubble sort python` | Bubble sort algorithm |
## Training Data
- **Dataset**: mm-llm-coder-dataset (4M rows)
- **Additional**: mm-llm-coder-agent-dataset (4M rows)
- **Source**: Quality coding prompts and responses
## Use Cases
### Ideal For
- Code completion and generation
- Bug detection and fixing
- Algorithm implementation
- Learning programming concepts
- Quick prototyping
### Not Recommended For
- Production-critical systems without evaluation
- Security-sensitive applications without guardrails
- Tasks beyond software development
## Limitations
- 1.5B parameter model (smaller than GPT-4 class)
- May produce incorrect code - always verify outputs
- Limited context window
- Fine-tuned primarily for English
## License
Apache 2.0
## Citation
```bibtex
@model{amkyawdev/mm-coder-agent-v1-combined,
title={MM Coder Agent v1},
author={amkyawdev},
year={2024},
url={https://huggingface.co/amkyawdev/mm-coder-agent-v1-combined}
}
```
---
*Built with ❤️ using Transformers and PEFT*