1337Hero's picture
Upload README.md with huggingface_hub
e45eb25 verified
---
base_model: unsloth/Qwen3-Coder-30B-A3B-Instruct
base_model_relation: adapter
library_name: peft
license: apache-2.0
language:
- code
model_name: qwen3-coder-30b-a3b-codemonkey
pipeline_tag: text-generation
tags:
- lora
- peft
- qwen3
- qwen3-coder
- qwen3moe
- sft
- code
- unsloth
---
# qwen3-coder-30b-a3b-codemonkey
LoRA adapter for `unsloth/Qwen3-Coder-30B-A3B-Instruct`.
## Files
- `adapter_model.safetensors`: adapter weights
- `adapter_config.json`: PEFT config
- `tokenizer.json`, `tokenizer_config.json`, `chat_template.jinja`: tokenizer and chat template assets
## Load with Transformers + PEFT
```python
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer
base_id = "unsloth/Qwen3-Coder-30B-A3B-Instruct"
adapter_id = "1337Hero/qwen3-coder-30b-a3b-codemonkey"
tokenizer = AutoTokenizer.from_pretrained(base_id)
base_model = AutoModelForCausalLM.from_pretrained(
base_id,
torch_dtype="auto",
device_map="auto",
)
model = PeftModel.from_pretrained(base_model, adapter_id)
messages = [
{"role": "user", "content": "Write a Python function that atomically replaces a file."}
]
text = tokenizer.apply_chat_template(
messages,
tokenize=False,
add_generation_prompt=True,
)
inputs = tokenizer(text, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=512)
completion = outputs[0][inputs.input_ids.shape[1]:]
print(tokenizer.decode(completion, skip_special_tokens=True))
```
## Adapter details
- Base model: `unsloth/Qwen3-Coder-30B-A3B-Instruct`
- PEFT type: `LoRA`
- Rank: `r=16`
- Alpha: `32`
- Target modules: `q_proj`, `k_proj`, `v_proj`, `o_proj`
## GGUF
A merged GGUF release can live in a separate repo such as
`1337Hero/qwen3-coder-30b-a3b-codemonkey-GGUF`.