phucngodev's picture
Duplicate from noctrex/Qwopus3.5-9B-Coder-MTP
9d8e9f8
---
pipeline_tag: image-text-to-text
base_model:
- Jackrong/Qwopus3.5-9B-Coder
tags:
- qwen
- qwen3_5_moe
- MTP
---
These are quantizations of the model [Jackrong / Qwopus3.5-9B-Coder](https://huggingface.co/Jackrong/Qwopus3.5-9B-Coder)
I've added the MTP layer on it.
My personal speed improvement on my 7900XTX with the vulkan backend has been from ~80 tps to around ~120 tps.
An imatrix has been calulated for coding tasks, as such it is specialized for coding.
## Quick Start
1. Download the latest release of **llama.cpp**.
2. Download your preferred model variant from below.