phucngodev's picture
Duplicate from noctrex/Qwopus3.5-9B-Coder-MTP
9d8e9f8
metadata
pipeline_tag: image-text-to-text
base_model:
  - Jackrong/Qwopus3.5-9B-Coder
tags:
  - qwen
  - qwen3_5_moe
  - MTP

These are quantizations of the model Jackrong / Qwopus3.5-9B-Coder
I've added the MTP layer on it.
My personal speed improvement on my 7900XTX with the vulkan backend has been from ~80 tps to around ~120 tps. An imatrix has been calulated for coding tasks, as such it is specialized for coding.

Quick Start

  1. Download the latest release of llama.cpp.
  2. Download your preferred model variant from below.