Qwen
/

Qwen2.5-Coder-1.5B-Instruct-GGUF

Text Generation

Model card Files Files and versions

perf: switch to 1.5B Q2_K quantization for lowest possible latency on CPU

#3

by scriptsledge - opened Dec 20, 2025

base: refs/heads/main

←

from: refs/pr/3

Discussion Files changed

No description provided.

perf: switch to 1.5B Q2_K quantization for lowest possible latency on CPU009eb6a8

scriptsledge changed pull request status to closed Dec 20, 2025

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment