louisguthmann's picture
Add files using upload-large-folder tool
a3f5d82 verified
metadata
base_model: Qwen/Qwen3.5-2B
library_name: gguf
tags:
  - gguf
  - qwen
  - bash
  - shell
  - linux
  - llama.cpp
  - text-generation

Qwen3.5-2B ShellCommand-Linux GGUF

This repository contains merged GGUF exports of the current best Qwen3.5-2B ShellCommand-Linux LoRA.

Source

  • adapter source: https://huggingface.co/louisguthmann/qwen3.5-2b-shellcommand-linux-lora
  • GitHub repo: https://github.com/GuthL/bitnet-nl2sh

Files

  • Qwen3.5-2B-shellcommand-linux-F16.gguf
  • Qwen3.5-2B-shellcommand-linux-Q4_K_M.gguf
  • Qwen3.5-2B-shellcommand-linux-Q4_K_S.gguf

Inherited Eval Snapshot

These metrics come from the source LoRA adapter before GGUF quantization.

  • score: 276.5033
  • verifier ok rate: 0.7750
  • verifier command rate: 0.7604
  • verifier ask rate: 0.7500
  • verifier cannot rate: 1.0000
  • exact any-exact rate: 0.2500
  • exact parse-ok rate: 0.9800

Recommended Deployment Variants

  • Q4_K_M: safer default if you want more quality headroom
  • Q4_K_S: leaner option if memory or latency is tighter

CX23 Benchmarking

See the GitHub docs for the exact benchmark commands used for llama.cpp on Hetzner CX23.