Suggest renaming files per GGUF naming convention

#6
by mishig - opened

Hi @antirez , small filename suggestion

Current Proposed
…IQ2XXS-w2Q2K-AProjQ8-SExpQ8-OutQ8-chat-v2.gguf DeepSeek-V4-Flash-55IQ2_XXS-34Q2_K-07Q8_0-03F16.gguf
…IQ2XXS-…-chat-v2-imatrix.gguf DeepSeek-V4-Flash-55IQ2_XXS-34Q2_K-07Q8_0-03F16-imatrix.gguf
…Q4KExperts-F16HC-…-chat-v2.gguf DeepSeek-V4-Flash-95Q4_K-04Q8_0.gguf
…Q4KExperts-…-chat-v2-imatrix.gguf DeepSeek-V4-Flash-95Q4_K-04Q8_0-imatrix.gguf
DeepSeek-V4-Flash-MTP-Q4K-Q8_0-F32.gguf DeepSeek-V4-Flash-95Q4_K-04Q8_0-MTP.gguf

Rule: components listed descending by byte share, only those >~2% appear, so the sum may end slightly under 100. (The %s above are rough; exact values would need recomputing from the GGUF tensor info table.)

Two upstream PRs that wire this in:

  • GGUF spec extensionggml-org/ggml#1489 formalizes the percentage-mix form in the <Encoding> slot grammar with a backward-compatible regex update.
  • HF parser supporthuggingface.js#2170 adds parseGGUFQuantMix and extends parseGGUFQuantLabel to return the dominant component for mix filenames. Once this lands the HF Hub quant filter and GGUF widget will auto-classify your files correctly instead of showing "unknown".

Renames can be done in-place via the HF UI's GGUF Editor: no reupload needed. Happy to open a PR if useful 😊

for example, on https://huggingface.co/antirez/deepseek-v4-gguf/blob/main/DeepSeek-V4-Flash-IQ2XXS-w2Q2K-AProjQ8-SExpQ8-OutQ8-chat-v2-imatrix.gguf

image

This comment has been hidden (marked as Resolved)

Sign up or log in to comment