ubergarm/Kimi-K2-Instruct-GGUF

Text Generation


ubergarm committed on Jul 14, 2025

Commit a518c83 · 1 Parent(s): 3d187fa

initial commit

Files changed (2)

.gitattributes +3 -0
README.md +18 -0

.gitattributes CHANGED

@@ -33,3 +33,6 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+*.gguf filter=lfs diff=lfs merge=lfs -text
+*.png filter=lfs diff=lfs merge=lfs -text
+imatrix-*.dat filter=lfs diff=lfs merge=lfs -text
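
To make the effect of the three added rules concrete, here is a minimal Python sketch that checks which file names they would route through Git LFS. It is an approximation: it uses `fnmatchcase` on the base name only, which covers slash-free patterns like these but does not implement the full gitattributes matching rules. The helper names and the example file names are hypothetical.

```python
from fnmatch import fnmatchcase
from pathlib import PurePosixPath

# The three rules added by this commit, copied from the diff above.
ADDED_RULES = """\
*.gguf filter=lfs diff=lfs merge=lfs -text
*.png filter=lfs diff=lfs merge=lfs -text
imatrix-*.dat filter=lfs diff=lfs merge=lfs -text
"""


def lfs_patterns(gitattributes_text):
    """Return the patterns whose attribute list includes filter=lfs."""
    patterns = []
    for line in gitattributes_text.splitlines():
        fields = line.split()
        if len(fields) >= 2 and "filter=lfs" in fields[1:]:
            patterns.append(fields[0])
    return patterns


def is_lfs_tracked(path, patterns):
    """Approximate check: match the base name against each pattern.

    Assumption: patterns contain no '/', so Git matches them against
    the file name at any directory depth.
    """
    name = PurePosixPath(path).name
    return any(fnmatchcase(name, pattern) for pattern in patterns)


if __name__ == "__main__":
    patterns = lfs_patterns(ADDED_RULES)
    # Hypothetical file names, for illustration only.
    for candidate in ["imatrix.gguf", "imatrix-example.dat", "README.md"]:
        print(candidate, is_lfs_tracked(candidate, patterns))
```

In a real checkout, `git check-attr filter -- <path>` gives the authoritative answer for a given path.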

README.md CHANGED

@@ -1,5 +1,23 @@
 ---
+quantized_by: ubergarm
+pipeline_tag: text-generation
+base_model: moonshotai/Kimi-K2-Instruct
 license: other
 license_name: modified-mit
 license_link: https://huggingface.co/moonshotai/Kimi-K2-Instruct/raw/main/LICENSE
+base_model_relation: quantized
+tags:
+- mla
+- imatrix
+- conversational
+- ik_llama.cpp
 ---
+## Work In Progress
+Hoping to first release at least a new `imatrix.gguf` so other folks can quantize their own mainline quants. Then follow up with some of ik_llama.cpp's SOTA quants targeting "smaller" rigs.
+Follow along at home:
+* https://github.com/ggml-org/llama.cpp/pull/14654
+* https://huggingface.co/gabriellarson/Kimi-K2-Instruct-GGUF/discussions/1
+* https://github.com/ggml-org/llama.cpp/pull/9400