richardyoung
/

kat-dev-72b

Text Generation

Model card Files Files and versions

richardyoung commited on Oct 24, 2025

Commit

9a9c5c5

·

verified ·

1 Parent(s): d0c8003

Add model card metadata

Files changed (1) hide show

README.md +17 -0

README.md CHANGED Viewed

@@ -1,8 +1,25 @@
 # Kat-Dev 72B (GGUF)
 Quantized builds of the KAT-Dev 72B coding model for Ollama / llama.cpp runtimes. Each variant ships with the matching Modelfile generated from the Ollama registry export.
 ## Variants
 | Variant | Size | Blob |

+---
+license: apache-2.0
+base_model: Kwaipilot/KAT-Dev-72B-Exp
+pipeline_tag: text-generation
+library_name: llama.cpp
+language:
+  - en
+tags:
+  - gguf
+  - quantized
+  - ollama
+  - text-generation
+quantized_by: richardyoung
+---
 # Kat-Dev 72B (GGUF)
 Quantized builds of the KAT-Dev 72B coding model for Ollama / llama.cpp runtimes. Each variant ships with the matching Modelfile generated from the Ollama registry export.
+These binaries are derived from the upstream [`Kwaipilot/KAT-Dev-72B-Exp`](https://huggingface.co/Kwaipilot/KAT-Dev-72B-Exp) release (Apache-2.0). The goal is to provide ready-to-run GGUF artifacts for local inference stacks such as Ollama and llama.cpp.
 ## Variants
 | Variant | Size | Blob |