GGUF available — Cerebellum v1 & v2 (ablation-guided mixed-precision)

#29

by deucebucket - opened May 1

May 1

Ablation-guided mixed-precision GGUF quants for running this model in llama.cpp / ollama:

Cerebellum v2 — ablation-informed with PLE protection, latest version
Cerebellum v1 — initial ablation-informed quant

Instead of treating every tensor the same, we ran individual ablation experiments to measure which tensors are sensitive vs. tolerant and assigned precision accordingly. Details and benchmarks in the model cards.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment