GGUF available — Cerebellum v1 & v2 (ablation-guided mixed-precision)

#29
by deucebucket - opened

Ablation-guided mixed-precision GGUF quants for running this model in llama.cpp / ollama:

Instead of treating every tensor the same, we ran individual ablation experiments to measure which tensors are sensitive vs. tolerant and assigned precision accordingly. Details and benchmarks in the model cards.

Sign up or log in to comment