Highly experimental prune of GLM 4.6V from 128 experts to 96. Vision works, as tested in GGUF format. I would greatly appreciate feedback.
Chat template
Files info
Base model