Heretic Options?

by darkc0de - opened Feb 15

Feb 15

Hey MuXodious!
I'm curious to know your preferred heretic options your using since v1.2.0

heretic --model USER/REPO
heretic --model USER/REPO --orthogonalize-direction
heretic --model USER/REPO --orthogonalize-direction --row-normalization full
heretic --model USER/REPO --orthogonalize-direction --row-normalization full --winsorization-quantile 0.995

Something different?

MuXodious

Owner Feb 15

Salut, the creator of the cult classic Xortron, darkc0de!

I almost exclusively use heretic --orthogonalize-direction --row-normalization full --winsorization-quantile 0.995 USER/REPO, adjusting windsor to a higher or lower value to see if it helps.

darkc0de

Feb 16

Check out the results of this run,

heretic --orthogonalize-direction --row-normalization full --winsorization-quantile 0.995 --model unsloth/GLM-4.7-Flash

MuXodious

Owner Feb 16

•

edited Feb 16

Now, those are what I refer to as *impotent heresy. Although, I also used the same set of arguments for GLM 4.7, it still remains a tad unique case. Its reasoning throws off the refusal detection mechanism and, as a result, the ablation optimisation algorithm. So, you effectively end up with a low initial refusal count and an impotent ablation. You should also keep in mind that not all models refuse the same. The default set of refusal markers is adequate, but not enough for all cases. Here's an example. You may have to study the model behaviours a bit to pinpoint its unique markers.

Is there a particular reason to use the unsloth version?

darkc0de

Feb 16

Over time I've grown to appreciate the small fixes unsloth applies to some models. Their quants top notch too

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment