Update README.md

6fccca7 verified 4 months ago

517 Bytes

license: mit

Original Model https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Heretic Ablitertion "[Trial 75] Refusals: 5/100, KL divergence: 0.00"

The Blashphemous Model has less refusals with a KL divergence of 0.01. I made this the heretic model to allow for making a LORA adapter, and to make merging better.

7B coming up next, If you release a GGUF please let me know so I can add it to my repo as well, I don't have the time to be learning how to do those conversions aat the moment.