ModelMaiden
/

Heretic-Deepseek-R1-Distill-Qwen-1.5B

Model card Files Files and versions

Heretic-Deepseek-R1-Distill-Qwen-1.5B / README.md

ModelMaiden's picture

Update README.md

6fccca7 verified 4 months ago

|

history blame contribute delete

517 Bytes

	---
	license: mit
	---
	Original Model https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

	Heretic Ablitertion "[Trial 75] Refusals: 5/100, KL divergence: 0.00"

	The Blashphemous Model has less refusals with a KL divergence of 0.01. I made this the heretic model to allow for making a LORA adapter, and to make merging better.

	7B coming up next, If you release a GGUF please let me know so I can add it to my repo as well, I don't have the time to be learning how to do those conversions aat the moment.