Spaces:

heretic-org
/

README

Running

README / README.md

Update README.md

1cb6036 verified 6 days ago

1.27 kB

	---
	title: README
	emoji: 👀
	colorFrom: gray
	colorTo: indigo
	sdk: static
	pinned: false
	---

	[![Discord](https://img.shields.io/discord/1447831134212984903?color=5865F2&label=discord&labelColor=black&logo=discord&logoColor=white&style=for-the-badge)](https://discord.gg/gdXc48gSyT)

	# The official organization of the Heretic project

	[Heretic](https://github.com/p-e-w/heretic) is a tool that removes censorship (aka "safety alignment") from
	transformer-based language models without expensive post-training.
	It combines an advanced implementation of directional ablation, also known
	as "abliteration" ([Arditi et al. 2024](https://arxiv.org/abs/2406.11717),
	Lai 2025 ([1](https://huggingface.co/blog/grimjim/projected-abliteration),
	[2](https://huggingface.co/blog/grimjim/norm-preserving-biprojected-abliteration))),
	with a TPE-based parameter optimizer powered by [Optuna](https://optuna.org/).

	The purpose of this organization is to publish and curate high-quality
	abliterated models made using Heretic.

	Membership is open to anyone who has either

	1. contributed code to the Heretic project, or
	2. published a well-received model made using Heretic.

	To become a member, please [join our Discord](https://discord.gg/gdXc48gSyT)
	and request an invitation.