README / README.md
p-e-w's picture
Update README.md
1cb6036 verified
---
title: README
emoji: ๐Ÿ‘€
colorFrom: gray
colorTo: indigo
sdk: static
pinned: false
---
[![Discord](https://img.shields.io/discord/1447831134212984903?color=5865F2&label=discord&labelColor=black&logo=discord&logoColor=white&style=for-the-badge)](https://discord.gg/gdXc48gSyT)
# The official organization of the Heretic project
[Heretic](https://github.com/p-e-w/heretic) is a tool that removes censorship (aka "safety alignment") from
transformer-based language models without expensive post-training.
It combines an advanced implementation of directional ablation, also known
as "abliteration" ([Arditi et al. 2024](https://arxiv.org/abs/2406.11717),
Lai 2025 ([1](https://huggingface.co/blog/grimjim/projected-abliteration),
[2](https://huggingface.co/blog/grimjim/norm-preserving-biprojected-abliteration))),
with a TPE-based parameter optimizer powered by [Optuna](https://optuna.org/).
The purpose of this organization is to publish and curate high-quality
abliterated models made using Heretic.
Membership is open to anyone who has either
1. contributed code to the Heretic project, *or*
2. published a well-received model made using Heretic.
To become a member, please [join our Discord](https://discord.gg/gdXc48gSyT)
and request an invitation.