---
title: README
emoji: ๐
colorFrom: gray
colorTo: indigo
sdk: static
pinned: false
---

# The official organization of the Heretic project

[Heretic](https://github.com/p-e-w/heretic) is a tool that removes censorship (aka "safety alignment") from
transformer-based language models without expensive post-training.
It combines an advanced implementation of directional ablation, also known
as "abliteration" ([Arditi et al. 2024](https://arxiv.org/abs/2406.11717),
Lai 2025 ([1](https://huggingface.co/blog/grimjim/projected-abliteration),
[2](https://huggingface.co/blog/grimjim/norm-preserving-biprojected-abliteration))),
with a parameter optimizer based on the Tree-structured Parzen Estimator (TPE) and powered by [Optuna](https://optuna.org/).

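
As a rough illustration of what directional ablation means, the sketch below projects a single direction out of the output space of a weight matrix, following the basic form described by Arditi et al. 2024. It is not Heretic's implementation, which the description above calls more advanced and which tunes its parameters with Optuna. The function name `ablate_direction` and the toy values are hypothetical.

```python
import math


def ablate_direction(weight, direction):
    """Return a copy of `weight` with `direction` projected out of its outputs.

    weight:    list of rows with shape (d_out, d_in)
    direction: list of length d_out (need not be normalized)

    Computes W' = W - r r^T W, where r is the unit vector along `direction`.
    Hypothetical helper for illustration only.
    """
    norm = math.sqrt(sum(x * x for x in direction))
    if norm == 0.0:
        raise ValueError("direction must be non-zero")
    unit = [x / norm for x in direction]

    d_out, d_in = len(weight), len(weight[0])
    # coeffs[j] is the component of column j that lies along the direction.
    coeffs = [sum(unit[i] * weight[i][j] for i in range(d_out)) for j in range(d_in)]
    # Subtract that component from every column.
    return [
        [weight[i][j] - unit[i] * coeffs[j] for j in range(d_in)]
        for i in range(d_out)
    ]


# Toy example: remove the second output coordinate from a 3x2 matrix.
weight = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
direction = [0.0, 2.0, 0.0]
ablated = ablate_direction(weight, direction)
```

After this step every column of `ablated` is orthogonal to `direction`, so a layer using these weights can no longer write along it.
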
The purpose of this organization is to publish and curate high-quality
abliterated models made using Heretic.

Membership is open to anyone who has either:

1. contributed code to the Heretic project, *or*
2. published a well-received model made using Heretic.

To become a member, please [join our Discord](https://discord.gg/gdXc48gSyT)
and request an invitation.