Update README.md

Files changed (1)

README.md CHANGED

@@ -38,8 +38,10 @@ base_model:
 # ProFS Editing for Safety
-This model has been edited for safety from [`mistralai/Mistral-7B-v0.1`](https://huggingface.co/mistralai/Mistral-7B-v0.1).
-Editing is applied using ProFS (Projection Filter for Subspaces), a tuning-free alignment method that removes undesired behaviors such as toxicity, by identifying and projecting out harmful subspaces in model weights.
 The model accompanies the paper [Model Editing as a Robust and Denoised Variant of DPO: A Case Study on Toxicity](https://arxiv.org/abs/2405.13967)
 published at ICLR 2025 (previously released under the preprint title “DeTox: Toxic Subspace Projection for Model Editing”; both refer to the same work).

 # ProFS Editing for Safety
+This model is an edited version of [`mistralai/Mistral-7B-v0.1`](https://huggingface.co/mistralai/Mistral-7B-v0.1).
+The edit is applied with ProFS to improve safety.
+ProFS (Projection Filter for Subspaces) is a tuning-free alignment method that removes undesired behaviors by identifying and projecting out harmful subspaces in model weights.
 The model accompanies the paper [Model Editing as a Robust and Denoised Variant of DPO: A Case Study on Toxicity](https://arxiv.org/abs/2405.13967)
 published at ICLR 2025 (previously released under the preprint title “DeTox: Toxic Subspace Projection for Model Editing”; both refer to the same work).
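
For readers of this change, the phrase "projecting out harmful subspaces in model weights" can be illustrated with a small linear-algebra sketch. This is not the paper's procedure or code: the function name `project_out_subspace`, the random `weight` matrix, and the random `directions` are all hypothetical, and the sketch assumes the unwanted directions are already known and linearly independent. It only shows the generic operation of removing a subspace from a weight matrix with an orthogonal projector.

```python
import numpy as np

def project_out_subspace(weight, directions):
    """Remove from `weight` every output component lying in span(directions).

    weight:     (d_out, d_in) matrix (hypothetical stand-in for a model weight)
    directions: (k, d_out) array of linearly independent vectors spanning
                the subspace to remove (assumed given, not estimated here)
    """
    # Reduced QR yields an orthonormal basis Q (d_out, k) for the span.
    q, _ = np.linalg.qr(directions.T)
    # Q Q^T is the orthogonal projector onto the subspace; subtract that part.
    return weight - q @ (q.T @ weight)

# Toy example with random data, for illustration only.
rng = np.random.default_rng(0)
weight = rng.normal(size=(8, 5))
directions = rng.normal(size=(2, 8))
edited = project_out_subspace(weight, directions)
```

After the edit, `directions @ edited` is numerically zero, so the edited matrix can no longer write anything along the removed directions, and applying the projection a second time changes nothing.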