SAGE / README.md

Link model card to paper and fix code block formatting (#1)

3637f84 about 13 hours ago

1.71 kB

	---
	library_name: diffusers
	license: mit
	pipeline_tag: text-to-image
	tags:
	- text-to-image
	- safety-alignment
	- stable-diffusion
	- ECCV
	---

	# SAGE: Structure-Aware Geometric Regularization (ECCV-26)

	Paper: [The Illusion of High Utility in Safety Alignment of Text-to-Image Diffusion Models](https://huggingface.co/papers/2607.00402)
	Authors: Adeel Yousaf, Soumik Ghosh, James Beetham, Amrit Singh Bedi, Mubarak Shah
	Institution: University of Central Florida
	Project Page: [https://adeelyousaf.github.io/SAGE_ECCV26_Project_Page/](https://adeelyousaf.github.io/SAGE_ECCV26_Project_Page/)

	---

	## Overview

	We show that existing T2I safety alignment methods create an illusion of high utility — they appear to maintain high utility under coarse metrics (FID, CLIPScore) but suffer significant drops in fine-grained semantic fidelity (TIFA). We trace this to semantic collapse in the text encoder embedding space.

	SAGE is a geometry-aware safety alignment method that preserves embedding spread and local similarity structure during fine-tuning, achieving only a −1.2% TIFA drop vs. −6.2% for DES while maintaining strong safety (Avg. ASR 1.2%).

	---

	## Use this Model

	```python
	import torch
	from diffusers import StableDiffusionPipeline
	from huggingface_hub import hf_hub_download

	# Load base pipeline
	pipe = StableDiffusionPipeline.from_pretrained("CompVis/stable-diffusion-v1-4")

	# Download and load SAGE text encoder weights
	ckpt_path = hf_hub_download(repo_id="Adeely93/SAGE", filename="SAGE.pt")
	pipe.text_encoder.load_state_dict(torch.load(ckpt_path, map_location="cpu"))

	pipe = pipe.to("cuda")
	image = pipe("a photo of a dog in a park").images[0]
	```