Improve model card: Add pipeline tag, library name, links, and usage examples
#1
by
nielsr
HF Staff
- opened
README.md
CHANGED
|
@@ -1,7 +1,68 @@
|
|
| 1 |
---
|
| 2 |
license: cc-by-nc-4.0
|
|
|
|
|
|
|
| 3 |
---
|
| 4 |
|
| 5 |
-
|
| 6 |
|
| 7 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
license: cc-by-nc-4.0
|
| 3 |
+
pipeline_tag: image-to-image
|
| 4 |
+
library_name: diffusers
|
| 5 |
---
|
| 6 |
|
| 7 |
+
# NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation
|
| 8 |
|
| 9 |
+
This repository contains models for **NeuralRemaster**, also known as **Phase-Preserving Diffusion ($\phi$-PD)**, presented in the paper [NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation](https://arxiv.org/abs/2512.05106).
|
| 10 |
+
|
| 11 |
+
**NeuralRemaster** introduces a novel, model-agnostic reformulation of the diffusion process that preserves input phase while randomizing magnitude. This approach enables structure-aligned generation without requiring architectural changes or additional parameters, making it particularly well-suited for tasks demanding geometric consistency, such as re-rendering, simulation enhancement, and various image-to-image and video-to-video translation tasks. It also proposes Frequency-Selective Structured (FSS) noise for continuous control over structural rigidity.
|
| 12 |
+
|
| 13 |
+
- 📚 **Paper**: [NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation](https://arxiv.org/abs/2512.05106)
|
| 14 |
+
- 🌐 **Project Page**: [https://yuzeng-at-tri.github.io/ppd-page/](https://yuzeng-at-tri.github.io/ppd-page/)
|
| 15 |
+
- 💻 **Code**: [https://github.com/zengxianyu/PPD-examples](https://github.com/zengxianyu/PPD-examples)
|
| 16 |
+
|
| 17 |
+
The 4-step LoRA checkpoints for Wan2.2-14b mentioned in this repository are converted from [Wan2.2-Lightning](https://huggingface.co/lightx2v/).
|
| 18 |
+
|
| 19 |
+
## Usage
|
| 20 |
+
|
| 21 |
+
This repository provides example adaptations of SD1.5, FLUX.1-dev, and Wan2.2-14b with Phase-Preserving Diffusion.
|
| 22 |
+
|
| 23 |
+
1. **Install dependencies**:
|
| 24 |
+
|
| 25 |
+
```bash
|
| 26 |
+
pip install -r requirements.txt
|
| 27 |
+
pip install git+https://github.com/zengxianyu/structured-noise
|
| 28 |
+
```
|
| 29 |
+
|
| 30 |
+
2. **Download model weights**:
|
| 31 |
+
Download the model weights from [huggingface.co/zengxianyu/ppd/tree/main](https://huggingface.co/zengxianyu/ppd/tree/main) and place them in `models/ppd/`.
|
| 32 |
+
|
| 33 |
+
3. **Inference examples**:
|
| 34 |
+
Example input images can be found [here](https://huggingface.co/zengxianyu/ppd/tree/main).
|
| 35 |
+
|
| 36 |
+
**SD 1.5**:
|
| 37 |
+
|
| 38 |
+
```bash
|
| 39 |
+
PYTHONPATH=. python examples/image_synthesis/sd_text_to_image_ppd.py --input_image dog.jpg --radius 15 --prompt "A high quality picture captured by a professional camera. Picture of a cute border collie" --output output.png
|
| 40 |
+
```
|
| 41 |
+
|
| 42 |
+
**FLUX1.1-dev**:
|
| 43 |
+
|
| 44 |
+
```bash
|
| 45 |
+
PYTHONPATH=. CUDA_VISIBLE_DEVICES=6 python examples/flux/model_inference/FLUX.1-dev_ppd.py --input_image test2.jpg --prompt "$(cat test2.txt)" --output output.png --radius 30
|
| 46 |
+
```
|
| 47 |
+
|
| 48 |
+
**Wan2.2-14b**:
|
| 49 |
+
|
| 50 |
+
```bash
|
| 51 |
+
PYTHONPATH=. CUDA_VISIBLE_DEVICES=1 python examples/wanvideo/model_inference/Wan2.2-I2V-A14B_ppd.py --input_image output.png --input_video test2.mp4 --prompt "$(cat test2.txt)" --radius 30 --output output.mp4
|
| 52 |
+
```
|
| 53 |
+
|
| 54 |
+
4. For training, please refer to the original [DiffSynth-Studio repository](https://github.com/modelscope/DiffSynth-Studio).
|
| 55 |
+
|
| 56 |
+
## Citation
|
| 57 |
+
|
| 58 |
+
If you find this work useful, please cite the paper:
|
| 59 |
+
|
| 60 |
+
```bibtex
|
| 61 |
+
@article{zeng2025neuralremaster,
|
| 62 |
+
title = {{NeuralRemaster}: Phase-Preserving Diffusion for Structure-Aligned Generation},
|
| 63 |
+
author = {Zeng, Yu and Ochoa, Charles and Zhou, Mingyuan and Patel, Vishal M and
|
| 64 |
+
Guizilini, Vitor and McAllister, Rowan},
|
| 65 |
+
journal = {arXiv preprint arXiv:XXXX.XXXXX},
|
| 66 |
+
year = {2025}
|
| 67 |
+
}
|
| 68 |
+
```
|