Improve model card: Add pipeline tag, library name, links, and usage examples

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +63 -2
README.md CHANGED
@@ -1,7 +1,68 @@
1
  ---
2
  license: cc-by-nc-4.0
 
 
3
  ---
4
 
5
- Models for [this repo](https://github.com/zengxianyu/ppd-examples) and [this paper](https://arxiv.org/abs/2512.05106) arxiv.org/abs/2512.05106
6
 
7
- The 4-step lora checkpoints for Wan2.2-14b are converted from [Wan2.2-Lightning](https://huggingface.co/lightx2v/)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: cc-by-nc-4.0
3
+ pipeline_tag: image-to-image
4
+ library_name: diffusers
5
  ---
6
 
7
+ # NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation
8
 
9
+ This repository contains models for **NeuralRemaster**, also known as **Phase-Preserving Diffusion ($\phi$-PD)**, presented in the paper [NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation](https://arxiv.org/abs/2512.05106).
10
+
11
+ **NeuralRemaster** introduces a novel, model-agnostic reformulation of the diffusion process that preserves input phase while randomizing magnitude. This approach enables structure-aligned generation without requiring architectural changes or additional parameters, making it particularly well-suited for tasks demanding geometric consistency, such as re-rendering, simulation enhancement, and various image-to-image and video-to-video translation tasks. It also proposes Frequency-Selective Structured (FSS) noise for continuous control over structural rigidity.
12
+
13
+ - 📚 **Paper**: [NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation](https://arxiv.org/abs/2512.05106)
14
+ - 🌐 **Project Page**: [https://yuzeng-at-tri.github.io/ppd-page/](https://yuzeng-at-tri.github.io/ppd-page/)
15
+ - 💻 **Code**: [https://github.com/zengxianyu/PPD-examples](https://github.com/zengxianyu/PPD-examples)
16
+
17
+ The 4-step LoRA checkpoints for Wan2.2-14b mentioned in this repository are converted from [Wan2.2-Lightning](https://huggingface.co/lightx2v/).
18
+
19
+ ## Usage
20
+
21
+ This repository provides example adaptations of SD1.5, FLUX.1-dev, and Wan2.2-14b with Phase-Preserving Diffusion.
22
+
23
+ 1. **Install dependencies**:
24
+
25
+ ```bash
26
+ pip install -r requirements.txt
27
+ pip install git+https://github.com/zengxianyu/structured-noise
28
+ ```
29
+
30
+ 2. **Download model weights**:
31
+ Download the model weights from [huggingface.co/zengxianyu/ppd/tree/main](https://huggingface.co/zengxianyu/ppd/tree/main) and place them in `models/ppd/`.
32
+
33
+ 3. **Inference examples**:
34
+ Example input images can be found [here](https://huggingface.co/zengxianyu/ppd/tree/main).
35
+
36
+ **SD 1.5**:
37
+
38
+ ```bash
39
+ PYTHONPATH=. python examples/image_synthesis/sd_text_to_image_ppd.py --input_image dog.jpg --radius 15 --prompt "A high quality picture captured by a professional camera. Picture of a cute border collie" --output output.png
40
+ ```
41
+
42
+ **FLUX1.1-dev**:
43
+
44
+ ```bash
45
+ PYTHONPATH=. CUDA_VISIBLE_DEVICES=6 python examples/flux/model_inference/FLUX.1-dev_ppd.py --input_image test2.jpg --prompt "$(cat test2.txt)" --output output.png --radius 30
46
+ ```
47
+
48
+ **Wan2.2-14b**:
49
+
50
+ ```bash
51
+ PYTHONPATH=. CUDA_VISIBLE_DEVICES=1 python examples/wanvideo/model_inference/Wan2.2-I2V-A14B_ppd.py --input_image output.png --input_video test2.mp4 --prompt "$(cat test2.txt)" --radius 30 --output output.mp4
52
+ ```
53
+
54
+ 4. For training, please refer to the original [DiffSynth-Studio repository](https://github.com/modelscope/DiffSynth-Studio).
55
+
56
+ ## Citation
57
+
58
+ If you find this work useful, please cite the paper:
59
+
60
+ ```bibtex
61
+ @article{zeng2025neuralremaster,
62
+ title = {{NeuralRemaster}: Phase-Preserving Diffusion for Structure-Aligned Generation},
63
+ author = {Zeng, Yu and Ochoa, Charles and Zhou, Mingyuan and Patel, Vishal M and
64
+ Guizilini, Vitor and McAllister, Rowan},
65
+ journal = {arXiv preprint arXiv:XXXX.XXXXX},
66
+ year = {2025}
67
+ }
68
+ ```