Improve model card

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +28 -3
README.md CHANGED
@@ -1,3 +1,28 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ pipeline_tag: text-to-video
4
+ tags:
5
+ - video-generation
6
+ - dpo
7
+ ---
8
+
9
+ # Mind the Generative Details: Direct Localized Detail Preference Optimization for Video Diffusion Models
10
+
11
+ This repository contains the weights for **LocalDPO**, a novel post-training framework that constructs localized preference pairs from real videos and optimizes alignment at the spatio-temporal region level for video diffusion models.
12
+
13
+ LocalDPO addresses the efficiency and ambiguity limitations of existing DPO methods. It treats high-quality real videos as positive samples and generates corresponding negatives by locally corrupting them with random spatio-temporal masks. Experiments on Wan2.1 and CogVideoX demonstrate that LocalDPO consistently improves video fidelity and temporal coherence.
14
+
15
+ - **Paper:** [Mind the Generative Details: Direct Localized Detail Preference Optimization for Video Diffusion Models](https://huggingface.co/papers/2601.04068)
16
+ - **Project Page:** [https://1170300714.github.io/LocalDPO/](https://1170300714.github.io/LocalDPO/)
17
+ - **Code:** [https://github.com/1170300714/Local-DPO](https://github.com/1170300714/Local-DPO)
18
+
19
+ ## Citation
20
+
21
+ ```bibtex
22
+ @article{huang2026mind,
23
+ title={Mind the Generative Details: Direct Localized Detail Preference Optimization for Video Diffusion Models},
24
+ author={Huang, Zitong and Zhang, Kaidong and Ding, Yukang and Gao, Chao and Ding, Rui and Chen, Ying and Zuo, Wangmeng},
25
+ journal={arXiv preprint arXiv:2601.04068},
26
+ year={2026}
27
+ }
28
+ ```