Upload folder using huggingface_hub
- .gitattributes +3 -0
- README.md +14 -3
- figures/data_gen.png +3 -0
- figures/first.png +3 -0
- figures/pipeline.png +3 -0
- wan_example.ckpt +3 -0
.gitattributes
CHANGED
@@ -33,3 +33,6 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+figures/data_gen.png filter=lfs diff=lfs merge=lfs -text
+figures/first.png filter=lfs diff=lfs merge=lfs -text
+figures/pipeline.png filter=lfs diff=lfs merge=lfs -text
README.md
CHANGED
@@ -1,3 +1,14 @@
-
-
---
+# Photography Perspective Composition: Towards Aesthetic Perspective Recommendation
+
+<a href="https://arxiv.org/abs/2505.20655"><img src='https://img.shields.io/badge/arXiv-2505.20655-red?style=flat&logo=arXiv&logoColor=red' alt='arxiv'></a>
+<a href="https://vivocameraresearch.github.io/ppc/"><img src='https://img.shields.io/badge/Project-Page-Green' alt='GitHub'></a>
+
+
+## Motivation
+<p align="center">
+<img src="figures/first.png" width="100%">
+</p>
+Traditional crop-based methods (a) focus on learning crop templates for better composition. However, when scenes contain chaotic arrangements of subjects, cropping alone rarely yields satisfactory results. Perspective transformation (b) addresses these challenges by adjusting spatial relationships between subjects (e.g., person and tree, red arrow) and scene orientation.
+
+## Abstract
+Traditional photography composition approaches are dominated by 2D cropping-based methods. However, these methods fall short when scenes contain poorly arranged subjects. Professional photographers often employ perspective adjustment as a form of 3D recomposition, modifying the projected 2D relationships between subjects while maintaining their actual spatial positions to achieve better compositional balance. Inspired by this artistic practice, we propose photography perspective composition (PPC), extending beyond traditional cropping-based methods. However, implementing the PPC faces significant challenges: the scarcity of perspective transformation datasets and undefined assessment criteria for perspective quality. To address these challenges, we present three key contributions: (1) An automated framework for building PPC datasets through expert photographs. (2) A video generation approach that demonstrates the transformation process from suboptimal to optimal perspectives. (3) A perspective quality assessment (PQA) model constructed based on human performance. Our approach is concise and requires no additional prompt instructions or camera trajectories, helping and guiding ordinary users to enhance their composition skills.
figures/data_gen.png
ADDED (Git LFS)
figures/first.png
ADDED (Git LFS)
figures/pipeline.png
ADDED (Git LFS)
wan_example.ckpt
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:6603f6b1fdd7b7adbc57b91c153647de5afe7c8367848c18ba9ea49e817e55a2
+size 306977010
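The three lines added for wan_example.ckpt are a Git LFS pointer, not the checkpoint itself: the repository stores only the spec version, the content hash, and the byte size, while the binary lives in LFS storage. As a minimal sketch of how such a pointer could be read, the snippet below parses the text shown in this diff. The function name `parse_lfs_pointer` and the returned dictionary keys are illustrative assumptions and are not part of this repository or of huggingface_hub.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Each pointer line has the form "<key> <value>".
        key, _, value = line.partition(" ")
        fields[key] = value

    # A pointer must declare the LFS spec version it follows.
    if not fields.get("version", "").startswith("https://git-lfs.github.com/spec/"):
        raise ValueError("not a Git LFS pointer")

    # The oid is written as "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# Pointer content copied from the wan_example.ckpt diff above.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:6603f6b1fdd7b7adbc57b91c153647de5afe7c8367848c18ba9ea49e817e55a2
size 306977010
"""

info = parse_lfs_pointer(POINTER)
print(info["hash_algo"], info["size"])
```

After downloading the actual checkpoint, the same digest and size could be compared against the file on disk to confirm that the download is complete.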