File size: 605 Bytes
818053a b23d456 818053a b23d456 |
1 2 3 4 5 6 7 8 9 10 11 12 13 |
---
license: mit
pipeline_tag: text-to-video
library_name: transformers
---
# VideoPhy: Evaluating Physical Commonsense in Video Generation
This text-to-video model is part of the VideoPhy project, which benchmarks physical commonsense in video generation. It generates videos from text prompts, aiming to evaluate how well generated videos adhere to real-world physics.
[Project Website](https://videophy2.github.io/) | [Paper](https://arxiv.org/abs/2406.03520) | [GitHub](https://github.com/Hritikbansal/videophy)
For detailed use-case instructions, please refer to the project's GitHub repository. |