File size: 605 Bytes
818053a
 
b23d456
 
818053a
 
b23d456
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
---
license: mit
pipeline_tag: text-to-video
library_name: transformers
---

# VideoPhy: Evaluating Physical Commonsense in Video Generation

This text-to-video model is part of the VideoPhy project, which benchmarks physical commonsense in video generation.  It generates videos from text prompts, aiming to evaluate how well generated videos adhere to real-world physics.

[Project Website](https://videophy2.github.io/) | [Paper](https://arxiv.org/abs/2406.03520) | [GitHub](https://github.com/Hritikbansal/videophy)

For detailed use-case instructions, please refer to the project's GitHub repository.