ByteDance
/

ContentV-8B

ContentVPipeline

Model card Files Files and versions

linwf commited on Jun 3, 2025

Commit

2fadbb0

·

verified ·

1 Parent(s): 0f3e199

Update README.md

Files changed (1) hide show

README.md +49 -3

README.md CHANGED Viewed

@@ -1,3 +1,49 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+---
+# ContentV: Efficient Training of Video Generation Models with Limited Compute
+This project presents ContentV, a novel framework that accelerates DiT-based video generation through three key innovations:
+- A minimalist model design that enables effective reuse of pre-trained image generation models for video synthesis
+- A comprehensive exploration of a multi-stage, efficient training strategy based on Flow Matching
+- A low-cost Reinforcement Learning with Human Feedback (RLHF) approach that further enhances generation quality without the need for additional human annotations.
+## Quickstart
+#### Recommended PyTorch Version
+- GPU: torch >= 2.3.1 (CUDA >= 12.2)
+- NPU: torch and torch-npu >= 2.1.0 (CANN >= 8.0.RC2). Please refer to [Ascend Extension for PyTorch](https://gitee.com/ascend/pytorch) for the installation of torch-npu.
+#### Installation
+```sh
+git clone https://github.com/bytedance/ContentV.git
+pip3 install -r ContentV/requirements.txt
+```
+#### T2V Generation
+```sh
+cd ContentV
+## For GPU
+python3 demo.py
+## For NPU
+USE_ASCEND_NPU=1 python3 demo.py
+```
+## Todo List
+- [x] Inference code and checkpoints
+- [ ] Training code of RLHF
+## License
+This code repository and part of the model weights are licensed under the [Apache 2.0 License](https://www.apache.org/licenses/LICENSE-2.0). Please note that:
+- MM DiT are derived from [Stable Diffusion 3.5 Large](https://huggingface.co/stabilityai/stable-diffusion-3.5-large) and trained with video samples. This Stability AI Model is licensed under the [Stability AI Community License](https://stability.ai/community-license-agreement), Copyright ©  Stability AI Ltd. All Rights Reserved
+- Video VAE from [Wan2.1](https://huggingface.co/Wan-AI/Wan2.1-T2V-14B) is licensed under [Apache 2.0 License](https://huggingface.co/Wan-AI/Wan2.1-T2V-14B/blob/main/LICENSE.txt)
+## Acknowledgement
+* [Stable Diffusion 3.5 Large](https://huggingface.co/stabilityai/stable-diffusion-3.5-large)
+* [Wan2.1](https://github.com/Wan-Video/Wan2.1)
+* [Diffusers](https://github.com/huggingface/diffusers)
+* [HuggingFace](https://huggingface.co)