---
license: apache-2.0
pipeline_tag: image-text-to-video
base_model: ByteDance/Bernini-R
---

**Quantized GGUF version of Bernini-R for ComfyUI.**

**Original model link:** [https://huggingface.co/ByteDance/Bernini-R](https://huggingface.co/ByteDance/Bernini-R)

**Watch us on YouTube:** [@VantageWithAI](https://www.youtube.com/@vantagewithai)

## Citation
If you use Bernini in your research, please cite:
```bibtex
@article{bernini,
title = {Bernini: Latent Semantic Planning for Video Diffusion},
author = {Chenchen Liu and Junyi Chen and Lei Li and Lu Chi and Mingzhen Sun and Zhuoying Li and Yi Fu and Ruoyu Guo and Yiheng Wu and Ge Bai and Zehuan Yuan},
journal = {arXiv preprint arXiv:2605.22344},
year = {2026}
}
```
## Acknowledgements
Bernini builds on several outstanding open-source projects:
- [Wan2.2-T2V-A14B](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B)
- [Qwen2.5-VL-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct)
- [VeOmni](https://github.com/ByteDance-Seed/VeOmni)

We thank the authors and communities of these projects for their contributions.
## License
Apache License 2.0. See [LICENSE](LICENSE).