WBench-weights / README.md
Kaining's picture
Fix bibtex: year 2025->2026, update citation key, add ModelScope badge (#3)
1c5fde9
---
license: apache-2.0
pipeline_tag: other
tags:
- video-evaluation
- world-model
- benchmark
---
<div align="center">
<img src="assets/longcat-logo-full.png" width="300">
<h1>WBench Weights</h1>
<p>Pre-trained model weights for WBench evaluation.</p>
[![Paper](https://img.shields.io/badge/Paper-red?style=for-the-badge&logo=arxiv&logoColor=white)](https://huggingface.co/papers/2605.25874)
[![Homepage](https://img.shields.io/badge/Homepage-blue?style=for-the-badge&logo=google-chrome&logoColor=white)](https://meituan-longcat.github.io/WBench/)
[![Code](https://img.shields.io/badge/Code-black?style=for-the-badge&logo=github&logoColor=white)](https://github.com/meituan-longcat/WBench)
[![Dataset](https://img.shields.io/badge/Dataset-4285F4?style=for-the-badge&logo=huggingface&logoColor=white)](https://huggingface.co/datasets/meituan-longcat/WBench)
[![ModelScope](https://img.shields.io/badge/ModelScope-6B4EFF?style=for-the-badge&logo=data:image/svg+xml;base64,PHN2ZyBmaWxsPSJ3aGl0ZSIgZmlsbC1ydWxlPSJldmVub2RkIiBoZWlnaHQ9IjFlbSIgc3R5bGU9ImZsZXg6bm9uZTtsaW5lLWhlaWdodDoxIiB2aWV3Qm94PSIwIDAgMjQgMjQiIHdpZHRoPSIxZW0iIHhtbG5zPSJodHRwOi8vd3d3LnczLm9yZy8yMDAwL3N2ZyI+PHRpdGxlPk1vZGVsU2NvcGU8L3RpdGxlPjxwYXRoIGQ9Ik0yLjY2NyA1LjNIOHYyLjY2N0g1LjMzM3YyLjY2NkgyLjY2N1Y4LjQ2N0guNXYyLjE2NmgyLjE2N1YxMy4zSDBWNy45NjdoMi42NjdWNS4zek0yLjY2NyAxMy4zaDIuNjY2djIuNjY3SDh2Mi42NjZIMi42NjdWMTMuM3pNOCAxMC42MzNoMi42NjdWMTMuM0g4di0yLjY2N3pNMTMuMzMzIDEzLjN2Mi42NjdoLTIuNjY2VjEzLjNoMi42NjZ6TTEzLjMzMyAxMy4zdi0yLjY2N0gxNlYxMy4zaC0yLjY2N3oiPjwvcGF0aD48cGF0aCBjbGlwLXJ1bGU9ImV2ZW5vZGQiIGQ9Ik0yMS4zMzMgMTMuM3YtMi42NjdoLTIuNjY2VjcuOTY3SDE2VjUuM2g1LjMzM3YyLjY2N0gyNFYxMy4zaC0yLjY2N3ptMC0yLjY2N0gyMy41VjguNDY3aC0yLjE2N3YyLjE2NnoiPjwvcGF0aD48cGF0aCBkPSJNMjEuMzMzIDEzLjN2NS4zMzNIMTZ2LTIuNjY2aDIuNjY3VjEzLjNoMi42NjZ6Ij48L3BhdGg+PC9zdmc+&logoColor=white)](https://www.modelscope.cn/models/meituan-longcat/WBench-weights)
</div>
---
This repository contains the consolidated model weights for **WBench**, a comprehensive multi-turn benchmark for interactive video world model evaluation. WBench evaluates world models along five dimensions: video quality, setting adherence, interaction adherence, consistency, and physics compliance. It contains 289 test cases and 1,058 interaction turns covering diverse scenes, styles, subjects, and perspectives.
## Usage
Please refer to the [WBench GitHub repository](https://github.com/meituan-longcat/WBench) for installation and evaluation instructions. You can download the weights using the Hugging Face CLI:
```bash
huggingface-cli download meituan-longcat/WBench-weights --local-dir weights/
```
## Disclaimer
We consolidate these weights into a single repository to help the community quickly deploy the WBench evaluation framework without hunting for individual checkpoints. These weights are redistributed solely for academic research and evaluation purposes. All rights belong to the original authors. See [LICENSE_NOTICE.md](LICENSE_NOTICE.md) for per-model licenses. If you believe any content infringes your rights, please contact us and we will remove it promptly:
- **Kaining Ying**: kaining.ying.cv@gmail.com
- **Siyu Ren**: rensiyu07@meituan.com
## Citation
```bibtex
@article{ying2026wbenchcomprehensivemultiturnbenchmark,
title={WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation},
author={Ying, Kaining and Hu, Hengrui and Ren, Siyu and Li, Jiamu and Chen, Fengjiao and Wang, Ziwen and Cao, Xuezhi and Cai, Xunliang and Ding, Henghui},
journal={arXiv preprint arXiv:2605.25874},
year={2026}
}
```