ZengrongLin's picture
Update README.md
738a8d3 verified
---
language:
- en
pipeline_tag: image-to-video
tags:
- video-generation
- image-to-video
- world-model
- robotics
- action-conditioned
- pytorch
library_name: pytorch
base_model:
- Wan-AI/Wan2.2-TI2V-5B
---
<div align="center">
<h1>Boundless-World-Model</h1>
<p align="center">
<strong>BWM: Physically consistent, action-conditioned video world model for robotic manipulation</strong>
</p>
<p align="center">
<a href="https://github.com/boundless-large-model/boundless-world-model"><img src="https://img.shields.io/badge/GitHub-Repository-blue?style=flat&logo=github" alt="GitHub Repository"></a>
<a href="https://huggingface.co/Wan-AI/Wan2.2-TI2V-5B"><img src="https://img.shields.io/badge/Base%20Model-Wan2.2--TI2V--5B-orange?style=flat&logo=huggingface" alt="Base Model"></a>
<a href="https://huggingface.co/spaces/WorldArena/WorldArena"><img src="https://img.shields.io/badge/Benchmark-WorldArena-yellow?style=flat" alt="WorldArena"></a>
</p>
</div>
## Model Details
| Property | Value |
|----------|-------|
| **Base Model** | [Wan2.2-TI2V-5B](https://huggingface.co/Wan-AI/Wan2.2-TI2V-5B) |
| **Resolution** | 480 x 640 |
| **Frames** | 81 frames |
| **Control Signals** | Robot action trajectories |
| **Architecture** | Trainable DiT + Action Encoder |
## Usage
To use these weights, please refer to [our GitHub repository](https://github.com/boundless-large-model/boundless-world-model).
## Acknowledgements
This project builds upon the following open-source projects and benchmarks:
- Wan2.2: https://github.com/Wan-Video/Wan2.2
- DiffSynth-Studio: https://github.com/modelscope/DiffSynth-Studio
- WorldArena: https://github.com/tsinghua-fib-lab/WorldArena/
- ABot-PhysWorld: https://github.com/amap-cvlab/ABot-PhysWorld