---
license: mit
language:
- en
pipeline_tag: image-to-video
library_name: diffusers
---
# Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model
## 📝 Overview
**Matrix-Game-2.0 (1.8B)** is an interactive world model that generates long videos on the fly via few-step autoregressive diffusion.

## ✨ Key Features
- 🚀 **Feature 1**: **Real-Time Distillation.** Efficient few-step diffusion for streaming video synthesis at 25 FPS, producing minute-long, high-fidelity videos across complex environments.
- 🖱️ **Feature 2**: **Precise Action Injection.** A mouse/keyboard-to-frame module that embeds user inputs as direct interactions, enabling frame-level control and dynamic response in generated videos.
- 🎬 **Feature 3**: **Massive Interactive Data Pipeline.** A scalable production system for Unreal Engine & GTA5 that generates ~1350 hours of high-quality interactive video data, covering diverse scenes with frame-level realism.

## 🔥 Latest Updates
* [2025-08] 🎉 Initial release of the Matrix-Game-2.0 model

## 🚀 Performance Comparison
### GameWorld Score Benchmark Comparison

| Model    | Image Quality ↑ | Aesthetic Quality ↑ | Temporal Cons. ↑ | Motion Smooth. ↑ | Keyboard Acc. ↑ | Mouse Acc. ↑ | Object Cons. ↑ | Scenario Cons. ↑ |
|----------|-----------------|---------------------|------------------|------------------|-----------------|--------------|----------------|------------------|
| Oasis    | 0.27            | 0.27                | 0.82             | **0.99**         | 0.73            | 0.56         | 0.18           | **0.84**         |
| **Ours** | **0.61**        | **0.50**            | **0.94**         | 0.98             | **0.91**        | **0.95**     | **0.64**       | 0.80             |

**Metric Descriptions**:
- **Image Quality** / **Aesthetic Quality**: Visual fidelity and perceptual appeal of generated frames
- **Temporal Consistency** / **Motion Smoothness**: Temporal coherence and smoothness between frames
- **Keyboard Accuracy** / **Mouse Accuracy**: Accuracy in following user control signals
- **Object Consistency**: Geometric stability and consistency of objects over time
- **Scenario Consistency**: Consistency of the overall scene over time

Higher is better for every metric; the best score in each column is shown in bold.

Please check our [GameWorld](https://github.com/SkyworkAI/Matrix-Game/tree/main/GameWorldScore) benchmark for implementation details.

## 🚀 Quick Start

```bash
# clone the repository:
git clone xxx
cd Matrix-Game-2.0

# install dependencies:
pip install -r requirements.txt

# inference
bash xxx.sh
```

## ⭐ Acknowledgements
We would like to express our gratitude to:
- [Diffusers](https://github.com/huggingface/diffusers) for their excellent diffusion model framework
- [SkyReels-V2](https://github.com/SkyworkAI/SkyReels-V2) for their strong base model
- [Self-Forcing](https://github.com/guandeh17/Self-Forcing) for their excellent work
- [MineRL](https://github.com/minerllabs/minerl) for their excellent gym framework
- [Video-Pre-Training](https://github.com/openai/Video-Pre-Training) for their accurate Inverse Dynamics Model
- [GameFactory](https://github.com/KwaiVGI/GameFactory) for the idea behind the action control module

We are grateful to the broader research community for their open exploration and contributions to the field of interactive world generation.

## 📎 Citation
If you find this project useful, please cite our paper:

```bibtex
xxx
```
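
## 🧩 Illustrative Sketch: Per-Frame Actions
Frame-level control at 25 FPS, as described under Key Features, means one control signal per generated frame and roughly 40 ms to produce each frame. The sketch below only illustrates that bookkeeping. It is not the repository's API: the key layout `KEYS`, the `FrameAction` structure, and the helpers `encode_action`, `hold`, and `frame_budget_ms` are hypothetical names introduced here, and the model's real action format may differ.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple

FPS = 25  # streaming rate stated in this README

# Hypothetical key layout, chosen only for illustration.
KEYS: Tuple[str, ...] = ("W", "A", "S", "D")


@dataclass(frozen=True)
class FrameAction:
    """One control signal per generated frame (illustrative format)."""
    keyboard: Tuple[int, ...]   # multi-hot vector over KEYS
    mouse: Tuple[float, float]  # (dx, dy) camera movement


def encode_action(pressed: Sequence[str],
                  mouse: Tuple[float, float] = (0.0, 0.0)) -> FrameAction:
    """Turn pressed keys and a mouse delta into a fixed-size per-frame action."""
    unknown = set(pressed) - set(KEYS)
    if unknown:
        raise ValueError(f"unsupported keys: {sorted(unknown)}")
    keyboard = tuple(1 if key in pressed else 0 for key in KEYS)
    return FrameAction(keyboard=keyboard, mouse=(float(mouse[0]), float(mouse[1])))


def hold(action: FrameAction, seconds: float, fps: int = FPS) -> List[FrameAction]:
    """Repeat one action for every frame generated during `seconds`."""
    return [action] * round(seconds * fps)


def frame_budget_ms(fps: int = FPS) -> float:
    """Maximum time available to generate one frame while staying real-time."""
    return 1000.0 / fps


if __name__ == "__main__":
    forward = encode_action(["W"])
    turn = encode_action(["W", "D"], mouse=(0.1, 0.0))
    # Walk forward for 2 seconds, then walk and turn for 1 second.
    script = hold(forward, 2.0) + hold(turn, 1.0)
    print(len(script))        # number of per-frame actions in the 3-second script
    print(frame_budget_ms())  # per-frame time budget in milliseconds
```

A multi-hot keyboard vector plus a continuous mouse delta keeps every frame's action the same size, which is one simple way to pair actions with frames in a streaming loop; a one-minute clip at this rate needs 1500 such entries.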