---
license: cc0-1.0
---
# Jasmine Diffusion Checkpoint
Pretrained **diffusion-based world model** from the [Jasmine](https://github.com/p-doom/jasmine) codebase.
Trained on the **CoinRun** dataset for action-conditioned video generation using the **diffusion-forcing** objective (Chen et al., 2024).

---
### Model Details
- **Architecture:** ST-DiT (spatio-temporal diffusion transformer)
- **Input:** 16-frame sequences at 64×64 resolution, plus latent actions
- **Training Environment:** [CoinRun (Cobbe et al., 2020)](https://huggingface.co/datasets/p-doom/coinrun-dataset)
- **Objective:** Diffusion forcing (x-prediction)
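
To make the objective concrete, here is a minimal NumPy sketch of one diffusion-forcing training step with x-prediction. Each frame in a 16-frame clip gets its own independent noise level, and the loss compares the predicted clean frames with the true clean frames. This is an illustration only, not the Jasmine implementation: the cosine noise schedule, the uniform sampling of noise levels, the 3-channel frame layout, and all function names are assumptions.

```python
import numpy as np


def sample_frame_noise_levels(rng, batch, frames):
    """Draw an independent noise level in [0, 1) for every frame (assumed uniform)."""
    return rng.uniform(0.0, 1.0, size=(batch, frames))


def add_noise(rng, clean, t):
    """Corrupt each frame according to its own noise level t.

    Uses an assumed variance-preserving cosine schedule:
    noisy = cos(pi/2 * t) * clean + sin(pi/2 * t) * noise.
    """
    # Broadcast the per-frame level over height, width, and channels.
    t = t[..., None, None, None]
    alpha = np.cos(0.5 * np.pi * t)
    sigma = np.sin(0.5 * np.pi * t)
    noise = rng.standard_normal(clean.shape)
    return alpha * clean + sigma * noise


def x_prediction_loss(predicted_clean, clean):
    """Mean squared error between the predicted and the true clean frames."""
    return float(np.mean((predicted_clean - clean) ** 2))


rng = np.random.default_rng(0)

# Stand-in batch: 2 clips of 16 frames at 64x64 with 3 channels (channel count assumed).
clean = rng.uniform(-1.0, 1.0, size=(2, 16, 64, 64, 3))

t = sample_frame_noise_levels(rng, batch=2, frames=16)
noisy = add_noise(rng, clean, t)

# A real model would map (noisy, t, latent actions) to a clean-frame estimate.
# Here the noisy input itself stands in for that prediction.
loss = x_prediction_loss(noisy, clean)
```

Because the noise level varies per frame, a model trained this way sees clips where some frames are nearly clean and others heavily corrupted, which is what allows frame-by-frame rollout at sampling time.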