|
|
--- |
|
|
license: cc0-1.0 |
|
|
--- |
|
|
|
|
|
# Jasmine Diffusion Checkpoint |
|
|
|
|
|
Pretrained **diffusion-based world model** from the [Jasmine](https://github.com/p-doom/jasmine) codebase. |
|
|
Trained on the **CoinRun** dataset for action-conditioned video generation using the **diffusion-forcing** objective (Chen et al., 2024). |
|
|
|
|
|
--- |
|
|
|
|
|
### Model Details |
|
|
- **Architecture:** ST-DiT (spatio-temporal diffusion transformer) |
|
|
- **Input:** 16-frame sequences (64×64) + latent actions |
|
|
- **Training Environment:** [CoinRun (Cobbe et al., 2020)](https://huggingface.co/datasets/p-doom/coinrun-dataset) |
|
|
- **Objective:** Diffusion forcing (x-prediction) |
|
|
|