p-doom
/

jasmine-diffusion-coinrun

Model card Files Files and versions

jasmine-diffusion-coinrun / README.md

mihirma's picture

Create README.md

b8ee1b9 verified 2 months ago

|

history blame contribute delete

627 Bytes

	---
	license: cc0-1.0
	---

	# Jasmine Diffusion Checkpoint

	Pretrained diffusion-based world model from the [Jasmine](https://github.com/p-doom/jasmine) codebase.
	Trained on the CoinRun dataset for action-conditioned video generation using the diffusion-forcing objective (Chen et al., 2024).

	---

	### Model Details
	- Architecture: ST-DiT (spatio-temporal diffusion transformer)
	- Input: 16-frame sequences (64×64) + latent actions
	- Training Environment: [CoinRun (Cobbe et al., 2020)](https://huggingface.co/datasets/p-doom/coinrun-dataset)
	- Objective: Diffusion forcing (x-prediction)