Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
byungki-kwon
/
JointDiT
like
1
Text-to-Image
Diffusers
English
diffusion-transformer
multimodal
joint-generation
depth-estimation
depth-to-image
text-to-multimodal
arxiv:
2505.00482
License:
mit
Model card
Files
Files and versions
xet
Community
Use this model
main
JointDiT
4.72 GB
2 contributors
History:
12 commits
byungki-kwon
Update README.md
2cc1b64
verified
4 months ago
.gitattributes
1.57 kB
Upload jointdit.gif
4 months ago
README.md
1.54 kB
Update README.md
4 months ago
jointdit.gif
14.9 MB
xet
Upload jointdit.gif
4 months ago
jointdit_addons.safetensors
Safe
4.71 GB
xet
Upload jointdit_addons.safetensors
6 months ago