Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
thu-ml
/
unidiffuser-v1
like
36
Follow
Tsinghua Machine Learning Group
39
Text-to-Image
Diffusers
UniDiffuserPipeline
image-to-text
image-captioning
image-variation
text-variation
multi-modality
generative model
License:
agpl-3.0
Model card
Files
Files and versions
xet
Community
6
Use this model
main
unidiffuser-v1
10.7 GB
5 contributors
History:
13 commits
patrickvonplaten
dg845
Upload 2 files (
#5
)
c70d410
over 2 years ago
clip_image_processor
Upload 2 files (#5)
over 2 years ago
clip_tokenizer
Upload 22 files (#3)
over 2 years ago
image_encoder
Upload 22 files (#3)
over 2 years ago
image_processor
Upload 22 files (#3)
over 2 years ago
scheduler
Upload 22 files (#3)
over 2 years ago
text_decoder
Upload 22 files (#3)
over 2 years ago
text_encoder
Upload 22 files (#3)
over 2 years ago
text_tokenizer
Upload 22 files (#3)
over 2 years ago
unet
Upload 22 files (#3)
over 2 years ago
vae
Upload 22 files (#3)
over 2 years ago
.gitattributes
1.48 kB
initial commit
almost 3 years ago
README.md
7.58 kB
Update README.md (#4)
over 2 years ago
autoencoder_kl.pth
335 MB
xet
autoencoder_kl, caption_ae
almost 3 years ago
caption_decoder.pth
1 GB
xet
Upload caption_decoder.pth
almost 3 years ago
model_index.json
699 Bytes
Upload 2 files (#5)
over 2 years ago
uvit_v1.pth
3.81 GB
xet
uvit_v1
almost 3 years ago