Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
cvssp
/
audioldm2
like
66
Follow
Centre for Vision, Speech and Signal Processing - University of Surrey
78
Diffusers
Safetensors
AudioLDM2Pipeline
arxiv:
2308.05734
License:
cc-by-nc-sa-4.0
Model card
Files
Files and versions
xet
Community
6
Use this model
refs/pr/4
audioldm2
/
projection_model
/
config.json
Sanchit Gandhi
Add model weights and config
efd334c
over 2 years ago
raw
Copy download link
history
blame
Safe
173 Bytes
{
"_class_name"
:
"AudioLDM2ProjectionModel"
,
"_diffusers_version"
:
"0.20.0.dev0"
,
"langauge_model_dim"
:
768
,
"text_encoder_1_dim"
:
1024
,
"text_encoder_dim"
:
512
}