Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
rishitdagli
/
see-2-sound
like
6
vision
audio
spatial audio
audio generation
music
art
Eval Results (legacy)
arxiv:
2406.06612
Model card
Files
Files and versions
xet
Community
main
see-2-sound
/
codi
36.5 GB
1 contributor
History:
1 commit
rishitdagli
Add models
5a9dbea
over 1 year ago
codi_audio.pth
6.91 GB
xet
Add models
over 1 year ago
codi_encoder.pth
6.03 GB
xet
Add models
over 1 year ago
codi_text.pth
12.4 GB
xet
Add models
over 1 year ago
codi_video.pth
11.2 GB
xet
Add models
over 1 year ago