Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
sankim2
/
cosmos
like
2
Image-Text-to-Text
Transformers
vision
vision-language-model
contrastive learning
self-supervised learning
arxiv:
2412.01814
License:
mit
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
refs/pr/1
cosmos
24.5 GB
1 contributor
History:
16 commits
nielsr
HF Staff
Add pipeline tag and library name
2a8823e
verified
10 months ago
.gitattributes
1.52 kB
initial commit
11 months ago
README.md
1.5 kB
Add pipeline tag and library name
10 months ago
config.json
2 Bytes
Create config.json
10 months ago
cosmos_vitb16_cc12m.pt
2.44 GB
xet
Upload cosmos_vitb16_cc12m.pt
11 months ago
cosmos_vitb16_cc3m.pt
2.44 GB
xet
Upload cosmos_vitb16_cc3m.pt
11 months ago
cosmos_vitb16_merged30m.pt
2.44 GB
xet
Upload cosmos_vitb16_merged30m.pt
11 months ago
cosmos_vitb16_pixelprose.pt
2.44 GB
xet
Upload cosmos_vitb16_pixelprose.pt
11 months ago
cosmos_vitb16_yfcc15m.pt
2.44 GB
xet
Upload cosmos_vitb16_yfcc15m.pt
11 months ago
cosmos_vitb32_cc12m.pt
2.47 GB
xet
Upload cosmos_vitb32_cc12m.pt
11 months ago
cosmos_vitb32_cc3m.pt
2.47 GB
xet
Upload cosmos_vitb32_cc3m.pt
11 months ago
cosmos_vitb32_merged30m.pt
2.47 GB
xet
Upload cosmos_vitb32_merged30m.pt
11 months ago
cosmos_vitb32_pixelprose.pt
2.47 GB
xet
Upload cosmos_vitb32_pixelprose.pt
11 months ago
cosmos_vitb32_yfcc15m.pt
2.47 GB
xet
Upload cosmos_vitb32_yfcc15m.pt
11 months ago