Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
sankim2
/
cosmos
like
2
Image-Text-to-Text
Transformers
vision
vision-language-model
contrastive learning
self-supervised learning
arxiv:
2412.01814
License:
mit
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
374c287
cosmos
24.5 GB
1 contributor
History:
16 commits
sankim2
Update README.md
374c287
verified
9 months ago
.gitattributes
1.52 kB
initial commit
9 months ago
README.md
1.5 kB
Update README.md
9 months ago
config.json
2 Bytes
Create config.json
9 months ago
cosmos_vitb16_cc12m.pt
2.44 GB
xet
Upload cosmos_vitb16_cc12m.pt
9 months ago
cosmos_vitb16_cc3m.pt
2.44 GB
xet
Upload cosmos_vitb16_cc3m.pt
9 months ago
cosmos_vitb16_merged30m.pt
2.44 GB
xet
Upload cosmos_vitb16_merged30m.pt
9 months ago
cosmos_vitb16_pixelprose.pt
2.44 GB
xet
Upload cosmos_vitb16_pixelprose.pt
9 months ago
cosmos_vitb16_yfcc15m.pt
2.44 GB
xet
Upload cosmos_vitb16_yfcc15m.pt
9 months ago
cosmos_vitb32_cc12m.pt
2.47 GB
xet
Upload cosmos_vitb32_cc12m.pt
9 months ago
cosmos_vitb32_cc3m.pt
2.47 GB
xet
Upload cosmos_vitb32_cc3m.pt
9 months ago
cosmos_vitb32_merged30m.pt
2.47 GB
xet
Upload cosmos_vitb32_merged30m.pt
9 months ago
cosmos_vitb32_pixelprose.pt
2.47 GB
xet
Upload cosmos_vitb32_pixelprose.pt
9 months ago
cosmos_vitb32_yfcc15m.pt
2.47 GB
xet
Upload cosmos_vitb32_yfcc15m.pt
9 months ago