Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
antflydb
/
clipclap
like
0
Follow
Antfly, Inc.
4
Feature Extraction
ONNX
OpenSound/AudioCaps
onnxruntime
multimodal
clip
clap
audio
image
text
embeddings
antfly
termite
License:
mit
Model card
Files
Files and versions
xet
Community
main
clipclap
896 MB
Ctrl+K
Ctrl+K
1 contributor
History:
7 commits
CarpenterAnt91
Create README.md
2530c24
verified
about 2 months ago
.gitattributes
Safe
1.88 kB
Update CLIPCLAP model with trained audio projection
about 2 months ago
README.md
Safe
5.42 kB
Create README.md
about 2 months ago
audio_model.onnx
Safe
3.32 MB
xet
Update CLIPCLAP model: contrastive loss training on AudioCaps audio embeddings
about 2 months ago
audio_model.onnx.data
277 MB
xet
Update CLIPCLAP model with trained audio projection
about 2 months ago
audio_projection.onnx
Safe
12.7 kB
xet
Update CLIPCLAP model: contrastive loss training on AudioCaps audio embeddings
about 2 months ago
audio_projection.onnx.data
Safe
4.26 MB
xet
Update CLIPCLAP model: contrastive loss training on AudioCaps audio embeddings
about 2 months ago
clip_config.json
Safe
411 Bytes
Update CLIPCLAP model with trained audio projection
about 2 months ago
processor_config.json
Safe
581 Bytes
Update CLIPCLAP model with trained audio projection
about 2 months ago
projection_training_metadata.json
Safe
294 Bytes
Update CLIPCLAP model: contrastive loss training on AudioCaps audio embeddings
about 2 months ago
text_model.onnx
Safe
1.24 MB
xet
Update CLIPCLAP model with trained audio projection
about 2 months ago
text_model.onnx.data
253 MB
xet
Update CLIPCLAP model with trained audio projection
about 2 months ago
text_projection.onnx
Safe
339 Bytes
xet
Update CLIPCLAP model with trained audio projection
about 2 months ago
text_projection.onnx.data
Safe
1.05 MB
xet
Update CLIPCLAP model with trained audio projection
about 2 months ago
tokenizer.json
Safe
3.64 MB
Update CLIPCLAP model with trained audio projection
about 2 months ago
tokenizer_config.json
Safe
322 Bytes
Update CLIPCLAP model with trained audio projection
about 2 months ago
visual_model.onnx
Safe
1.14 MB
xet
Update CLIPCLAP model with trained audio projection
about 2 months ago
visual_model.onnx.data
350 MB
xet
Update CLIPCLAP model with trained audio projection
about 2 months ago
visual_projection.onnx
Safe
341 Bytes
xet
Update CLIPCLAP model with trained audio projection
about 2 months ago
visual_projection.onnx.data
Safe
1.57 MB
xet
Update CLIPCLAP model with trained audio projection
about 2 months ago