antflydb/clipclap
Antfly, Inc.
Tags: Feature Extraction, ONNX, OpenSound/AudioCaps, onnxruntime, multimodal, clip, clap, audio, image, text, embeddings, antfly, termite
License: mit
Branch: main, clipclap, 896 MB, 1 contributor
History: 7 commits
Latest commit: CarpenterAnt91, "Create README.md" (2530c24, verified), 10 days ago
| File | Size | Storage | Last commit | Updated |
|---|---|---|---|---|
| .gitattributes | 1.88 kB | | Update CLIPCLAP model with trained audio projection | 13 days ago |
| README.md | 5.42 kB | | Create README.md | 10 days ago |
| audio_model.onnx | 3.32 MB | xet | Update CLIPCLAP model: contrastive loss training on AudioCaps audio embeddings | 12 days ago |
| audio_model.onnx.data | 277 MB | xet | Update CLIPCLAP model with trained audio projection | 13 days ago |
| audio_projection.onnx | 12.7 kB | xet | Update CLIPCLAP model: contrastive loss training on AudioCaps audio embeddings | 12 days ago |
| audio_projection.onnx.data | 4.26 MB | xet | Update CLIPCLAP model: contrastive loss training on AudioCaps audio embeddings | 12 days ago |
| clip_config.json | 411 Bytes | | Update CLIPCLAP model with trained audio projection | 13 days ago |
| processor_config.json | 581 Bytes | | Update CLIPCLAP model with trained audio projection | 13 days ago |
| projection_training_metadata.json | 294 Bytes | | Update CLIPCLAP model: contrastive loss training on AudioCaps audio embeddings | 12 days ago |
| text_model.onnx | 1.24 MB | xet | Update CLIPCLAP model with trained audio projection | 13 days ago |
| text_model.onnx.data | 253 MB | xet | Update CLIPCLAP model with trained audio projection | 13 days ago |
| text_projection.onnx | 339 Bytes | xet | Update CLIPCLAP model with trained audio projection | 13 days ago |
| text_projection.onnx.data | 1.05 MB | xet | Update CLIPCLAP model with trained audio projection | 13 days ago |
| tokenizer.json | 3.64 MB | | Update CLIPCLAP model with trained audio projection | 13 days ago |
| tokenizer_config.json | 322 Bytes | | Update CLIPCLAP model with trained audio projection | 13 days ago |
| visual_model.onnx | 1.14 MB | xet | Update CLIPCLAP model with trained audio projection | 13 days ago |
| visual_model.onnx.data | 350 MB | xet | Update CLIPCLAP model with trained audio projection | 13 days ago |
| visual_projection.onnx | 341 Bytes | xet | Update CLIPCLAP model with trained audio projection | 13 days ago |
| visual_projection.onnx.data | 1.57 MB | xet | Update CLIPCLAP model with trained audio projection | 13 days ago |