Mozilla/distilvit
Tags: Image-to-Text, Transformers.js, PyTorch, ONNX, Safetensors, vision-encoder-decoder, image-captioning
Dataset: Mozilla/flickr30k-transformed-captions-gpt4o
License: apache-2.0
Revision: refs/pr/1
Path: distilvit (6.57 GB)
1 contributor · History: 12 commits
Latest commit: e83ace9 (verified) by Xenova (HF Staff), over 1 year ago
    [DO NOT MERGE YET - STILL TESTING] Add more ONNX weights (dtypes)
Name                      Size            Updated             Last commit
onnx/                     -               over 1 year ago     [DO NOT MERGE YET - STILL TESTING] Add more ONNX weights (dtypes)
.gitattributes            1.52 kB         almost 2 years ago  initial commit
README.md                 2.09 kB         over 1 year ago     Update README.md
config.json               4.87 kB         over 1 year ago     Upload 16 files
generation_config.json    89 Bytes        over 1 year ago     Upload 16 files
merges.txt                456 kB          almost 2 years ago  Upload 15 files
metrics.txt               657 Bytes       over 1 year ago     Model save
model.safetensors         730 MB (xet)    over 1 year ago     Model save
preprocessor_config.json  616 Bytes       over 1 year ago     Model save
quantize_config.json      3.11 kB         over 1 year ago     Upload 16 files
special_tokens_map.json   583 Bytes       almost 2 years ago  Upload 15 files
tokenizer.json            2.11 MB         almost 2 years ago  Upload 15 files
tokenizer_config.json     476 Bytes       almost 2 years ago  Upload 15 files
training_args.bin         5.18 kB (xet)   over 1 year ago     Model save
vocab.json                798 kB          almost 2 years ago  Upload 15 files