Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
shivamg05
/
SmolVLM2-500M-Audio-Aligned
like
0
Image-to-Text
Transformers
Safetensors
agkphysics/AudioSet
English
smolvlm
arxiv:
1910.09700
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
main
SmolVLM2-500M-Audio-Aligned
File size: 67 Bytes
38793c4
1
2
3
4
5
{
"image_seq_len"
:
64
,
"processor_class"
:
"SmolVLMProcessor"
}