arrow-hf's picture
Update README: reflect full mirror (model weights + tokenizer + ONNX)
d6391ce verified
metadata
tags:
  - smolvlm2
  - mirror
library_name: transformers
license: apache-2.0

SmolVLM2-500M-Video-Instruct (full mirror)

Full mirror of HuggingFaceTB/SmolVLM2-500M-Video-Instruct.

Includes:

  • model.safetensors (~1.9 GB PyTorch weights)
  • 14 ONNX variants under onnx/ (fp16, int8, q4, uint8, etc. for decoder / embed_tokens / vision_encoder)
  • Tokenizer files (tokenizer.json, vocab.json, merges.txt, added_tokens.json, special_tokens_map.json)
  • Processor configs (processor_config.json, preprocessor_config.json, chat_template.json)
  • generation_config.json, config.json

Mirrored via huggingface_hub.snapshot_download.

Usage

from transformers import AutoModel, AutoProcessor, AutoTokenizer
model = AutoModel.from_pretrained("arrow-hf/SmolVLM2-500M-Video-Instruct")
processor = AutoProcessor.from_pretrained("arrow-hf/SmolVLM2-500M-Video-Instruct")
tokenizer = AutoTokenizer.from_pretrained("arrow-hf/SmolVLM2-500M-Video-Instruct")

Related

The tokenizer is used by arrow-hf/smolvla-robotwin-stack-bowls-two-50pct (max_length=48). The SmolVLA policy is fine-tuned on top of this base VLM.