mfarre commited on
Commit
2fa62eb
·
verified ·
1 Parent(s): 65732f7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -1
README.md CHANGED
@@ -17,7 +17,8 @@ base_model:
17
 
18
  # SmolVLM2-500M-Video
19
 
20
- SmolVLM2-500M-Video is a tiny video model, member of the SmolVLM family. It accepts video, arbitrary sequences of image and text inputs to produce text outputs. It's designed for efficiency. SmolVLM2 is optimized for video but can answer questions about images, describe visual content, or transcribe text. Its lightweight architecture makes it suitable for on-device applications while maintaining strong performance on multimodal tasks. It can run inference on a video with 1.8GB of GPU RAM.
 
21
 
22
  ## Model Summary
23
 
 
17
 
18
  # SmolVLM2-500M-Video
19
 
20
+ SmolVLM2-500M-Video is a model optimized for video that accepts video, arbitrary sequences of image and text inputs to produce text outputs. It can answer questions about media files, compare images, describe visual content, or transcribe text.
21
+ Its lightweight architecture makes it suitable for on-device applications while maintaining strong performance on multimodal tasks. It can run inference on a video with 1.8GB of GPU RAM.
22
 
23
  ## Model Summary
24