Image limit

by swtb - opened May 6, 2024

Discussion

swtb

May 6, 2024

Is there a limit to how many images can be supplied as context?

klldmofashi

Efficient-Large-Model org May 6, 2024

•

edited May 6, 2024

Is there a limit to how many images can be supplied as context

First, please use VILA1.5 models. that will give you better performance
Second, for VILA1.5 models, in theory you can fit in 20 images for 8B (196 tokens/image and 4096 context length) but we only tested in-context learning w/ 2 images and video input w/ 8 frames.

klldmofashi changed discussion status to closed May 6, 2024

klldmofashi changed discussion status to open May 6, 2024

swtb

May 6, 2024

Thank you for the information 😊

swtb changed discussion status to closed May 6, 2024

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment