Multimodal Embedding Gemma Incoming?

#37

by LorryG - opened Jan 10

Jan 10

I see that T5 Gemma 2 is multimodal, using a SigLip image encoder. How likely are we to get a multimodal early-fusion embedding model?

pannaga10

Google org Jan 13

•

edited Jan 13

Hi @LorryG
While we can't comment on the likelihood of specific future releases, your feedback regarding early-fusion models has been forwarded to our developers.Thanks

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment