metadata
library_name: transformers
pipeline_tag: image-text-to-text
license: other
Nanodream 3(Preview) is an vision language model with a mixture-of-experts architecture (9B total parameters, 2B active). This model makes no compromises, delivering state-of-the-art visual reasoning while still retaining our efficient and deployment-friendly ethos.