--- license: mit library_name: transformers pipeline_tag: image-text-to-text ---

Bo Li*1Yuanhan Zhang*,1Liangyu Chen*,1Jinghao Wang*,1Fanyi Pu*,1
Jingkang Yang1Chunyuan Li2Ziwei Liu1
1S-Lab, Nanyang Technological University  2Microsoft Research, Redmond
This repository contains the models presented in [Otter: A Multi-Modal Model with In-Context Instruction Tuning](https://huggingface.co/papers/2305.03726). You can refer the code to start evaluation and demo on your local machine. https://github.com/Luodian/Otter/blob/8b386816ec67b15833cde3dcd1d7ca6a752d2451/pipeline/demos/demo_models.py#L35