hal-utokyo
/

MangaLMM

Image-Text-to-Text

text-generation-inference

Model card Files Files and versions

MangaLMM / README.md

nielsr's picture

nielsr HF Staff

Add model card, link to paper and code

b8ce0bc verified 11 months ago

|

331 Bytes

license: mit
library_name: transformers
pipeline_tag: image-text-to-text

This repository contains the MangaLMM model described in the paper MangaVQA and MangaLMM: A Benchmark and Specialized Model for Multimodal Manga Understanding.

Code: https://github.com/hal-utokyo/MangaLMM