MangaLMM / README.md
nielsr's picture
nielsr HF Staff
Add model card, link to paper and code
b8ce0bc verified
|
raw
history blame
331 Bytes
metadata
license: mit
library_name: transformers
pipeline_tag: image-text-to-text

This repository contains the MangaLMM model described in the paper MangaVQA and MangaLMM: A Benchmark and Specialized Model for Multimodal Manga Understanding.

Code: https://github.com/hal-utokyo/MangaLMM