File size: 406 Bytes
95a0cb5
 
 
 
 
 
 
 
a556fbc
 
1
2
3
4
5
6
7
8
9
10
---
license: mit
library_name: transformers
pipeline_tag: image-text-to-text
---

This repository contains the MangaLMM model described in the paper [MangaVQA and MangaLMM: A Benchmark and Specialized Model for Multimodal Manga Understanding](https://huggingface.co/papers/2505.20298).

Code: https://github.com/manga109/MangaLMM <br>
Official demo: https://huggingface.co/spaces/yuki-imajuku/MangaLMM-Demo