Hal Lab UTokyo

university

Verified

https://www.hal.t.u-tokyo.ac.jp/lab/en/index_1.xhtml

AI & ML interests

None defined yet.

Recent Activity

AtsuMiyai updated a dataset 27 days ago

hal-utokyo/PaperWrite-Bench

AtsuMiyai authored a paper about 1 month ago

Paper Reconstruction Evaluation: Evaluating Presentation and Hallucination in AI-written Papers

AtsuMiyai published a dataset about 1 month ago

hal-utokyo/PaperWrite-Bench

View all activity

Papers

Jr. AI Scientist and Its Risk Report: Autonomous Scientific Exploration from a Baseline Paper

View all Papers

updated a dataset 27 days ago

hal-utokyo/PaperWrite-Bench

Viewer • Updated 27 days ago • 51 • 68 • 1

authored a paper about 1 month ago

Paper Reconstruction Evaluation: Evaluating Presentation and Hallucination in AI-written Papers

Paper • 2604.01128 • Published Apr 1 • 15

published a dataset about 1 month ago

hal-utokyo/PaperWrite-Bench

Viewer • Updated 27 days ago • 51 • 68 • 1

authored a paper 5 months ago

JMMMU-Pro: Image-based Japanese Multi-discipline Multimodal Understanding Benchmark via Vibe Benchmark Construction

Paper • 2512.14620 • Published Dec 16, 2025 • 2

submitted a paper to Daily Papers 5 months ago

JMMMU-Pro: Image-based Japanese Multi-discipline Multimodal Understanding Benchmark via Vibe Benchmark Construction

Paper • 2512.14620 • Published Dec 16, 2025 • 2

authored a paper 6 months ago

Jr. AI Scientist and Its Risk Report: Autonomous Scientific Exploration from a Baseline Paper

Paper • 2511.04583 • Published Nov 6, 2025 • 5

updated a dataset 6 months ago

hal-utokyo/Manga109

Preview • Updated Oct 30, 2025 • 131 • 21

updated a dataset 7 months ago

hal-utokyo/MangaVQA

Viewer • Updated Oct 6, 2025 • 526 • 1.35k • 1

authored a paper 8 months ago

ShinkaEvolve: Towards Open-Ended And Sample-Efficient Program Evolution

Paper • 2509.19349 • Published Sep 17, 2025 • 2

authored 4 papers 10 months ago

FoodMLLM-JP: Leveraging Multimodal Large Language Models for Japanese Recipe Generation

Paper • 2409.18459 • Published Sep 27, 2024

Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search

Paper • 2503.04412 • Published Mar 6, 2025 • 6

MangaVQA and MangaLMM: A Benchmark and Specialized Model for Multimodal Manga Understanding

Paper • 2505.20298 • Published May 26, 2025 • 9

ALE-Bench: A Benchmark for Long-Horizon Objective-Driven Algorithm Engineering

Paper • 2506.09050 • Published Jun 10, 2025 • 6

authored 2 papers 11 months ago

WebChoreArena: Evaluating Web Browsing Agents on Realistic Tedious Web Tasks

Paper • 2506.01952 • Published Jun 2, 2025 • 10

A Benchmark and Evaluation for Real-World Out-of-Distribution Detection Using Vision-Language Models

Paper • 2501.18463 • Published Jan 30, 2025 • 1

updated a model 11 months ago

hal-utokyo/MangaLMM

Image-Text-to-Text • 8B • Updated Jun 1, 2025 • 230 • 12

in hal-utokyo/MangaLMM 11 months ago

Add model card, link to paper and code

#1 opened 11 months ago by

authored a paper 12 months ago

MangaVQA and MangaLMM: A Benchmark and Specialized Model for Multimodal Manga Understanding

Paper • 2505.20298 • Published May 26, 2025 • 9

updated a dataset 12 months ago

hal-utokyo/Manga109-s

Preview • Updated May 23, 2025 • 94 • 28

published a model 12 months ago

hal-utokyo/MangaLMM

Image-Text-to-Text • 8B • Updated Jun 1, 2025 • 230 • 12