MLLM - a Uasonchen Collection

Uasonchen 's Collections

Video Understanding

Image Generation

Vision Foundation Model

Video Generation

Open Math Data for LLM

Math Data Synthesis

MLLM

updated Oct 10, 2025

Apriel-1.5-15b-Thinker

Paper • 2510.01141 • Published Oct 1, 2025 • 125
MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

Paper • 2509.21268 • Published Sep 25, 2025 • 104
LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31, 2025 • 85
Visual Representation Alignment for Multimodal Large Language Models

Paper • 2509.07979 • Published Sep 9, 2025 • 84
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 224