Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Uasonchen 's Collections
Video Understanding
Agent
Image Generation
Image Editing
Vision Foundation Model
Video Generation
MLLM
Open Math Data for LLM
Math Data Synthesis

MLLM

updated Oct 10, 2025
Upvote
-

  • Apriel-1.5-15b-Thinker

    Paper • 2510.01141 • Published Oct 1, 2025 • 123

  • MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

    Paper • 2509.21268 • Published Sep 25, 2025 • 104

  • LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

    Paper • 2509.00676 • Published Aug 31, 2025 • 85

  • Visual Representation Alignment for Multimodal Large Language Models

    Paper • 2509.07979 • Published Sep 9, 2025 • 84

  • InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

    Paper • 2508.18265 • Published Aug 25, 2025 • 216
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs