Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Cyber-Blacat 's Collections
Vedio
2D-generation
LLM
3D
Ultimate-HQ-Datasets
Function-Space
Vision
Multimodal
Sound(ASR+TTS)

Multimodal

updated May 26
Upvote
-

  • BAAI/Emu3.5

    Any-to-Any • 34B • Updated Dec 25, 2025 • 169 • 172

  • HIT-TMG/Uni-MoE-2.0-Omni

    Any-to-Any • 33B • Updated Nov 24, 2025 • 44 • 36

  • HIT-TMG/Uni-MoE-2.0-Image

    Text-to-Image • 31B • Updated Nov 23, 2025 • 547 • 4

  • Yuanshi/ViBT

    Any-to-Any • Updated Dec 7, 2025 • 147 • 19

  • inclusionAI/LLaDA2.0-flash

    103B • Updated Dec 19, 2025 • 140 • 69

  • LiquidAI/LFM2.5-1.2B-Instruct

    Text Generation • 1B • Updated 14 days ago • 134k • 620

  • Glanty/Capybara

    Any-to-Any • Updated Feb 27 • 232

  • google/gemma-4-31B-it

    Image-Text-to-Text • 33B • Updated 27 days ago • 10.9M • • 3.09k

  • facebook/tribev2

    Updated Mar 27 • 48.2k • 622

  • nyu-visionx/Scale-RAE-Qwen1.5B_DiT2.4B

    Text Generation • 4B • Updated Jan 8 • 8.61k
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs