Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
mirnaresearch 's Collections
Multimodal Models

Multimodal Models

updated Feb 4
Upvote
-

  • microsoft/Phi-4-multimodal-instruct

    Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 313k • 1.57k

  • zai-org/GLM-OCR

    Image-to-Text • Updated 8 days ago • 2.93M • • 1.38k
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs