Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Smartground 's Collections
Audio
Speako
Play-Ground
Imagen
OCR
Data
Spatial
Code
Multimode

Multimode

updated Apr 3
Upvote
-

  • microsoft/Phi-4-multimodal-instruct

    Automatic Speech Recognition • 6B • Updated 17 days ago • 284k • 1.55k

  • ByteDance/Sa2VA-8B

    Image-Text-to-Text • 8B • Updated Sep 8 • 353 • 65
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs