Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
kasper-boy 's Collections
Zero-Shot Detection and Segmentation
MultiLanguageTranslato & Text Summarizer
CloneVoiceAI & Audio denoising-enhancement
LMMs - Large Multimodal Models
OpenAI Vision API-Keys
GPT-4O multi-chart
Image To Audio
OCR
prompt to image
Images-video-portrait
MusicGen
Image-enhancer

LMMs - Large Multimodal Models

updated Jun 6, 2024
Upvote
-

  • Runtime error
    Agents
    Featured
    428

    LLaVA

    🔥
    428

    Chat with an AI assistant using text and images


  • Runtime error
    Agents
    Featured
    886

    MiniGPT-4

    🚀
    886


  • Runtime error
    Agents
    Featured
    308

    Fuyu Multimodal

    👁
    308


  • Paused
    Agents
    Featured
    146

    Idefics 8b

    🐠
    146

    Generate text from images and prompts


  • Runtime error
    Agents
    166

    CogVLM

    📊
    166

    Answer questions about uploaded images using natural language

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs