LuMatic's Collections
Polskie Modele
VLM Vision Models
STT. Multimodal
Image & Video Generation
LLMs
Code Models
TTS
Embedding RAG
Function Calling
WebGPU

Image & Video Generation

updated 13 days ago

  • PixArt-alpha/PixArt-XL-2-1024-MS

    Text-to-Image • Updated Nov 7, 2023 • 10.5k • 214

  • stabilityai/stable-video-diffusion-img2vid

    Image-to-Video • Updated Jul 10, 2024 • 51.8k • 1.03k

  • InstantMesh 📚

    Space • Running on Zero • Featured • 1.57k

    Create a 3D model from an image in 10 seconds!


  • GenAI Arena 📈

    Space • Build error • 297

    Realtime Image/Video Gen AI Arena


  • moondream2 🌔

    Space • Running • 442

    a tiny vision language model


  • microsoft/Florence-2-large

    Image-Text-to-Text • 0.8B • Updated Aug 4, 2025 • 481k • 1.81k

  • stabilityai/stable-diffusion-3.5-medium

    Text-to-Image • Updated Oct 31, 2024 • 363k • 943

  • Qwen Image Layered 🚀

    Space • Running on Zero • Featured • 513

    Decompose images into editable layers


  • black-forest-labs/FLUX.2-dev

    Image-to-Image • Updated Feb 17 • 240k • 1.68k

  • Qwen/Qwen-Image-2512

    Text-to-Image • Updated Dec 31, 2025 • 107k • 846