Image-to-3D
Diffusers
Safetensors
MIDIPipeline
How to use from the
Use from the
Diffusers library
pip install -U diffusers transformers accelerate
import torch
from diffusers import DiffusionPipeline

# switch to "mps" for apple devices
pipe = DiffusionPipeline.from_pretrained("VAST-AI/MIDI-3D", dtype=torch.bfloat16, device_map="cuda")

prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k"
image = pipe(prompt).images[0]

MIDI-3D

MIDI is a 3D generative model for single image to compositional 3D scene generation. It was introduced in the paper MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation.

Project page: https://huanngzh.github.io/MIDI-Page/

Code: https://github.com/VAST-AI-Research/MIDI-3D

Downloads last month
46
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Spaces using VAST-AI/MIDI-3D 4

Paper for VAST-AI/MIDI-3D