TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models
Paper • 2502.06608 • Published • 39
import torch
from diffusers import DiffusionPipeline
# switch to "mps" for apple devices
pipe = DiffusionPipeline.from_pretrained("VAST-AI/TripoSG-scribble", dtype=torch.bfloat16, device_map="cuda")
prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k"
image = pipe(prompt).images[0]TripoSG-scribble converts a scribble image and a text prompt to a 3D shape. TripoSG-scribble is a variant of TripoSG. TripoSG is a state-of-the-art image-to-3D generation foundation model that leverages large-scale rectified flow transformers to produce high-fidelity 3D shapes from single images.
TripoSG utilizes a novel architecture combining:
For inference efficiency, TripoSG-scribble is different from TripoSG in:
This model is designed for:
For detailed usage instructions, please visit our GitHub repository.
TripoSG-scribble is developed by Tripo, VAST AI Research, pushing the boundaries of 3D Generative AI. For more information: