Instructions to use VAST-AI/TripoSG-scribble with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use VAST-AI/TripoSG-scribble with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("VAST-AI/TripoSG-scribble", dtype=torch.bfloat16, device_map="cuda") prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" image = pipe(prompt).images[0] - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -6,9 +6,9 @@ tags:
|
|
| 6 |
- 3d-generation
|
| 7 |
- rectified-flow
|
| 8 |
---
|
| 9 |
-
# TripoSG
|
| 10 |
|
| 11 |
-
TripoSG-scribble is a variant of TripoSG. TripoSG is a state-of-the-art image-to-3D generation foundation model that leverages large-scale rectified flow transformers to produce high-fidelity 3D shapes from single images.
|
| 12 |
|
| 13 |
## Model Description
|
| 14 |
|
|
@@ -20,7 +20,7 @@ TripoSG utilizes a novel architecture combining:
|
|
| 20 |
- Cross-attention mechanism for image feature condition
|
| 21 |
- 1.5B parameters operating on 2048 latent tokens
|
| 22 |
|
| 23 |
-
|
| 24 |
- TripoSG-scribble is a CFG-distilled model and should be used with CFG=0
|
| 25 |
- TripoSG-scribble is trained with 512 latent tokens
|
| 26 |
|
|
|
|
| 6 |
- 3d-generation
|
| 7 |
- rectified-flow
|
| 8 |
---
|
| 9 |
+
# TripoSG-scribble - Fast 3D Shape Prototyping with Scribble and Prompt
|
| 10 |
|
| 11 |
+
TripoSG-scribble converts a scribble image and a text prompt to a 3D shape. TripoSG-scribble is a variant of TripoSG. TripoSG is a state-of-the-art image-to-3D generation foundation model that leverages large-scale rectified flow transformers to produce high-fidelity 3D shapes from single images.
|
| 12 |
|
| 13 |
## Model Description
|
| 14 |
|
|
|
|
| 20 |
- Cross-attention mechanism for image feature condition
|
| 21 |
- 1.5B parameters operating on 2048 latent tokens
|
| 22 |
|
| 23 |
+
For inference efficiency, TripoSG-scribble is different from TripoSG in:
|
| 24 |
- TripoSG-scribble is a CFG-distilled model and should be used with CFG=0
|
| 25 |
- TripoSG-scribble is trained with 512 latent tokens
|
| 26 |
|