Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,49 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: mit
|
| 3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: mit
|
| 3 |
+
pipeline_tag: image-to-3d
|
| 4 |
+
tags:
|
| 5 |
+
- triposg
|
| 6 |
+
- 3d-generation
|
| 7 |
+
- rectified-flow
|
| 8 |
+
---
|
| 9 |
+
# TripoSG - High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models
|
| 10 |
+
|
| 11 |
+
TripoSG-scribble is a variant of TripoSG. TripoSG is a state-of-the-art image-to-3D generation foundation model that leverages large-scale rectified flow transformers to produce high-fidelity 3D shapes from single images.
|
| 12 |
+
|
| 13 |
+
## Model Description
|
| 14 |
+
|
| 15 |
+
### Model Architecture
|
| 16 |
+
|
| 17 |
+
TripoSG utilizes a novel architecture combining:
|
| 18 |
+
- Rectified Flow (RF) based Transformer for stable, linear trajectory modeling
|
| 19 |
+
- Advanced VAE with SDF-based representation and hybrid geometric supervision
|
| 20 |
+
- Cross-attention mechanism for image feature condition
|
| 21 |
+
- 1.5B parameters operating on 2048 latent tokens
|
| 22 |
+
|
| 23 |
+
TripoSG-scribble accepts scribble image and text prompt condition. For inference efficiency, TripoSG-scribble is different from TripoSG in:
|
| 24 |
+
- TripoSG-scribble is a CFG-distilled model and should be used with CFG=0
|
| 25 |
+
- TripoSG-scribble is trained with 512 latent tokens
|
| 26 |
+
|
| 27 |
+
## Intended Uses
|
| 28 |
+
|
| 29 |
+
This model is designed for:
|
| 30 |
+
- Converting scribble image and text prompt to high-quality 3D meshes
|
| 31 |
+
- Creative and design applications
|
| 32 |
+
- Gaming and VFX asset creation
|
| 33 |
+
- Prototyping and visualization
|
| 34 |
+
|
| 35 |
+
## Requirements
|
| 36 |
+
|
| 37 |
+
- CUDA-capable GPU (>8GB VRAM)
|
| 38 |
+
|
| 39 |
+
## Usage
|
| 40 |
+
|
| 41 |
+
For detailed usage instructions, please visit our [GitHub repository](https://github.com/VAST-AI-Research/TripoSG).
|
| 42 |
+
|
| 43 |
+
## About
|
| 44 |
+
|
| 45 |
+
TripoSG-scribble is developed by [Tripo](https://www.tripo3d.ai), [VAST AI Research](https://github.com/orgs/VAST-AI-Research), pushing the boundaries of 3D Generative AI.
|
| 46 |
+
For more information:
|
| 47 |
+
- [GitHub Repository](https://github.com/VAST-AI-Research/TripoSG)
|
| 48 |
+
- [Paper](https://arxiv.org/abs/2502.06608)
|
| 49 |
+
- [Gradio Demo](https://huggingface.co/spaces/VAST-AI/TripoSG-scribble)
|