Text-to-3D
Diffusers
Safetensors
English
StableDiffusionLDM3DPipeline
stable-diffusion
stable-diffusion-diffusers
text-to-image
text-to-panoramic
Eval Results (legacy)
Instructions to use Intel/ldm3d-pano with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use Intel/ldm3d-pano with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("Intel/ldm3d-pano", dtype=torch.bfloat16, device_map="cuda") prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" image = pipe(prompt).images[0] - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -19,7 +19,9 @@ LDM3D got accepted to [CVPRW'23]([https://aaai.org/Conferences/AAAI-23/](https:/
|
|
| 19 |
This checkpoint finetunes the previous [ldm3d-4c](https://huggingface.co/Intel/ldm3d-4c) on 2 panoramic-images datasets:
|
| 20 |
- [polyhaven](https://polyhaven.com/): 585 images for the training set, 66 images for the validation set
|
| 21 |
- [ihdri](https://www.ihdri.com/hdri-skies-outdoor/): 57 outdoor images for the training set, 7 outdoor images for the validation set.
|
|
|
|
| 22 |
These datasets were augmented using [Text2Light](https://frozenburning.github.io/projects/text2light/) to create a dataset containing 13852 training samples and 1606 validation samples.
|
|
|
|
| 23 |
In order to generate the depth map of those samples, we used [DPT-large](https://github.com/isl-org/MiDaS) and to generate the caption we used [BLIP-2](https://huggingface.co/docs/transformers/main/model_doc/blip-2)
|
| 24 |
|
| 25 |
A demo using this checkpoint has been open sourced in [this space](https://huggingface.co/spaces/Intel/ldm3d)
|
|
|
|
| 19 |
This checkpoint finetunes the previous [ldm3d-4c](https://huggingface.co/Intel/ldm3d-4c) on 2 panoramic-images datasets:
|
| 20 |
- [polyhaven](https://polyhaven.com/): 585 images for the training set, 66 images for the validation set
|
| 21 |
- [ihdri](https://www.ihdri.com/hdri-skies-outdoor/): 57 outdoor images for the training set, 7 outdoor images for the validation set.
|
| 22 |
+
|
| 23 |
These datasets were augmented using [Text2Light](https://frozenburning.github.io/projects/text2light/) to create a dataset containing 13852 training samples and 1606 validation samples.
|
| 24 |
+
|
| 25 |
In order to generate the depth map of those samples, we used [DPT-large](https://github.com/isl-org/MiDaS) and to generate the caption we used [BLIP-2](https://huggingface.co/docs/transformers/main/model_doc/blip-2)
|
| 26 |
|
| 27 |
A demo using this checkpoint has been open sourced in [this space](https://huggingface.co/spaces/Intel/ldm3d)
|