Update lcm/README

lcm/README.md (+7 -5)
@@ -6,17 +6,18 @@ language:
 tags:
 - stable-diffusion
 - stable-diffusion-xl
+- stable-diffusion-xl-lcm
 - tensorrt
 - text-to-image
 ---

-# Stable Diffusion XL 1.0 TensorRT
+# Stable Diffusion XL 1.0 LCM TensorRT

 ## Introduction

 This repository hosts the Latent Consistency Model(LCM) TensorRT versions of **Stable Diffusion XL 1.0** created in collaboration with [NVIDIA](https://huggingface.co/nvidia). The optimized versions give substantial improvements in speed and efficiency.

-See the [usage instructions](#usage-example) for how to run the SDXL pipeline with the ONNX files hosted in this repository.
+See the [usage instructions](#usage-example) for how to run the SDXL pipeline with the ONNX files hosted in this repository.

 ## Model Description

@@ -26,7 +27,7 @@ See the [usage instructions](#usage-example) for how to run the SDXL pipeline wi
 - **Model Description:** This is a Latent Consistency Model (LCM) version of the [SDXL base 1.0](https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0) and [SDXL refiner 1.0](https://huggingface.co/stabilityai/stable-diffusion-xl-refiner-1.0) models for [NVIDIA TensorRT](https://developer.nvidia.com/tensorrt) optimized inference


-## Performance
+## Performance

 #### Timings for 4 steps at 1024x1024

@@ -38,7 +39,7 @@ See the [usage instructions](#usage-example) for how to run the SDXL pipeline wi

 ## Usage Example

-1. Following the [setup instructions](https://github.com/rajeevsrao/TensorRT/blob/release/
+1. Following the [setup instructions](https://github.com/rajeevsrao/TensorRT/blob/release/9.2/demo/Diffusion/README.md) on launching a TensorRT NGC container.
 ```shell
 git clone https://github.com/rajeevsrao/TensorRT.git
 cd TensorRT
@@ -50,7 +51,7 @@ docker run --rm -it --gpus all -v $PWD:/workspace nvcr.io/nvidia/pytorch:23.11-p
 ```shell
 git lfs install
 git clone https://huggingface.co/stabilityai/stable-diffusion-xl-1.0-tensorrt
-cd stable-diffusion-xl-1.0-tensorrt
+cd stable-diffusion-xl-1.0-tensorrt/lcm
 git lfs pull
 cd ..
 ```
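Note on the hunk above: its header quotes the container launch command from the setup step but truncates the image tag at `pytorch:23.11-p`. As a hedged reconstruction of that step, the sketch below assumes the standard NGC tag suffix (`-py3`) and an interactive shell; neither is confirmed by this diff.

```shell
# Hedged sketch of the NGC container launch that the hunk header above
# truncates. The full tag "23.11-py3" and the trailing /bin/bash are assumptions.
docker run --rm -it --gpus all -v $PWD:/workspace nvcr.io/nvidia/pytorch:23.11-py3 /bin/bash
```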
@@ -64,6 +65,7 @@ python3 -m pip install --pre --upgrade --extra-index-url https://pypi.nvidia.com
 ```

 4. Perform TensorRT optimized inference
+* The first invocation produces plan files in --engine-dir specific to the accelerator being run on and are reused for later invocations.
 ```
 python3 demo_txt2img_xl.py \
 "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" \
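The inference command in the final hunk is cut off after the prompt argument. The sketch below shows what a complete LCM invocation might look like. Only `--engine-dir` is attested in this diff (in the newly added note), and the 4-step count comes from the Performance heading; the remaining flags and all paths are assumptions modeled on the TensorRT demo/Diffusion scripts.

```shell
# Hedged sketch: only --engine-dir appears in the diff itself; the other flags
# and every path here are assumptions based on TensorRT's demo/Diffusion scripts.
python3 demo_txt2img_xl.py \
  "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" \
  --version xl-1.0 \
  --onnx-dir /workspace/stable-diffusion-xl-1.0-tensorrt/lcm \
  --engine-dir /workspace/engine-xl-lcm \
  --denoising-steps 4 \
  --guidance-scale 0.0
```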
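A usage note on the line this commit adds: the plan files written to `--engine-dir` on the first run are specific to the accelerator they were built on, so they are reused by later runs on the same machine but should be rebuilt per GPU. One way to keep that straight, as a hedged sketch (the per-GPU directory naming is purely illustrative):

```shell
# Hedged sketch: keep one engine directory per GPU model, since plan files are
# accelerator-specific (per the note added in this commit) yet reusable across runs.
GPU_NAME=$(nvidia-smi --query-gpu=name --format=csv,noheader | head -n 1 | tr ' /' '--')
python3 demo_txt2img_xl.py \
  "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" \
  --engine-dir "/workspace/engines-${GPU_NAME}"
```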