Alpha-VLLM
/

Lumina-Next-SFT-diffusers

Model card Files Files and versions

PommesPeter commited on Jun 21, 2024

Commit

89c6ceb

·

verified ·

1 Parent(s): d41c126

Update README.md

Files changed (1) hide show

README.md +17 -13

README.md CHANGED Viewed

@@ -43,7 +43,7 @@ More checkpoints of our model will be released soon~
 | Resolution | Next-DiT Parameter| Text Encoder | Prediction | Download URL  |
 | ---------- | ----------------------- | ------------ | -----------|-------------- |
-| 1024       | 2B             |    [Gemma-2B](https://huggingface.co/google/gemma-2b)  |   Rectified Flow | [hugging face](https://huggingface.co/Alpha-VLLM/Lumina-Next-SFT) |
 ## Installation
@@ -51,23 +51,23 @@ More checkpoints of our model will be released soon~
 Note: You may want to adjust the CUDA version [according to your driver version](https://docs.nvidia.com/deploy/cuda-compatibility/#default-to-minor-version).
-  ```bash
-  conda create -n Lumina_T2X -y
-  conda activate Lumina_T2X
-  conda install python=3.11 pytorch==2.1.0 torchvision==0.16.0 torchaudio==2.1.0 pytorch-cuda=12.1 -c pytorch -c nvidia -y
-  ```
 ### 2. Install dependencies
-  ```bash
-  pip install diffusers huggingface_hub
-  ```
 ### 3. Install ``flash-attn``
-  ```bash
-  pip install flash-attn --no-build-isolation
-  ```
 ## Inference
@@ -86,7 +86,11 @@ huggingface-cli download --resume-download Alpha-VLLM/Lumina-Next-SFT-diffusers
 from diffusers import LuminaText2ImgPipeline
 import torch
-pipeline = LuminaText2ImgPipeline.from_pretrained("/mnt/hdd1/xiejunlin/checkpoints/Lumina-Next-SFT-diffusers", torch_dtype=torch.bfloat16).to("cuda")
 image = pipeline(prompt="Upper body of a young woman in a Victorian-era outfit with brass goggles and leather straps. "
                         "Background shows an industrial revolution cityscape with smoky skies and tall, metal structures"

 | Resolution | Next-DiT Parameter| Text Encoder | Prediction | Download URL  |
 | ---------- | ----------------------- | ------------ | -----------|-------------- |
+| 1024  | 2B  | [Gemma-2B](https://huggingface.co/google/gemma-2b)  | Rectified Flow | [hugging face](https://huggingface.co/Alpha-VLLM/Lumina-Next-SFT) |
 ## Installation
 Note: You may want to adjust the CUDA version [according to your driver version](https://docs.nvidia.com/deploy/cuda-compatibility/#default-to-minor-version).
+```bash
+conda create -n Lumina_T2X -y
+conda activate Lumina_T2X
+conda install python=3.11 pytorch==2.1.0 torchvision==0.16.0 torchaudio==2.1.0 pytorch-cuda=12.1 -c pytorch -c nvidia -y
+```
 ### 2. Install dependencies
+```bash
+pip install diffusers huggingface_hub
+```
 ### 3. Install ``flash-attn``
+```bash
+pip install flash-attn --no-build-isolation
+```
 ## Inference
 from diffusers import LuminaText2ImgPipeline
 import torch
+pipeline = LuminaText2ImgPipeline.from_pretrained("/path/to/ckpt/Lumina-Next-SFT-diffusers", torch_dtype=torch.bfloat16).to("cuda")
+# or you can download the model using code directly
+# pipeline = LuminaText2ImgPipeline.from_pretrained("Alpha-VLLM/Lumina-Next-SFT-diffusers", torch_dtype=torch.bfloat16).to("cuda")
 image = pipeline(prompt="Upper body of a young woman in a Victorian-era outfit with brass goggles and leather straps. "
                         "Background shows an industrial revolution cityscape with smoky skies and tall, metal structures"