Update README.md
Browse files
README.md
CHANGED
|
@@ -106,6 +106,19 @@ python inference.py \
|
|
| 106 |
--qwen_model_path "Qwen/Qwen3-8B"
|
| 107 |
```
|
| 108 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 109 |
|
| 110 |
|
| 111 |
### 💻 Gradio Web UI
|
|
|
|
| 106 |
--qwen_model_path "Qwen/Qwen3-8B"
|
| 107 |
```
|
| 108 |
|
| 109 |
+
If you are running on a GPU with limited memory, you can use `inference_offload.py` to offload some components to the CPU:
|
| 110 |
+
|
| 111 |
+
```bash
|
| 112 |
+
python inference_offload.py \
|
| 113 |
+
--prompt "Urban Canvas Street Art Expo poster with bold graffiti-style lettering and dynamic colorful splashes" \
|
| 114 |
+
--enable_recap \
|
| 115 |
+
--num_inference_steps 28 \
|
| 116 |
+
--guidance_scale 3.5 \
|
| 117 |
+
--seed 42 \
|
| 118 |
+
--pipeline_path "black-forest-labs/FLUX.1-dev" \
|
| 119 |
+
--custom_transformer_path "PosterCraft/PosterCraft-v1_RL" \
|
| 120 |
+
--qwen_model_path "Qwen/Qwen3-8B"
|
| 121 |
+
```
|
| 122 |
|
| 123 |
|
| 124 |
### 💻 Gradio Web UI
|