Text-to-Image
Diffusers
Safetensors
StableDiffusionPipeline
stable-diffusion
stable-diffusion-diffusers
Instructions to use CompVis/stable-diffusion-v1-4 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use CompVis/stable-diffusion-v1-4 with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("CompVis/stable-diffusion-v1-4", dtype=torch.bfloat16, device_map="cuda") prompt = "A high tech solarpunk utopia in the Amazon rainforest" image = pipe(prompt).images[0] - Notebooks
- Google Colab
- Kaggle
- Local Apps
- Draw Things
- DiffusionBee
Update README.md
#166
by Dixonbuttsdrivemenuts - opened
README.md
CHANGED
|
@@ -238,11 +238,7 @@ Using the model to generate content that is cruel to individuals is a misuse of
|
|
| 238 |
- The model does not perform well on more difficult tasks which involve compositionality, such as rendering an image corresponding to “A red cube on top of a blue sphere”
|
| 239 |
- Faces and people in general may not be generated properly.
|
| 240 |
- The model was trained mainly with English captions and will not work as well in other languages.
|
| 241 |
-
- The autoencoding part of the model is lossy
|
| 242 |
-
- The model was trained on a large-scale dataset
|
| 243 |
-
[LAION-5B](https://laion.ai/blog/laion-5b/) which contains adult material
|
| 244 |
-
and is not fit for product use without additional safety mechanisms and
|
| 245 |
-
considerations.
|
| 246 |
- No additional measures were used to deduplicate the dataset. As a result, we observe some degree of memorization for images that are duplicated in the training data.
|
| 247 |
The training data can be searched at [https://rom1504.github.io/clip-retrieval/](https://rom1504.github.io/clip-retrieval/) to possibly assist in the detection of memorized images.
|
| 248 |
|
|
|
|
| 238 |
- The model does not perform well on more difficult tasks which involve compositionality, such as rendering an image corresponding to “A red cube on top of a blue sphere”
|
| 239 |
- Faces and people in general may not be generated properly.
|
| 240 |
- The model was trained mainly with English captions and will not work as well in other languages.
|
| 241 |
+
- The autoencoding part of the model is lossy.
|
|
|
|
|
|
|
|
|
|
|
|
|
| 242 |
- No additional measures were used to deduplicate the dataset. As a result, we observe some degree of memorization for images that are duplicated in the training data.
|
| 243 |
The training data can be searched at [https://rom1504.github.io/clip-retrieval/](https://rom1504.github.io/clip-retrieval/) to possibly assist in the detection of memorized images.
|
| 244 |
|