---
base_model: stable-diffusion-v1-5/stable-diffusion-v1-5
library_name: diffusers
license: creativeml-openrail-m
inference: true
tags:
- stable-diffusion
- stable-diffusion-diffusers
- text-to-image
- diffusers
- diffusers-training
---



# Text-to-image finetuning - elephantmipt/test_tuned_sd_15

This pipeline was finetuned from **stable-diffusion-v1-5/stable-diffusion-v1-5**. The training script recorded the dataset name as **None**, so the training data is not identified in this card. The example images below were generated with the finetuned pipeline from the following prompts:

- `IMAGE_ TYPE Cocktail Photography  GENRE Coktail Shooting Lowlight  EMOTION I want to drink it  SCENE A beautiful and refreshing glass of a drink called lychee spritz , decorated set against a dreamy background lowlight, fitting to the image  ACTORS None  LOCATION TYPE Studio  CAMERA MODEL Nikon D850  CAMERA LENSE 60mm f 2. 8 Macro  SPECIAL EFFECTS Dreamy bokeh  TIME_ OF_ DAY Studio lighting  INTERACTION None `
- `Gandalf, Saruman, Radagast. Blue Wizards perform a captivating magic ritual intense focus, vibrant colors swirl like airborne gas. Mystical pentagram unites them. `
- `wide shot, desert, wall, nature, fuchsia pink, brick red, ochre yellow, pale pink, chipotle orange `
- `disney pixar style character, dodge challenger srt hellcat illustration drifting under the ocean, cartoon, super detail, no text, 8k, render 3d, wide view vision `
- `wide shoot of a typical farm in rural surroundings, near a clear water lake, beautiful flowers blooming , forest, saplings, moss, beautiful, epic lighting, ultrasharp, nikon 12mm f15 `
- `dramtic sky backgraund `
- `underwater lake, dusk, scarry, blue green bright shining, deep water, nessi, lake ness`
- `Darkside Anakin Skywalker played by young Hayden Christensen with sith eyes, and a red lightsaber, hyperrealistic, cinematic, professional photo lighting, intricately detailed, cinematic lighting, 8k, ultra  detailed, ultra  realistic, photorealistic, camera Leica m11 quality with 30mm lens `

![val_imgs_grid](./val_imgs_grid.png)


## Pipeline usage

You can use the pipeline like so:

```python
import torch
from diffusers import DiffusionPipeline

# Load the finetuned weights in half precision.
pipeline = DiffusionPipeline.from_pretrained("elephantmipt/test_tuned_sd_15", torch_dtype=torch.float16)
# float16 inference generally needs a GPU; this assumes a CUDA device is available.
pipeline = pipeline.to("cuda")

prompt = "IMAGE_ TYPE Cocktail Photography  GENRE Coktail Shooting Lowlight  EMOTION I want to drink it  SCENE A beautiful and refreshing glass of a drink called lychee spritz , decorated set against a dreamy background lowlight, fitting to the image  ACTORS None  LOCATION TYPE Studio  CAMERA MODEL Nikon D850  CAMERA LENSE 60mm f 2. 8 Macro  SPECIAL EFFECTS Dreamy bokeh  TIME_ OF_ DAY Studio lighting  INTERACTION None "
image = pipeline(prompt).images[0]  # the call returns a list of PIL images; take the first
image.save("my_image.png")
```

## Training info

These are the key hyperparameters used during training:

* Epochs: 14
* Learning rate: 8e-05
* Batch size: 20
* Gradient accumulation steps: 1
* Image resolution: 512
* Mixed-precision: bf16
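
As a rough illustration of how the batch size and gradient accumulation values above combine, the sketch below computes the number of samples contributing to each optimizer step. It assumes the listed batch size is per device and that training ran in a single process; neither is stated in this card, and `effective_batch_size` is a hypothetical helper, not part of the training script.

```python
def effective_batch_size(per_device_batch: int,
                         grad_accum_steps: int,
                         num_processes: int = 1) -> int:
    """Number of samples that contribute to one optimizer step."""
    return per_device_batch * grad_accum_steps * num_processes


# Batch size 20 and 1 accumulation step come from the list above.
# num_processes=1 is an assumption: the device count is not given in this card.
print(effective_batch_size(per_device_batch=20, grad_accum_steps=1))  # 20
```

If training actually used several devices, pass that count as `num_processes` to get the true per-step sample count.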


More information on the CLI arguments and the training environment is available on the [`wandb` run page](https://wandb.ai/harmless_ai/alchemist/runs/qspja0u3).


## Intended uses & limitations

#### How to use

```python
# See the "Pipeline usage" section above for a minimal example of running this pipeline.
```
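
For reproducible sampling, a minimal sketch along the following lines may be useful. It fixes the random seed and generates at 512×512, matching the training resolution listed under "Training info". The step count, guidance scale, seed, and output filename are illustrative assumptions rather than settings recommended by the model author, and `sampling_kwargs` is a hypothetical helper.

```python
def sampling_kwargs(steps: int = 30, guidance_scale: float = 7.5, size: int = 512) -> dict:
    """Keyword arguments for the pipeline call (default values are assumptions)."""
    # Stable Diffusion pipelines require height and width divisible by 8.
    if size % 8 != 0:
        raise ValueError("size must be a multiple of 8")
    return {
        "num_inference_steps": steps,
        "guidance_scale": guidance_scale,
        "height": size,
        "width": size,
    }


def main() -> None:
    # Heavy imports live here so the helper above can be imported without them.
    import torch
    from diffusers import DiffusionPipeline

    # Fall back to float32 on CPU, where float16 is poorly supported.
    device = "cuda" if torch.cuda.is_available() else "cpu"
    dtype = torch.float16 if device == "cuda" else torch.float32

    pipeline = DiffusionPipeline.from_pretrained(
        "elephantmipt/test_tuned_sd_15", torch_dtype=dtype
    ).to(device)

    # A seeded generator makes repeated runs on the same device start from the same noise.
    generator = torch.Generator(device=device).manual_seed(0)

    prompt = "wide shot, desert, wall, nature, fuchsia pink, brick red, ochre yellow, pale pink, chipotle orange "
    image = pipeline(prompt, generator=generator, **sampling_kwargs()).images[0]
    image.save("desert_wall.png")


if __name__ == "__main__":
    main()
```

The prompt is one of the validation prompts listed at the top of this card; results can still differ across hardware and library versions even with a fixed seed.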

#### Limitations and bias

[TODO: provide examples of latent issues and potential remediations]

## Training details

[TODO: describe the data used to train the model]