Sup3r AlexWortega committed on
Commit
93d4ec3
·
0 Parent(s):

Duplicate from AlexWortega/Kandinsky2.0

Co-authored-by: Wortega <AlexWortega@users.noreply.huggingface.co>

Files changed (6)
  1. .gitattributes +34 -0
  2. NatallE.png +0 -0
  3. README.md +13 -0
  4. app.py +215 -0
  5. packages.txt +3 -0
  6. requirements.txt +3 -0
.gitattributes ADDED
@@ -0,0 +1,34 @@
+ *.7z filter=lfs diff=lfs merge=lfs -text
+ *.arrow filter=lfs diff=lfs merge=lfs -text
+ *.bin filter=lfs diff=lfs merge=lfs -text
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
+ *.ftz filter=lfs diff=lfs merge=lfs -text
+ *.gz filter=lfs diff=lfs merge=lfs -text
+ *.h5 filter=lfs diff=lfs merge=lfs -text
+ *.joblib filter=lfs diff=lfs merge=lfs -text
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
+ *.model filter=lfs diff=lfs merge=lfs -text
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
+ *.npy filter=lfs diff=lfs merge=lfs -text
+ *.npz filter=lfs diff=lfs merge=lfs -text
+ *.onnx filter=lfs diff=lfs merge=lfs -text
+ *.ot filter=lfs diff=lfs merge=lfs -text
+ *.parquet filter=lfs diff=lfs merge=lfs -text
+ *.pb filter=lfs diff=lfs merge=lfs -text
+ *.pickle filter=lfs diff=lfs merge=lfs -text
+ *.pkl filter=lfs diff=lfs merge=lfs -text
+ *.pt filter=lfs diff=lfs merge=lfs -text
+ *.pth filter=lfs diff=lfs merge=lfs -text
+ *.rar filter=lfs diff=lfs merge=lfs -text
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
+ *.tflite filter=lfs diff=lfs merge=lfs -text
+ *.tgz filter=lfs diff=lfs merge=lfs -text
+ *.wasm filter=lfs diff=lfs merge=lfs -text
+ *.xz filter=lfs diff=lfs merge=lfs -text
+ *.zip filter=lfs diff=lfs merge=lfs -text
+ *.zst filter=lfs diff=lfs merge=lfs -text
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
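Reviewer note: the `.gitattributes` rules route matching files through Git LFS. The sketch below is a rough way to check which paths a few of these patterns would catch. It is an approximation only: `is_lfs_tracked` is a hypothetical helper, it uses Python's `fnmatch` rather than Git's own matcher, it only handles patterns without a slash (which Git matches against the file name), and it covers a subset of the patterns in this file.

```python
from fnmatch import fnmatchcase
from pathlib import PurePosixPath

# A subset of the slash-free patterns from the .gitattributes in this commit.
LFS_PATTERNS = ["*.bin", "*.ckpt", "*.pt", "*.safetensors", "*.zip", "*tfevents*"]


def is_lfs_tracked(path: str, patterns=LFS_PATTERNS) -> bool:
    """Approximate check: match the file name (not the full path) against each pattern."""
    name = PurePosixPath(path).name
    return any(fnmatchcase(name, pattern) for pattern in patterns)


if __name__ == "__main__":
    for candidate in ["weights/unet.ckpt", "NatallE.png", "app.py"]:
        print(candidate, is_lfs_tracked(candidate))
```

Under these assumptions a checkpoint such as `weights/unet.ckpt` is matched, while `NatallE.png` and `app.py` are not, since no `*.png` or `*.py` rule appears in the list.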
NatallE.png ADDED
README.md ADDED
@@ -0,0 +1,13 @@
+ ---
+ title: Kandinsky2.0
+ emoji: 📉
+ colorFrom: indigo
+ colorTo: green
+ sdk: gradio
+ sdk_version: 3.11.0
+ app_file: app.py
+ pinned: false
+ duplicated_from: AlexWortega/Kandinsky2.0
+ ---
+
+ Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
app.py ADDED
@@ -0,0 +1,215 @@
+
+ import gradio as gr
+ import torch
+ from torch import autocast
+ from kandinsky2 import get_kandinsky2
+ device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')  # fall back to CPU when no GPU is available
+ model = get_kandinsky2(device, task_type='text2img')
+
+
+
+
+ def infer(prompt):  # returns a batch of 4 generated 512x512 images for the prompt
+     images = model.generate_text2img(prompt, batch_size=4, h=512, w=512, num_steps=75, denoised_type='dynamic_threshold', dynamic_threshold_v=99.5, sampler='ddim_sampler', ddim_eta=0.05, guidance_scale=10)
+     return images
+
+ css = """
+     .gradio-container {
+         font-family: 'IBM Plex Sans', sans-serif;
+     }
+     .gr-button {
+         color: white;
+         border-color: black;
+         background: black;
+     }
+     input[type='range'] {
+         accent-color: black;
+     }
+     .dark input[type='range'] {
+         accent-color: #dfdfdf;
+     }
+     .container {
+         max-width: 730px;
+         margin: auto;
+         padding-top: 1.5rem;
+     }
+     #gallery {
+         min-height: 22rem;
+         margin-bottom: 15px;
+         margin-left: auto;
+         margin-right: auto;
+         border-bottom-right-radius: .5rem !important;
+         border-bottom-left-radius: .5rem !important;
+     }
+     #gallery>div>.h-full {
+         min-height: 20rem;
+     }
+     .details:hover {
+         text-decoration: underline;
+     }
+     .gr-button {
+         white-space: nowrap;
+     }
+     .gr-button:focus {
+         border-color: rgb(147 197 253 / var(--tw-border-opacity));
+         outline: none;
+         box-shadow: var(--tw-ring-offset-shadow), var(--tw-ring-shadow), var(--tw-shadow, 0 0 #0000);
+         --tw-border-opacity: 1;
+         --tw-ring-offset-shadow: var(--tw-ring-inset) 0 0 0 var(--tw-ring-offset-width) var(--tw-ring-offset-color);
+         --tw-ring-shadow: var(--tw-ring-inset) 0 0 0 calc(3px + var(--tw-ring-offset-width)) var(--tw-ring-color);
+         --tw-ring-color: rgb(191 219 254 / var(--tw-ring-opacity));
+         --tw-ring-opacity: .5;
+     }
+     #advanced-btn {
+         font-size: .7rem !important;
+         line-height: 19px;
+         margin-top: 12px;
+         margin-bottom: 12px;
+         padding: 2px 8px;
+         border-radius: 14px !important;
+     }
+     #advanced-options {
+         display: none;
+         margin-bottom: 20px;
+     }
+     .footer {
+         margin-bottom: 45px;
+         margin-top: 35px;
+         text-align: center;
+         border-bottom: 1px solid #e5e5e5;
+     }
+     .footer>p {
+         font-size: .8rem;
+         display: inline-block;
+         padding: 0 10px;
+         transform: translateY(10px);
+         background: white;
+     }
+     .dark .footer {
+         border-color: #303030;
+     }
+     .dark .footer>p {
+         background: #0b0f19;
+     }
+     .acknowledgments h4{
+         margin: 1.25em 0 .25em 0;
+         font-weight: bold;
+         font-size: 115%;
+     }
+     #container-advanced-btns{
+         display: flex;
+         flex-wrap: wrap;
+         justify-content: space-between;
+         align-items: center;
+     }
+     .animate-spin {
+         animation: spin 1s linear infinite;
+     }
+     @keyframes spin {
+         from {
+             transform: rotate(0deg);
+         }
+         to {
+             transform: rotate(360deg);
+         }
+     }
+     #share-btn-container {
+         display: flex; padding-left: 0.5rem !important; padding-right: 0.5rem !important; background-color: #000000; justify-content: center; align-items: center; border-radius: 9999px !important; width: 13rem;
+     }
+     #share-btn {
+         all: initial; color: #ffffff;font-weight: 600; cursor:pointer; font-family: 'IBM Plex Sans', sans-serif; margin-left: 0.5rem !important; padding-top: 0.25rem !important; padding-bottom: 0.25rem !important;
+     }
+     #share-btn * {
+         all: unset;
+     }
+     .gr-form{
+         flex: 1 1 50%; border-top-right-radius: 0; border-bottom-right-radius: 0;
+     }
+     #prompt-container{
+         gap: 0;
+     }
+     #generated_id{
+         min-height: 700px
+     }
+ """
+ block = gr.Blocks(css=css)
+
+ examples = [
+     [
+         'Красная площадь'  # Russian: "Red Square"
+     ],
+     [
+         'Thinking man in anime style'
+     ],
+     [
+         'אבוקדו'  # Hebrew: "avocado"
+     ],
+ ]
+
+ with block as demo:
+     gr.Markdown("""
+
+
+ [![Framework: PyTorch](https://img.shields.io/badge/Framework-PyTorch-orange.svg)](https://pytorch.org/) [![Huggingface space](https://img.shields.io/badge/🤗-Huggingface-yello.svg)](https://huggingface.co/sberbank-ai/Kandinsky_2.0)
+
+
+
+ ## Model architecture:
+
+ It is a latent diffusion model with two multilingual text encoders:
+ * mCLIP-XLMR 560M parameters
+ * mT5-encoder-small 146M parameters
+
+ Together, these encoders and the multilingual training datasets enable text-to-image generation from prompts in many languages.
+
+ **Kandinsky 2.0** was trained on a large 1B multilingual set, including samples that we used to train Kandinsky.
+
+ In terms of diffusion architecture, Kandinsky 2.0 implements a UNet with 1.2B parameters.
+
+ **Kandinsky 2.0** architecture overview:
+ ![](NatallE.png)
+
+     """
+     )
+     with gr.Group():
+         with gr.Box():
+             with gr.Row().style(mobile_collapse=False, equal_height=True):
+
+                 text = gr.Textbox(
+                     label="Enter your prompt", show_label=False, max_lines=1
+                 ).style(
+                     border=(True, False, True, True),
+                     rounded=(True, False, False, True),
+                     container=False,
+                 )
+                 btn = gr.Button("Run").style(
+                     margin=False,
+                     rounded=(False, True, True, False),
+                 )
+
+         gallery = gr.Gallery(label="Generated images", show_label=False, elem_id="generated_id").style(
+             grid=[2], height="auto"
+         )
+
+         ex = gr.Examples(examples=examples, fn=infer, inputs=[text], outputs=gallery, cache_examples=True)
+         ex.dataset.headers = [""]
+
+         text.submit(infer, inputs=[text], outputs=gallery)
+         btn.click(infer, inputs=[text], outputs=gallery)
+     gr.Markdown("""
+
+
+ # Authors
+
+ + Arseniy Shakhmatov: [GitHub](https://github.com/cene555), [Blog](https://t.me/gradientdip)
+ + Anton Razzhigaev: [GitHub](https://github.com/razzant), [Blog](https://t.me/abstractDL)
+ + Aleksandr Nikolich: [GitHub](https://github.com/AlexWortega), [Blog](https://t.me/lovedeathtransformers)
+ + Vladimir Arkhipkin: [GitHub](https://github.com/oriBetelgeuse)
+ + Igor Pavlov: [GitHub](https://github.com/boomb0om)
+ + Andrey Kuznetsov: [GitHub](https://github.com/kuznetsoffandrey)
+ + Denis Dimitrov: [GitHub](https://github.com/denndimitrov)
+
+     """
+     )
+
+ demo.queue(max_size=25).launch()
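Reviewer note: the model description embedded in `app.py` quotes parameter counts for three components. As a quick sanity check, the sketch below simply adds up those rounded figures. The dictionary and function names are hypothetical, and the total covers only the components named in the description; any component not listed there (for example an image autoencoder, if the model uses one) is not included.

```python
# Rounded component sizes quoted in the app.py description, in millions of parameters.
COMPONENTS_M = {
    "mCLIP-XLMR text encoder": 560,
    "mT5-encoder-small text encoder": 146,
    "UNet": 1200,
}


def total_params_millions(components=COMPONENTS_M) -> int:
    """Sum the listed component sizes (millions of parameters)."""
    return sum(components.values())


def text_encoder_params_millions(components=COMPONENTS_M) -> int:
    """Sum only the entries whose name marks them as text encoders."""
    return sum(size for name, size in components.items() if "text encoder" in name)


if __name__ == "__main__":
    print(text_encoder_params_millions())  # 706
    print(total_params_millions())         # 1906
```

So the two text encoders together account for roughly 0.7B parameters, and the listed components total roughly 1.9B.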
packages.txt ADDED
@@ -0,0 +1,3 @@
+ ffmpeg
+ libsm6
+ libxext6
requirements.txt ADDED
@@ -0,0 +1,3 @@
+ git+https://github.com/ai-forever/Kandinsky-2.0.git
+ gradio
+ opencv-python