Boris Dayma commited on
Commit ·
8b264b9
1
Parent(s): db83852
Update README.md
Browse files
README.md
CHANGED
|
@@ -1,21 +1,12 @@
|
|
| 1 |
-
|
|
|
|
|
|
|
|
|
|
| 2 |
|
| 3 |
-
##
|
| 4 |
|
| 5 |
-
|
| 6 |
-
* work on dataset loading - [see suggested datasets](https://discuss.huggingface.co/t/dall-e-mini-version/7324/4)
|
| 7 |
-
* Optionally create the OpenAI YFCC100M subset (see [this post](https://discuss.huggingface.co/t/dall-e-mini-version/7324/30?u=boris))
|
| 8 |
-
* work on text/image encoding
|
| 9 |
-
* concatenate inputs (not sure if we need fixed length for text or use a special token separating text & image)
|
| 10 |
-
* adapt training script
|
| 11 |
-
* create inference function
|
| 12 |
-
* integrate CLIP for better results (only if we have the time)
|
| 13 |
-
* work on a demo (streamlit or colab or maybe just HF widget)
|
| 14 |
-
* document (set up repo on model hub per instructions, start on README writeup…)
|
| 15 |
-
* help with coordinating activities & progress
|
| 16 |
|
|
|
|
| 17 |
|
| 18 |
-
|
| 19 |
-
You should create a new python virtual environment and install the project dependencies inside the virtual env: `pip install -r requirements.txt`
|
| 20 |
-
|
| 21 |
-
If you use `conda`, you can create the virtual env and install everything using: `conda env update -f environments.yaml`
|
|
|
|
| 1 |
+
---
|
| 2 |
+
language:
|
| 3 |
+
- en
|
| 4 |
+
---
|
| 5 |
|
| 6 |
+
## DALL-E Mini - Generate images from text
|
| 7 |
|
| 8 |
+
Model generating images from a text prompt.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 9 |
|
| 10 |
+
Outputs need to be decoded with [flax-community/vqgan_f16_16384](https://huggingface.co/flax-community/vqgan_f16_16384).
|
| 11 |
|
| 12 |
+
Model definition is currently in the [Github repo](https://github.com/borisdayma/dalle-mini) as `CustomFlaxBartForConditionalGeneration`.
|
|
|
|
|
|
|
|
|