apol
/

dalle-mini

@@ -1,21 +1,12 @@
-## DALL-E Mini - Generate image from text
-## TODO
-* experiment with flax/jax and setup of the TPU instance that we should get shortly
-* work on dataset loading - [see suggested datasets](https://discuss.huggingface.co/t/dall-e-mini-version/7324/4)
-* Optionally create the OpenAI YFCC100M subset (see [this post](https://discuss.huggingface.co/t/dall-e-mini-version/7324/30?u=boris))
-* work on text/image encoding
-* concatenate inputs (not sure if we need fixed length for text or use a special token separating text & image)
-* adapt training script
-* create inference function
-* integrate CLIP for better results (only if we have the time)
-* work on a demo (streamlit or colab or maybe just HF widget)
-* document (set up repo on model hub per instructions, start on README writeup…)
-* help with coordinating activities & progress
-## Dependencies Installation
-You should create a new python virtual environment and install the project dependencies inside the virtual env: `pip install -r requirements.txt`
-If you use `conda`, you can create the virtual env and install everything using: `conda env update -f environments.yaml`

+---
+language:
+- en
+---
+## DALL-E Mini - Generate images from text
+Model generating images from a text prompt.
+Outputs need to be decoded with [flax-community/vqgan_f16_16384](https://huggingface.co/flax-community/vqgan_f16_16384).
+Model definition is currently in the [Github repo](https://github.com/borisdayma/dalle-mini) as `CustomFlaxBartForConditionalGeneration`.