Update README.md
README.md
CHANGED
@@ -29,10 +29,11 @@ The feature alignment loss is designed in a way such that the output of `block-x
 The distillation process is performed with `512x512` Laion images recaptioned with `Qwen-VL` in the first stage for `90k steps`,
 and `1024x1024` images generated by `Flux` using the prompts in `JourneyDB` with another `90k steps`.
 
-##
+## Limitations
 
-
-
+With limited computing and data resources, the capability of our Flux-mini is still limited in certain domains.
+To facilitate the development of flux-based models, we open-sourced the codes to distill Flux in [this link](https://github.com/TencentARC/FluxKits).
+We appeal people interested in this project to collaborate together to build a more applicable and powerful text-to-image model!
 
 The current model is ok with generating common images such as human/animal faces, landscapes, fantasy and abstract scenes.
 Unfortunately, it is still incompetent in many scenarios. Including but not limited to:
@@ -47,5 +48,3 @@ Since our model is trained with prompts in JourneyDB, we encourage users to appl
 For example: "profile of sad Socrates, full body, high detail, dramatic scene, Epic dynamic action, wide angle, cinematic, hyper-realistic, concept art, warm muted tones as painted by Bernie Wrightson, Frank Frazetta."
 
 Thank you for your attention! We will continue to improve our model and release new versions in the future.
-
-github link: https://github.com/TencentARC/flux-toolkits