Update README.md
Browse files
README.md
CHANGED
|
@@ -12,46 +12,21 @@ tags:
|
|
| 12 |
- flux
|
| 13 |
---
|
| 14 |
|
| 15 |
-
|
| 16 |
|
| 17 |
-
|
| 18 |
-
For more information, please read our [blog post](https://blackforestlabs.ai/announcing-black-forest-labs/).
|
| 19 |
|
| 20 |
-
|
| 21 |
-
1. Cutting-edge output quality, second only to our state-of-the-art model `FLUX.1 [pro]`.
|
| 22 |
-
2. Competitive prompt following, matching the performance of closed source alternatives .
|
| 23 |
-
3. Trained using guidance distillation, making `FLUX.1 [dev]` more efficient.
|
| 24 |
-
4. Open weights to drive new scientific research, and empower artists to develop innovative workflows.
|
| 25 |
-
5. Generated outputs can be used for personal, scientific, and commercial purposes as described in the [flux-1-dev-non-commercial-license](./licence.md).
|
| 26 |
|
| 27 |
-
|
| 28 |
-
|
| 29 |
-
Developers and creatives looking to build on top of `FLUX.1 [dev]` are encouraged to use this as a starting point.
|
| 30 |
-
|
| 31 |
-
## API Endpoints
|
| 32 |
-
The FLUX.1 models are also available via API from the following sources
|
| 33 |
-
1. [bfl.ml](https://docs.bfl.ml/) (currently `FLUX.1 [pro]`)
|
| 34 |
-
2. [replicate.com](https://replicate.com/collections/flux)
|
| 35 |
-
3. [fal.ai](https://fal.ai/models/fal-ai/flux/dev)
|
| 36 |
-
|
| 37 |
-
## ComfyUI
|
| 38 |
-
`FLUX.1 [dev]` is also available in [Comfy UI](https://github.com/comfyanonymous/ComfyUI) for local inference with a node-based workflow.
|
| 39 |
-
|
| 40 |
-
## Diffusers
|
| 41 |
-
|
| 42 |
-
To use `FLUX.1 [dev]` with the 🧨 diffusers python library, first install or upgrade diffusers
|
| 43 |
-
|
| 44 |
-
```shell
|
| 45 |
-
pip install git+https://github.com/huggingface/diffusers.git
|
| 46 |
-
```
|
| 47 |
-
|
| 48 |
-
Then you can use `FluxPipeline` to run the model
|
| 49 |
|
|
|
|
| 50 |
```python
|
| 51 |
import torch
|
| 52 |
from diffusers import FluxPipeline
|
| 53 |
|
| 54 |
-
pipe = FluxPipeline.from_pretrained("
|
| 55 |
pipe.enable_model_cpu_offload() #save some VRAM by offloading the model to CPU. Remove this if you have enough GPU power
|
| 56 |
|
| 57 |
prompt = "A cat holding a sign that says hello world"
|
|
@@ -65,29 +40,10 @@ image = pipe(
|
|
| 65 |
max_sequence_length=512,
|
| 66 |
generator=torch.Generator("cpu").manual_seed(0)
|
| 67 |
).images[0]
|
| 68 |
-
image.save("
|
| 69 |
```
|
| 70 |
|
| 71 |
-
|
| 72 |
-
|
| 73 |
-
-
|
| 74 |
-
# Limitations
|
| 75 |
-
- This model is not intended or able to provide factual information.
|
| 76 |
-
- As a statistical model this checkpoint might amplify existing societal biases.
|
| 77 |
-
- The model may fail to generate output that matches the prompts.
|
| 78 |
-
- Prompt following is heavily influenced by the prompting-style.
|
| 79 |
-
|
| 80 |
-
# Out-of-Scope Use
|
| 81 |
-
The model and its derivatives may not be used
|
| 82 |
-
|
| 83 |
-
- In any way that violates any applicable national, federal, state, local or international law or regulation.
|
| 84 |
-
- For the purpose of exploiting, harming or attempting to exploit or harm minors in any way; including but not limited to the solicitation, creation, acquisition, or dissemination of child exploitative content.
|
| 85 |
-
- To generate or disseminate verifiably false information and/or content with the purpose of harming others.
|
| 86 |
-
- To generate or disseminate personal identifiable information that can be used to harm an individual.
|
| 87 |
-
- To harass, abuse, threaten, stalk, or bully individuals or groups of individuals.
|
| 88 |
-
- To create non-consensual nudity or illegal pornographic content.
|
| 89 |
-
- For fully automated decision making that adversely impacts an individual's legal rights or otherwise creates or modifies a binding, enforceable obligation.
|
| 90 |
-
- Generating or facilitating large-scale disinformation campaigns.
|
| 91 |
|
| 92 |
-
# License
|
| 93 |
-
This model falls under the [`FLUX.1 [dev]` Non-Commercial License](https://huggingface.co/black-forest-labs/FLUX.1-dev/blob/main/LICENSE.md).
|
|
|
|
| 12 |
- flux
|
| 13 |
---
|
| 14 |
|
| 15 |
+
**LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis**
|
| 16 |
|
| 17 |
+
This repository contains the model presented in the paper [LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis](https://huggingface.co/papers/2503.21749).
|
|
|
|
| 18 |
|
| 19 |
+
The abstract of the paper is the following:
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 20 |
|
| 21 |
+
We introduce LeX-Art, a comprehensive suite for high-quality text-image synthesis that systematically bridges the gap between prompt expressiveness and text rendering fidelity. Our approach follows a data-centric paradigm, constructing a high-quality data synthesis pipeline based on Deepseek-R1 to curate LeX-10K, a dataset of 10K high-resolution, aesthetically refined 1024$\times$1024 images. Beyond dataset construction, we develop LeX-Enhancer, a robust prompt enrichment model, and train two text-to-image models, LeX-FLUX and LeX-Lumina, achieving state-of-the-art text rendering performance. To systematically evaluate visual text generation, we introduce LeX-Bench, a benchmark that assesses fidelity, aesthetics, and alignment, complemented by Pairwise Normalized Edit Distance (PNED), a novel metric for robust text accuracy evaluation. Experiments demonstrate significant improvements, with LeX-Lumina achieving a 22.16\% PNED gain, and LeX-FLUX outperforming baselines in color (+10.32\%), positional (+5.60\%), and font accuracy (+5.63\%). The codes, models, datasets, and demo are publicly available.
|
| 22 |
+

|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 23 |
|
| 24 |
+
**Usage of LeX-Lumina:**
|
| 25 |
```python
|
| 26 |
import torch
|
| 27 |
from diffusers import FluxPipeline
|
| 28 |
|
| 29 |
+
pipe = FluxPipeline.from_pretrained("X-ART/LeX-FLUX", torch_dtype=torch.bfloat16)
|
| 30 |
pipe.enable_model_cpu_offload() #save some VRAM by offloading the model to CPU. Remove this if you have enough GPU power
|
| 31 |
|
| 32 |
prompt = "A cat holding a sign that says hello world"
|
|
|
|
| 40 |
max_sequence_length=512,
|
| 41 |
generator=torch.Generator("cpu").manual_seed(0)
|
| 42 |
).images[0]
|
| 43 |
+
image.save("lex_flux_demo.png")
|
| 44 |
```
|
| 45 |
|
| 46 |
+
See also:
|
| 47 |
+
* [Project page](https://zhaoshitian.github.io/lexart/)
|
| 48 |
+
* [Code](https://github.com/zhaoshitian/LeX-Art)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 49 |
|
|
|
|
|
|