stzhao commited on
Commit
d25ae48
·
verified ·
1 Parent(s): 0222e40

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -55
README.md CHANGED
@@ -12,46 +12,21 @@ tags:
12
  - flux
13
  ---
14
 
15
- ![FLUX.1 [dev] Grid](./dev_grid.jpg)
16
 
17
- `FLUX.1 [dev]` is a 12 billion parameter rectified flow transformer capable of generating images from text descriptions.
18
- For more information, please read our [blog post](https://blackforestlabs.ai/announcing-black-forest-labs/).
19
 
20
- # Key Features
21
- 1. Cutting-edge output quality, second only to our state-of-the-art model `FLUX.1 [pro]`.
22
- 2. Competitive prompt following, matching the performance of closed source alternatives .
23
- 3. Trained using guidance distillation, making `FLUX.1 [dev]` more efficient.
24
- 4. Open weights to drive new scientific research, and empower artists to develop innovative workflows.
25
- 5. Generated outputs can be used for personal, scientific, and commercial purposes as described in the [flux-1-dev-non-commercial-license](./licence.md).
26
 
27
- # Usage
28
- We provide a reference implementation of `FLUX.1 [dev]`, as well as sampling code, in a dedicated [github repository](https://github.com/black-forest-labs/flux).
29
- Developers and creatives looking to build on top of `FLUX.1 [dev]` are encouraged to use this as a starting point.
30
-
31
- ## API Endpoints
32
- The FLUX.1 models are also available via API from the following sources
33
- 1. [bfl.ml](https://docs.bfl.ml/) (currently `FLUX.1 [pro]`)
34
- 2. [replicate.com](https://replicate.com/collections/flux)
35
- 3. [fal.ai](https://fal.ai/models/fal-ai/flux/dev)
36
-
37
- ## ComfyUI
38
- `FLUX.1 [dev]` is also available in [Comfy UI](https://github.com/comfyanonymous/ComfyUI) for local inference with a node-based workflow.
39
-
40
- ## Diffusers
41
-
42
- To use `FLUX.1 [dev]` with the 🧨 diffusers python library, first install or upgrade diffusers
43
-
44
- ```shell
45
- pip install git+https://github.com/huggingface/diffusers.git
46
- ```
47
-
48
- Then you can use `FluxPipeline` to run the model
49
 
 
50
  ```python
51
  import torch
52
  from diffusers import FluxPipeline
53
 
54
- pipe = FluxPipeline.from_pretrained("black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16)
55
  pipe.enable_model_cpu_offload() #save some VRAM by offloading the model to CPU. Remove this if you have enough GPU power
56
 
57
  prompt = "A cat holding a sign that says hello world"
@@ -65,29 +40,10 @@ image = pipe(
65
  max_sequence_length=512,
66
  generator=torch.Generator("cpu").manual_seed(0)
67
  ).images[0]
68
- image.save("flux-dev.png")
69
  ```
70
 
71
- To learn more check out the [diffusers](https://huggingface.co/docs/diffusers/main/en/api/pipelines/flux) documentation
72
-
73
- ---
74
- # Limitations
75
- - This model is not intended or able to provide factual information.
76
- - As a statistical model this checkpoint might amplify existing societal biases.
77
- - The model may fail to generate output that matches the prompts.
78
- - Prompt following is heavily influenced by the prompting-style.
79
-
80
- # Out-of-Scope Use
81
- The model and its derivatives may not be used
82
-
83
- - In any way that violates any applicable national, federal, state, local or international law or regulation.
84
- - For the purpose of exploiting, harming or attempting to exploit or harm minors in any way; including but not limited to the solicitation, creation, acquisition, or dissemination of child exploitative content.
85
- - To generate or disseminate verifiably false information and/or content with the purpose of harming others.
86
- - To generate or disseminate personal identifiable information that can be used to harm an individual.
87
- - To harass, abuse, threaten, stalk, or bully individuals or groups of individuals.
88
- - To create non-consensual nudity or illegal pornographic content.
89
- - For fully automated decision making that adversely impacts an individual's legal rights or otherwise creates or modifies a binding, enforceable obligation.
90
- - Generating or facilitating large-scale disinformation campaigns.
91
 
92
- # License
93
- This model falls under the [`FLUX.1 [dev]` Non-Commercial License](https://huggingface.co/black-forest-labs/FLUX.1-dev/blob/main/LICENSE.md).
 
12
  - flux
13
  ---
14
 
15
+ **LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis**
16
 
17
+ This repository contains the model presented in the paper [LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis](https://huggingface.co/papers/2503.21749).
 
18
 
19
+ The abstract of the paper is the following:
 
 
 
 
 
20
 
21
+ We introduce LeX-Art, a comprehensive suite for high-quality text-image synthesis that systematically bridges the gap between prompt expressiveness and text rendering fidelity. Our approach follows a data-centric paradigm, constructing a high-quality data synthesis pipeline based on Deepseek-R1 to curate LeX-10K, a dataset of 10K high-resolution, aesthetically refined 1024$\times$1024 images. Beyond dataset construction, we develop LeX-Enhancer, a robust prompt enrichment model, and train two text-to-image models, LeX-FLUX and LeX-Lumina, achieving state-of-the-art text rendering performance. To systematically evaluate visual text generation, we introduce LeX-Bench, a benchmark that assesses fidelity, aesthetics, and alignment, complemented by Pairwise Normalized Edit Distance (PNED), a novel metric for robust text accuracy evaluation. Experiments demonstrate significant improvements, with LeX-Lumina achieving a 22.16\% PNED gain, and LeX-FLUX outperforming baselines in color (+10.32\%), positional (+5.60\%), and font accuracy (+5.63\%). The codes, models, datasets, and demo are publicly available.
22
+ ![demo](teaser.png)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
23
 
24
+ **Usage of LeX-Lumina:**
25
  ```python
26
  import torch
27
  from diffusers import FluxPipeline
28
 
29
+ pipe = FluxPipeline.from_pretrained("X-ART/LeX-FLUX", torch_dtype=torch.bfloat16)
30
  pipe.enable_model_cpu_offload() #save some VRAM by offloading the model to CPU. Remove this if you have enough GPU power
31
 
32
  prompt = "A cat holding a sign that says hello world"
 
40
  max_sequence_length=512,
41
  generator=torch.Generator("cpu").manual_seed(0)
42
  ).images[0]
43
+ image.save("lex_flux_demo.png")
44
  ```
45
 
46
+ See also:
47
+ * [Project page](https://zhaoshitian.github.io/lexart/)
48
+ * [Code](https://github.com/zhaoshitian/LeX-Art)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
49