Linear77
#5
by Linear77 - opened
README.md
CHANGED
|
@@ -6,92 +6,69 @@ tags:
|
|
| 6 |
- template:diffusion-lora
|
| 7 |
- flux
|
| 8 |
- nft
|
|
|
|
| 9 |
- art
|
|
|
|
| 10 |
widget:
|
| 11 |
-
- text: '
|
| 12 |
output:
|
| 13 |
url: >-
|
| 14 |
-
images/
|
| 15 |
-
- text: '
|
| 16 |
output:
|
| 17 |
url: >-
|
| 18 |
-
images/
|
| 19 |
-
- text: '
|
| 20 |
output:
|
| 21 |
url: >-
|
| 22 |
-
images/
|
| 23 |
-
- text: '
|
| 24 |
output:
|
| 25 |
url: >-
|
| 26 |
-
images/
|
| 27 |
-
- text: 'NFT ++, a black and white drawing of a black swan in a glass container. The container is shaped like a hexagon, with a black handle on the right side. The swan is swimming in the water, its head facing towards the left side of the container, its beak slightly open. The water in the container is a dark blue, with white splashes on it. Above the swan, the words "prithiv-hf" are written in bold black letters. The background is a light blue, and the text "prithiv-hf" is written in a bold black font.'
|
| 28 |
-
output:
|
| 29 |
-
url: >-
|
| 30 |
-
images/2.png
|
| 31 |
-
- text: 'NFT ++, a cartoon drawing of a cats face is seen in a clear glass container. The container is filled with a variety of brown objects, including a barcode and the word "prithiv" written in black letters. The background is a stark white, with a light brown border. The cats head is adorned with a red ribbon, adding a pop of color to the scene. The word "imo" is written on the top right corner of the container.'
|
| 32 |
-
output:
|
| 33 |
-
url: >-
|
| 34 |
-
images/3.png
|
| 35 |
base_model: black-forest-labs/FLUX.1-dev
|
| 36 |
-
instance_prompt:
|
| 37 |
license: creativeml-openrail-m
|
| 38 |
---
|
| 39 |
-

|
| 40 |
|
| 41 |
-
|
| 42 |
|
| 43 |
-
|
| 44 |
|
| 45 |
-
Image Processing Parameters
|
| 46 |
|
| 47 |
-
| Parameter | Value
|
| 48 |
-
|---------------------------|--------|---------------------------|--------|
|
| 49 |
| LR Scheduler | constant | Noise Offset | 0.03 |
|
| 50 |
-
| Optimizer | AdamW
|
| 51 |
-
| Network Dim |
|
| 52 |
-
| Network Alpha |
|
| 53 |
-
| Epoch |
|
| 54 |
-
|
| 55 |
-
Labeling: florence2-en(natural language & English)
|
| 56 |
-
|
| 57 |
-
Total Images Used for Training : 12
|
| 58 |
-
|
| 59 |
-
## Best Dimensions & Inference
|
| 60 |
-
|
| 61 |
-
| **Dimensions** | **Aspect Ratio** | **Recommendation** |
|
| 62 |
-
|-----------------|------------------|---------------------------|
|
| 63 |
-
| 1280 x 832 | 3:2 | Best |
|
| 64 |
-
| 1024 x 1024 | 1:1 | Default |
|
| 65 |
|
| 66 |
-
##
|
| 67 |
|
| 68 |
-
|
|
|
|
|
|
|
| 69 |
|
| 70 |
## Setting Up
|
|
|
|
| 71 |
```python
|
| 72 |
import torch
|
| 73 |
-
from
|
| 74 |
|
| 75 |
base_model = "black-forest-labs/FLUX.1-dev"
|
| 76 |
pipe = DiffusionPipeline.from_pretrained(base_model, torch_dtype=torch.bfloat16)
|
| 77 |
|
| 78 |
-
|
| 79 |
-
|
| 80 |
pipe.load_lora_weights(lora_repo)
|
| 81 |
|
|
|
|
| 82 |
device = torch.device("cuda")
|
| 83 |
pipe.to(device)
|
| 84 |
-
```
|
| 85 |
-
## Trigger words
|
| 86 |
-
|
| 87 |
-
You should use `NFT ++` to trigger the image generation.
|
| 88 |
-
|
| 89 |
-
## Data Source
|
| 90 |
-
|
| 91 |
-
quaphadoodle
|
| 92 |
-
|
| 93 |
-
## Download model
|
| 94 |
-
|
| 95 |
-
Weights for this model are available in Safetensors format.
|
| 96 |
|
| 97 |
-
|
|
|
|
|
|
|
|
|
|
|
|
| 6 |
- template:diffusion-lora
|
| 7 |
- flux
|
| 8 |
- nft
|
| 9 |
+
- pfp
|
| 10 |
- art
|
| 11 |
+
- cartoon
|
| 12 |
widget:
|
| 13 |
+
- text: 'YellowGang, A front-facing view of the character wearing a black snapback cap, a red basketball jersey with "1" and "YellowGang" logo, gold chains, solid blue background.'
|
| 14 |
output:
|
| 15 |
url: >-
|
| 16 |
+
images/default.png
|
| 17 |
+
- text: 'YellowGang, A character wearing a futuristic cyborg motorcycle helmet with a glowing red LED visor, an astronaut suit with the "YellowGang" patch, solid blue background.'
|
| 18 |
output:
|
| 19 |
url: >-
|
| 20 |
+
images/cyborg.png
|
| 21 |
+
- text: 'YellowGang, The character clean-shaven (bald), wearing a classic black and white zebra bandana, sleek black aviator sunglasses, and a pink Japanese Kimono with the "YellowGang" logo, solid blue background.'
|
| 22 |
output:
|
| 23 |
url: >-
|
| 24 |
+
images/kimono.png
|
| 25 |
+
- text: 'YellowGang, A close-up portrait of the character with large over-ear DJ headphones, angry red eyes, a spiked leather choker collar, and a blue streetwear hoodie with the "YellowGang" logo, solid blue background.'
|
| 26 |
output:
|
| 27 |
url: >-
|
| 28 |
+
images/punk.png
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 29 |
base_model: black-forest-labs/FLUX.1-dev
|
| 30 |
+
instance_prompt: YellowGang
|
| 31 |
license: creativeml-openrail-m
|
| 32 |
---
|
|
|
|
| 33 |
|
| 34 |
+
# Model description for YellowGang-Generator-LoRA
|
| 35 |
|
| 36 |
+
This LoRA model is a specialized generator for the "YellowGang" collection. It is fine-tuned to preserve the character's signature yellow-skinned facial features and proportions while allowing for high-fidelity generation of diverse traits, including various headwear, eyewear, outfits, and neck accessories.
|
| 37 |
|
| 38 |
+
## Image Processing Parameters
|
| 39 |
|
| 40 |
+
| Parameter | Value | Parameter | Value |
|
| 41 |
+
|---------------------------|----------|---------------------------|--------|
|
| 42 |
| LR Scheduler | constant | Noise Offset | 0.03 |
|
| 43 |
+
| Optimizer | AdamW | Multires Noise Discount | 0.1 |
|
| 44 |
+
| Network Dim | 128 | Multires Noise Iterations | 10 |
|
| 45 |
+
| Network Alpha | 64 | Repeat & Steps | 25 & 3500 |
|
| 46 |
+
| Epoch | 15 | Save Every N Epochs | 2 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 47 |
|
| 48 |
+
## Inference Recommendation
|
| 49 |
|
| 50 |
+
* **Best Dimensions:** 1024 x 1024 (1:1 Aspect Ratio)
|
| 51 |
+
* **Recommended Inference Steps:** 30–35
|
| 52 |
+
* **Trigger Word:** `YellowGang`
|
| 53 |
|
| 54 |
## Setting Up
|
| 55 |
+
|
| 56 |
```python
|
| 57 |
import torch
|
| 58 |
+
from diffusers import DiffusionPipeline
|
| 59 |
|
| 60 |
base_model = "black-forest-labs/FLUX.1-dev"
|
| 61 |
pipe = DiffusionPipeline.from_pretrained(base_model, torch_dtype=torch.bfloat16)
|
| 62 |
|
| 63 |
+
# Ganti dengan username dan nama repo kamu
|
| 64 |
+
lora_repo = "Linear77/YellowGang-Generator-LoRA"
|
| 65 |
pipe.load_lora_weights(lora_repo)
|
| 66 |
|
| 67 |
+
trigger_word = "YellowGang"
|
| 68 |
device = torch.device("cuda")
|
| 69 |
pipe.to(device)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 70 |
|
| 71 |
+
# Contoh Prompt
|
| 72 |
+
prompt = "YellowGang, wearing a crown, gold chain, basketball jersey, solid blue background"
|
| 73 |
+
image = pipe(prompt).images[0]
|
| 74 |
+
image.save("result.png")
|