Update README.md
README.md
CHANGED
|
@@ -23,6 +23,7 @@ The no caption dataset uses only ohwx 3d render as caption
|
|
| 23 |
|
| 24 |
<img src="https://cdn-uploads.huggingface.co/production/uploads/6345bd89fe134dfd7a0dba40/jK75d8i1x5hAHSYSsJNBd.png" alt="Training configuration" style="max-height: 500px; width: auto;">
|
| 25 |
|
|
|
|
| 26 |
|
| 27 |
## Inconsistent Dataset Training
|
| 28 |
|
|
@@ -32,28 +33,65 @@ This is the first training I made with the below dataset
|
|
| 32 |
|
| 33 |
When you pay attention to the grid image above shared, you will see that the dataset is not consistent
|
| 34 |
|
| 35 |
It has total 114 images
|
| 36 |
|
| 37 |
-
This training total step count was 500 * 114 / 4 (4x GPU - batch size 1) = 14250
|
| 38 |
|
| 39 |
It took like 37 hours on 4x RTX A6000 GPU with slow config - faster config would take like half
|
| 40 |
|
| 41 |
There were 2 trainings made with this dataset. Epoch 500 checkpoints are named as below
|
| 42 |
|
| 43 |
-
SECourses_Style_Inconsistent_DATASET_NO_Captions.safetensors
|
| 44 |
-
SECourses_Style_Inconsistent_DATASET_With_Captions.safetensors
|
| 45 |
|
| 46 |
Their checkpoints are saved in below folders
|
| 47 |
|
| 48 |
-
Training-Checkpoints-NO-Captions
|
| 49 |
-
Training-Checkpoints-With-Captions
|
| 50 |
|
| 51 |
Its grid results are shared below
|
| 52 |
|
| 53 |
-
https://huggingface.co/MonsterMMORPG/3D-Cartoon-Style-FLUX/resolve/main/Inconsistent-Training-Dataset-Results-Grid-26100x23700px.jpg
|
| 54 |
|
| 55 |
When you pay attention to above image you will see that it has inconsistent results
|
| 56 |
|
| 57 |
|
| 58 |
1 : https://youtu.be/bupRePUOA18
|
| 59 |
|
|
|
|
| 23 |
|
| 24 |
<img src="https://cdn-uploads.huggingface.co/production/uploads/6345bd89fe134dfd7a0dba40/jK75d8i1x5hAHSYSsJNBd.png" alt="Training configuration" style="max-height: 500px; width: auto;">
|
| 25 |
|
| 26 |
+
All trainings are saved as Float with LoRA rank 128, so each checkpoint is above 2 GB
|
| 27 |
|
| 28 |
## Inconsistent Dataset Training
|
| 29 |
|
|
|
|
| 33 |
|
| 34 |
When you pay attention to the grid image above shared, you will see that the dataset is not consistent
|
| 35 |
|
| 36 |
+
The training dataset, including the captions used (only for the With Captions training), can be seen in the directory below
|
| 37 |
+
|
| 38 |
+
[Training-Dataset](https://huggingface.co/MonsterMMORPG/3D-Cartoon-Style-FLUX/tree/main/Training-Dataset)
|
| 39 |
+
|
| 40 |
It has total 114 images
|
| 41 |
|
| 42 |
+
This training total step count was 500 * 114 / 4 (4x GPU - batch size 1) = 14250 steps
|
| 43 |
|
| 44 |
It took like 37 hours on 4x RTX A6000 GPU with slow config - faster config would take like half
|
| 45 |
|
| 46 |
There were 2 trainings made with this dataset. Epoch 500 checkpoints are named as below
|
| 47 |
|
| 48 |
+
[SECourses_Style_Inconsistent_DATASET_NO_Captions.safetensors](https://huggingface.co/MonsterMMORPG/3D-Cartoon-Style-FLUX/resolve/main/SECourses_Style_Inconsistent_DATASET_NO_Captions.safetensors)
|
| 49 |
+
[SECourses_Style_Inconsistent_DATASET_With_Captions.safetensors](https://huggingface.co/MonsterMMORPG/3D-Cartoon-Style-FLUX/resolve/main/SECourses_Style_Inconsistent_DATASET_With_Captions.safetensors)
|
| 50 |
|
| 51 |
Their checkpoints are saved in below folders
|
| 52 |
|
| 53 |
+
[Training-Checkpoints-NO-Captions](https://huggingface.co/MonsterMMORPG/3D-Cartoon-Style-FLUX/tree/main/Training-Checkpoints-NO-Captions)
|
| 54 |
+
[Training-Checkpoints-With-Captions](https://huggingface.co/MonsterMMORPG/3D-Cartoon-Style-FLUX/tree/main/Training-Checkpoints-With-Captions)
|
| 55 |
|
| 56 |
Its grid results are shared below
|
| 57 |
|
| 58 |
+
[Inconsistent-Training-Dataset-Results-Grid-26100x23700px.jpg](https://huggingface.co/MonsterMMORPG/3D-Cartoon-Style-FLUX/resolve/main/Inconsistent-Training-Dataset-Results-Grid-26100x23700px.jpg)
|
| 59 |
|
| 60 |
When you pay attention to above image you will see that it has inconsistent results
|
| 61 |
|
| 62 |
+
## Consistent Dataset Training
|
| 63 |
+
|
| 64 |
+
After I noticed that the initial training dataset was inconsistent, I pruned the dataset and made it much more consistent
|
| 65 |
+
|
| 66 |
+
[Fixed-Consistent-Training-Dataset-Images-Grid.jpg](https://huggingface.co/MonsterMMORPG/3D-Cartoon-Style-FLUX/resolve/main/Fixed-Consistent-Training-Dataset-Images-Grid.jpg)
|
| 67 |
+
|
| 68 |
+
If you look closely at the grid image shared above, you will see that it is far more consistent, though still not perfect
|
| 69 |
+
|
| 70 |
+
It now has 66 images in total
|
| 71 |
+
|
| 72 |
+
The training dataset, including the captions used for this training (only for the With Captions training), can be seen in the directory below
|
| 73 |
+
|
| 74 |
+
[Training-Dataset](https://huggingface.co/MonsterMMORPG/3D-Cartoon-Style-FLUX/tree/main/Training-Dataset)
|
| 75 |
+
|
| 76 |
+
This training total step count was 500 * 66 / 4 (4x GPU - batch size 1) = 8250 steps
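
The step counts quoted for both datasets follow the same arithmetic: epochs times images, divided by the number of images processed per step across all GPUs. A minimal sketch of that calculation is below. The function name `total_steps` and its parameter names are hypothetical, chosen only for illustration, and the formula mirrors the README's arithmetic rather than any particular trainer's internal rounding.

```python
def total_steps(epochs: int, num_images: int, num_gpus: int,
                batch_size_per_gpu: int = 1) -> int:
    """Total training steps, following the README's formula:
    epochs * num_images / (num_gpus * batch_size_per_gpu).

    Assumption: the division is applied to the whole product, as in the
    README, not rounded per epoch. A real trainer may round differently.
    """
    images_per_step = num_gpus * batch_size_per_gpu
    return (epochs * num_images) // images_per_step


# Values taken from the README: 500 epochs, 4 GPUs, batch size 1 per GPU
inconsistent_steps = total_steps(epochs=500, num_images=114, num_gpus=4)
consistent_steps = total_steps(epochs=500, num_images=66, num_gpus=4)

print(inconsistent_steps)  # 14250
print(consistent_steps)    # 8250
```

Pruning the dataset from 114 to 66 images reduces the step count in the same proportion, which is consistent with the shorter training time reported for the second run.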
|
| 77 |
+
|
| 78 |
+
It took like 24 hours on 4x RTX A6000 GPU with slow config - faster config would take like half
|
| 79 |
+
|
| 80 |
+
There were 2 trainings made with this dataset. Epoch 500 checkpoints are named as below
|
| 81 |
+
|
| 82 |
+
[SECourses_Style_Inconsistent_DATASET_NO_Captions.safetensors](https://huggingface.co/MonsterMMORPG/3D-Cartoon-Style-FLUX/resolve/main/SECourses_Style_Inconsistent_DATASET_NO_Captions.safetensors)
|
| 83 |
+
[SECourses_Style_Inconsistent_DATASET_With_Captions.safetensors](https://huggingface.co/MonsterMMORPG/3D-Cartoon-Style-FLUX/resolve/main/SECourses_Style_Inconsistent_DATASET_With_Captions.safetensors)
|
| 84 |
+
|
| 85 |
+
Their checkpoints are saved in below folders
|
| 86 |
+
|
| 87 |
+
[Training-Checkpoints-NO-Captions](https://huggingface.co/MonsterMMORPG/3D-Cartoon-Style-FLUX/tree/main/Training-Checkpoints-NO-Captions)
|
| 88 |
+
[Training-Checkpoints-With-Captions](https://huggingface.co/MonsterMMORPG/3D-Cartoon-Style-FLUX/tree/main/Training-Checkpoints-With-Captions)
|
| 89 |
+
|
| 90 |
+
Its grid results are shared below
|
| 91 |
+
|
| 92 |
+
[Inconsistent-Training-Dataset-Results-Grid-26100x23700px.jpg](https://huggingface.co/MonsterMMORPG/3D-Cartoon-Style-FLUX/resolve/main/Inconsistent-Training-Dataset-Results-Grid-26100x23700px.jpg)
|
| 93 |
+
|
| 94 |
+
When you pay attention to above image you will see that it has inconsistent results
|
| 95 |
|
| 96 |
1 : https://youtu.be/bupRePUOA18
|
| 97 |
|