Raxephion commited on
Commit
50a8d43
ยท
verified ยท
1 Parent(s): d361373

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +102 -0
README.md ADDED
@@ -0,0 +1,102 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: creativeml-openrail-m
3
+ base_model:
4
+ - stable-diffusion-v1-5/stable-diffusion-v1-5
5
+ library_name: diffusers
6
+ ---
7
+
8
+ # ๐ŸŒช๏ธ Typhoon V2 (Stable Diffusion 1.5 Edition)
9
+
10
+ > _"Still SD1.5. Still cursed. But at least now it understands limbs."_
11
+
12
+ ---
13
+
14
+ ## ๐Ÿงฌ Overview
15
+
16
+ Typhoon V2 is the long-overdue upgrade to Typhoon V1, trained for SD1.5 using smarter techniques, cleaner data, and a few hard-earned lessons from the first version. The result? More control, better anatomy, stronger stylization โ€” and fewer existential crises per batch.
17
+
18
+ It builds on the core identity of V1 (tag-based prompts, no trigger words, bold composition) but addresses its flaws head-on: warped limbs, prompt misfires, and the occasional brush with the uncanny.
19
+
20
+ V2 plays much nicer with short prompts, handles faces even better than before, and generally wonโ€™t lose its mind when asked for basic body parts. Still no poetic-prose magic, though โ€” this one *also* thinks in tags.
21
+
22
+ ---
23
+
24
+ ## ๐Ÿ”ง Development Notes
25
+
26
+ Typhoon V2 was trained from scratch again, but this time with properly scaled datasets (no more 512ร—512-only crops) and aspect-ratio-aware augmentation. The architecture is still SD1.5, but the dataset strategy and training parameters got a much-needed overhaul.
27
+
28
+ Training was done on rented A100s โ€” because, apparently, learning costs pain. Dataset prep was completely redone, with better captions, refined tag filtering, and entirely new sets focused on pose coverage, negative regularization, and structural accuracy.
29
+
30
+ Merging? None this time โ€” this is a native checkpoint. No LoRA layering, no weight juggling. Just clean, consolidated training. (Still used my analysis tools, though โ€” because mistakes are expensive.)
31
+
32
+ ๐Ÿ› ๏ธ Tools used during development:
33
+
34
+ - [LoRA Strength Analyser](https://github.com/Raxephion/loRA-Strength-Analyser)
35
+ - [LoRA Epoch Analyser](https://github.com/Raxephion/loRA-Epoch-Analyser)
36
+ - [TensorPeek](https://github.com/Raxephion/The-Vault/tree/main/TensorPeek) โ€” for inspecting `.safetensors` metadata
37
+ - [LoRA Distiller (WIP)](https://github.com/Raxephion/The-Vault) โ€” experimental, but useful in this pipeline
38
+
39
+ The base model was once again `v1-5-pruned-emaonly.safetensors`, but augmented via LoRA distillation and pre-conditioning to mitigate its quirks. All improvements are checkpoint-native โ€” no merging required.
40
+
41
+ ---
42
+
43
+ ## ๐Ÿ–ผ๏ธ Sample Images
44
+
45
+ All images were generated using the base Typhoon V2 checkpoint. No LoRAs, no inpainting, no face fixers โ€” just raw inference with Hires Fix.
46
+
47
+ **Settings:**
48
+
49
+ - **Resolution**: 512ร—768, 576ร—832, or 640ร—896
50
+ - **Sampler**: DPM++ 2M Karras (Euler A also works fine)
51
+ - **CFG**: 6.5โ€“7
52
+ - **Hires Fix**:
53
+ - Denoising strength: 0.6โ€“0.7
54
+ - Upscaler: Latent
55
+ - Upscale by: 2
56
+ - **VAE**: [sd-vae-ft-ema](https://huggingface.co/stabilityai/sd-vae-ft-ema)
57
+
58
+ โš ๏ธ Legacy `.vae.pt` or `.vae.bin` files will likely cause washed-out or low-contrast results. Use the official VAE or none at all for correct output.
59
+
60
+ ---
61
+
62
+ ## โš™๏ธ Prompting Tips
63
+
64
+ - **Trigger Words**: None
65
+ - **Prompting Style**: Tag-based preferred (e.g. `1girl, long hair, looking at viewer`)
66
+ - **Natural Language**: Still not a fan โ€” use structured tags for best results
67
+ - **ADetailer / Face Fixing**: Rarely needed; faces are stable out of the box
68
+ - **Recommended Resolutions**:
69
+ - 512ร—768
70
+ - 576ร—832
71
+ - 640ร—896
72
+ Narrow resolutions like 512ร—640 are prone to artifacts. Avoid if possible.
73
+
74
+ ---
75
+
76
+ ## โš ๏ธ Limitations
77
+
78
+ - **NSFW**: Still mildly neutered by the base model. Performance has improved over V1, but results are hit-or-miss.
79
+ - **Anatomy**: Much better than V1, but still SD1.5 โ€” expect occasional hiccups
80
+ - **Natural Language Prompts**: Works better than V1, but short, tag-like prompts still yield the most consistent results
81
+
82
+ ---
83
+
84
+ ## ๐Ÿ”’ License & Usage
85
+
86
+ - โœ… Personal use: Absolutely
87
+ - ๐Ÿšซ **Do NOT** upload this model to generation websites or aggregators
88
+ - ๐Ÿšซ **Do NOT** merge this model into other checkpoints
89
+
90
+ > Why? Typhoon V2 was trained cleanly and directly. Merging would break its stylistic balance, ruin its improvements, and waste the training effort. Please donโ€™t.
91
+
92
+ ---
93
+
94
+ ## ๐Ÿ”ฎ Future Work
95
+
96
+ - Further refinements possible via targeted LoRAs or partial retrains
97
+ - Potential distillation into an SDXL variant
98
+ - Ongoing prompt testing and edge-case analysis
99
+
100
+ ---
101
+
102
+ **Enjoy the storm โ€” again.** โ›ˆ๏ธ