Create README.md
README.md
ADDED
@@ -0,0 +1,102 @@
---
license: creativeml-openrail-m
base_model:
- stable-diffusion-v1-5/stable-diffusion-v1-5
library_name: diffusers
---

# 🌪️ Typhoon V2 (Stable Diffusion 1.5 Edition)

> _"Still SD1.5. Still cursed. But at least now it understands limbs."_

---

## 🧬 Overview

Typhoon V2 is the long-overdue upgrade to Typhoon V1, trained for SD1.5 using smarter techniques, cleaner data, and a few hard-earned lessons from the first version. The result? More control, better anatomy, stronger stylization, and fewer existential crises per batch.

It builds on the core identity of V1 (tag-based prompts, no trigger words, bold composition) but addresses its flaws head-on: warped limbs, prompt misfires, and the occasional brush with the uncanny.

V2 plays much nicer with short prompts, handles faces even better than before, and generally won't lose its mind when asked for basic body parts. Still no poetic-prose magic, though: this one *also* thinks in tags.

---

## Development Notes

Typhoon V2 was trained from scratch again, but this time with properly scaled datasets (no more 512×512-only crops) and aspect-ratio-aware augmentation. The architecture is still SD1.5, but the dataset strategy and training parameters got a much-needed overhaul.
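
The README does not say how the aspect-ratio-aware preparation was implemented. As a rough illustration only, one common approach is aspect-ratio bucketing: each training image is assigned to the bucket resolution whose aspect ratio is closest to its own, then resized and cropped to fit. The bucket list and the function name `nearest_bucket` below are assumptions made for this sketch, not the actual training configuration.

```python
import math

# Hypothetical bucket list as (width, height); NOT taken from the real training config.
BUCKETS = [
    (512, 512),
    (512, 768), (576, 832), (640, 896),   # portrait
    (768, 512), (832, 576), (896, 640),   # landscape
]


def nearest_bucket(width: int, height: int, buckets=BUCKETS) -> tuple[int, int]:
    """Return the bucket whose aspect ratio is closest to the image's.

    Ratios are compared in log space so that 2:3 and 3:2 are treated
    as equally far from square.
    """
    if width <= 0 or height <= 0:
        raise ValueError("image dimensions must be positive")
    target = math.log(width / height)
    return min(buckets, key=lambda b: abs(math.log(b[0] / b[1]) - target))


print(nearest_bucket(1000, 1500))  # (512, 768)
print(nearest_bucket(1920, 1080))  # (768, 512)
```

Grouping images this way avoids forcing every sample into a square crop, which is the problem the 512×512-only note above refers to.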

Training was done on rented A100s because, apparently, learning costs pain. Dataset prep was completely redone, with better captions, refined tag filtering, and entirely new sets focused on pose coverage, negative regularization, and structural accuracy.

Merging? None this time: this is a native checkpoint. No LoRA layering, no weight juggling. Just clean, consolidated training. (Still used my analysis tools, though, because mistakes are expensive.)

🛠️ Tools used during development:

- [LoRA Strength Analyser](https://github.com/Raxephion/loRA-Strength-Analyser)
- [LoRA Epoch Analyser](https://github.com/Raxephion/loRA-Epoch-Analyser)
- [TensorPeek](https://github.com/Raxephion/The-Vault/tree/main/TensorPeek), for inspecting `.safetensors` metadata
- [LoRA Distiller (WIP)](https://github.com/Raxephion/The-Vault), experimental, but useful in this pipeline
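
For readers curious what "inspecting `.safetensors` metadata" involves, here is a minimal standard-library sketch. It is not TensorPeek's code; it only relies on the published safetensors layout (an 8-byte little-endian header length followed by a JSON header, which may carry an optional `__metadata__` entry). The function name `read_safetensors_metadata` is made up for this example.

```python
import json
import struct


def read_safetensors_metadata(path: str) -> dict:
    """Return the optional `__metadata__` dict from a .safetensors file header.

    Only the header is read, so this stays cheap even for multi-gigabyte checkpoints.
    """
    with open(path, "rb") as f:
        # First 8 bytes: unsigned little-endian 64-bit length of the JSON header.
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len).decode("utf-8"))
    # Files saved without metadata simply lack the key.
    return header.get("__metadata__", {})
```

Calling it on a checkpoint path returns a plain dict of string keys and values, or an empty dict when the file carries no metadata.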

The base model was once again `v1-5-pruned-emaonly.safetensors`, but augmented via LoRA distillation and pre-conditioning to mitigate its quirks. All improvements are checkpoint-native; no merging required.

---

## 🖼️ Sample Images

All images were generated using the base Typhoon V2 checkpoint. No LoRAs, no inpainting, no face fixers: just raw inference with Hires Fix.

**Settings:**

- **Resolution**: 512×768, 576×832, or 640×896
- **Sampler**: DPM++ 2M Karras (Euler A also works fine)
- **CFG**: 6.5–7
- **Hires Fix**:
  - Denoising strength: 0.6–0.7
  - Upscaler: Latent
  - Upscale by: 2
- **VAE**: [sd-vae-ft-ema](https://huggingface.co/stabilityai/sd-vae-ft-ema)

⚠️ Legacy `.vae.pt` or `.vae.bin` files will likely cause washed-out or low-contrast results. Use the official VAE or none at all for correct output.
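
Since the model card lists `diffusers` as its library, the settings above might translate into something like the sketch below. Treat it as an assumption-laden starting point rather than the author's workflow: the checkpoint filename is hypothetical, a CUDA GPU is assumed, and only the base pass is covered (`StableDiffusionPipeline` has no built-in Hires Fix, so the second pass would need a separate img2img step).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SampleSettings:
    """Values copied from the settings list; defaults pick one point in each range."""
    width: int = 512
    height: int = 768
    guidance_scale: float = 7.0    # README range: 6.5-7
    hires_scale: int = 2           # "Upscale by: 2"
    hires_denoise: float = 0.6     # README range: 0.6-0.7
    vae_repo: str = "stabilityai/sd-vae-ft-ema"

    def hires_size(self) -> tuple[int, int]:
        """Output size after the Hires Fix upscale."""
        return self.width * self.hires_scale, self.height * self.hires_scale


def build_pipeline(checkpoint_path: str, settings: SampleSettings = SampleSettings()):
    """Load a single-file checkpoint with the VAE and sampler listed above."""
    # Imported lazily so SampleSettings can be used without torch/diffusers installed.
    import torch
    from diffusers import (
        AutoencoderKL,
        DPMSolverMultistepScheduler,
        StableDiffusionPipeline,
    )

    pipe = StableDiffusionPipeline.from_single_file(
        checkpoint_path, torch_dtype=torch.float16
    )
    pipe.vae = AutoencoderKL.from_pretrained(
        settings.vae_repo, torch_dtype=torch.float16
    )
    # "DPM++ 2M Karras" in diffusers terms: multistep DPM-Solver++ with Karras sigmas.
    pipe.scheduler = DPMSolverMultistepScheduler.from_config(
        pipe.scheduler.config, use_karras_sigmas=True
    )
    return pipe.to("cuda")  # assumes a CUDA device is available


settings = SampleSettings()
print(settings.hires_size())  # (1024, 1536)
```

A call would then look like `pipe = build_pipeline("typhoon-v2.safetensors")` (hypothetical filename) followed by `pipe("1girl, long hair, looking at viewer", width=settings.width, height=settings.height, guidance_scale=settings.guidance_scale).images[0]`.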

---

## Prompting Tips

- **Trigger Words**: None
- **Prompting Style**: Tag-based preferred (e.g. `1girl, long hair, looking at viewer`)
- **Natural Language**: Still not a fan; use structured tags for best results
- **ADetailer / Face Fixing**: Rarely needed; faces are stable out of the box
- **Recommended Resolutions**:
  - 512×768
  - 576×832
  - 640×896

  Narrow resolutions like 512×640 are prone to artifacts. Avoid if possible.
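
As a small illustration of the tag-based style and the resolution advice above, the helper below assembles a prompt from a list of tags and checks a size against the recommended list. The function names are invented for this sketch; nothing here is required by the model.

```python
# Resolutions copied from the "Recommended Resolutions" list, as (width, height).
RECOMMENDED_RESOLUTIONS = {(512, 768), (576, 832), (640, 896)}


def build_tag_prompt(tags: list[str]) -> str:
    """Join tags into a comma-separated prompt, trimming blanks and dropping duplicates."""
    seen: set[str] = set()
    cleaned: list[str] = []
    for tag in tags:
        tag = tag.strip()
        if tag and tag.lower() not in seen:
            seen.add(tag.lower())
            cleaned.append(tag)
    return ", ".join(cleaned)


def is_recommended_resolution(width: int, height: int) -> bool:
    """True if (width, height) is one of the resolutions the README recommends."""
    return (width, height) in RECOMMENDED_RESOLUTIONS


print(build_tag_prompt(["1girl", " long hair", "looking at viewer", "1girl"]))
# 1girl, long hair, looking at viewer
print(is_recommended_resolution(512, 640))  # False
```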

---

## ⚠️ Limitations

- **NSFW**: Still mildly neutered by the base model. Performance has improved over V1, but results are hit-or-miss.
- **Anatomy**: Much better than V1, but still SD1.5, so expect occasional hiccups
- **Natural Language Prompts**: Works better than V1, but short, tag-like prompts still yield the most consistent results

---

## License & Usage

- ✅ Personal use: Absolutely
- 🚫 **Do NOT** upload this model to generation websites or aggregators
- 🚫 **Do NOT** merge this model into other checkpoints

> Why? Typhoon V2 was trained cleanly and directly. Merging would break its stylistic balance, ruin its improvements, and waste the training effort. Please don't.

---

## 🔮 Future Work

- Further refinements possible via targeted LoRAs or partial retrains
- Potential distillation into an SDXL variant
- Ongoing prompt testing and edge-case analysis

---

**Enjoy the storm, again.**