suzushi
/

miso-diffusion-m-1.0

Model card Files Files and versions

miso-diffusion-m-1.0 / README.md

suzushi's picture

Update README.md

4bc8577 verified 11 months ago

|

history blame contribute delete

1.11 kB

	---
	language:
	- en
	library_name: diffusers
	pipeline_tag: text-to-image
	tags:
	- text-to-image
	base_model:
	- stabilityai/stable-diffusion-3.5-medium
	---


	<div style="display: flex; justify-content: center; gap: 20px; margin-bottom: 20px;">
	<img src="demo1.png" width="400" />
	<img src="demo2.png" width="400" />
	</div>
	# Anime SD3.5 medium Model

	An attempt to fine tune sd3.5 medium

	## Version History

	\| Version \| Base Training \| Aesthetic Training \| Total Epochs \|
	\|---------\|--------------\|-------------------\|--------------\|
	\| alpha \| 250K images \| 0 images \| 1 \|
	\| beta \| 160K images \| 0 images \| 3 \|
	\| 1.0 \| 600k images \| 0 images \| 2 + (3 from beta) \|

	## Training Methodology

	Training is done on gh200 with 96gb vram

	Training setting: Adafactor with a batchsize of 40, lr_scheduler: cosine

	SD3.5 Specific setting:

	enable_scaled_pos_embed = true

	pos_emb_random_crop_rate = 0.2

	weighting_scheme = "flow"

	learning_rate = 3e-6

	learning_rate_te1 = 2e-6

	learning_rate_te2 = 2e-6

	Train Clip: true, Train t5xxl: false