Cool Japan Diffusion 2.1.2 Model Card
English version is here.
ใฏใใใซ
Cool Japan Diffusion ใฏStable Diffsionใใใกใคใณใใฅใผใใณใฐใใฆใใขใใกใใใณใฌใใฒใผใ ใชใฉใฎใฏใผใซใธใฃใใณใ่กจ็พใใใใจใซ็นๅใใใขใใซใงใใใชใใๅ ้ฃๅบใฎใฏใผใซใธใฃใใณๆฆ็ฅใจใฏ็นใซ้ขไฟใฏใใใพใใใ
ไฝฟใๆน
ๆ่ปฝใซๆฅฝใใฟใใๆนใฏใใใกใใฎSpaceใใไฝฟใใใ ใใใ ่ฉณใใๆฌใขใใซใฎๅใๆฑใๆนใฏใใกใใฎๅๆฑ่ชฌๆๆธใซใใใใฆใใพใใ ใขใใซใฏใใใใใใฆใณใญใผใใงใใพใใ
ใฉใคใปใณในใซใคใใฆ
ใฉใคใปใณในใซใคใใฆใฏใใใจใฎใฉใคใปใณใน CreativeML Open RAIL++-M License ใซไพๅคใ้คใๅ็จๅฉ็จ็ฆๆญขใ่ฟฝๅ ใใใ ใใงใใ ไพๅคใ้คใๅ็จๅฉ็จ็ฆๆญขใ่ฟฝๅ ใใ็็ฑใฏๅตไฝๆฅญ็ใซๆชๅฝฑ้ฟใๅใผใใใญใชใใจใใๆธๅฟตใใใงใใ ใใฎๆธๅฟตใๆๆญใใใใฐใๆฌกใฎใใผใธใงใณใใๅ ใฎใฉใคใปใณในใซๆปใใๅ็จๅฉ็จๅฏ่ฝใจใใพใใ ใกใชใฟใซใๅ ใฎใฉใคใปใณในใฎๆฅๆฌ่ช่จณใฏใใกใใซใชใใพใใ ๅถๅฉไผๆฅญใซใใๆนใฏๆณๅ้จใซใใไบบใจ็ธ่ซใใฆใใ ใใใ ่ถฃๅณใงๅฉ็จใใๆนใฏใใพใๆฐใซใใชใใฆใไธ่ฌๅธธ่ญใๅฎใใฐๅคงไธๅคซใชใฏใใงใใ ใชใใใฉใคใปใณในใซใใ้ใใใใฎใขใใซใๆน้ ใใฆใใใใฎใฉใคใปใณในใๅผใ็ถใๅฟ ่ฆใใใใพใใ
ๆณๅพใๅซ็ใซใคใใฆ
ๆฌใขใใซใฏๆฅๆฌใซใฆไฝๆใใใพใใใใใใใฃใฆใๆฅๆฌใฎๆณๅพใ้ฉ็จใใใพใใ ๆฌใขใใซใฎๅญฆ็ฟใฏใ่ไฝๆจฉๆณ็ฌฌ30ๆกใฎ4ใซๅบใฅใใๅๆณใงใใใจไธปๅผตใใพใใ ใพใใๆฌใขใใซใฎ้ ๅธใซใคใใฆใฏใ่ไฝๆจฉๆณใๅๆณ175ๆกใซ็ งใใใฆใฟใฆใใ ๆญฃ็ฏใๅนๅฉ็ฏใซใ่ฉฒๅฝใใชใใจไธปๅผตใใพใใ่ฉณใใใฏๆฟๆฒผๅผ่ญทๅฃซใฎ่ฆ่งฃใๅพก่ฆงใใ ใใใ ใใ ใใใฉใคใปใณในใซใใใ้ใใๆฌใขใใซใฎ็ๆ็ฉใฏๅ็จฎๆณไปคใซๅพใฃใฆๅใๆฑใฃใฆไธใใใ
ใใใใๆฌใขใใซใ้ ๅธใใ่ก็บใๅซ็็ใซ่ฏใใชใใจใฏไฝ่ ใฏๆใฃใฆใใพใใ ใใใฏๅญฆ็ฟใใ่ไฝ็ฉใซๅฏพใใฆ่ไฝ่ ใฎ่จฑๅฏใๅพใฆใใชใใใใงใใ ใใ ใใๅญฆ็ฟใใใซใฏ่ไฝ่ ใฎ่จฑๅฏใฏๆณๅพไธๅฟ ่ฆใใชใใๆค็ดขใจใณใธใณใจๅๆงๆณๅพไธใฏๅ้กใฏใใใพใใใ ใใใใฃใฆใๆณ็ใชๅด้ขใงใฏใชใใๅซ็็ใชๅด้ขใ่ชฟๆปใใ็ฎ็ใๆฌ้ ๅธใฏๅ ผใญใฆใใใจ่ใใฆใใ ใใใ
ไปฅไธใไธ่ฌ็ใชใขใใซใซใผใใฎๆฅๆฌ่ช่จณใงใใ
ใขใใซ่ฉณ็ดฐ
้็บ่ : Robin Rombach, Patrick Esser, Alfred Increment
ใขใใซใฟใคใ: ๆกๆฃใขใใซใใผในใฎ text-to-image ็ๆใขใใซ
่จ่ช: ๆฅๆฌ่ช
ใฉใคใปใณใน: CreativeML Open RAIL++-M-NC License
ใขใใซใฎ่ชฌๆ: ใใฎใขใใซใฏใใญใณใใใซๅฟใใฆ้ฉๅใช็ปๅใ็ๆใใใใจใใงใใพใใใขใซใดใชใบใ ใฏ Latent Diffusion Model ใจ OpenCLIP-ViT/H ใงใใ
่ฃ่ถณ:
ๅ่ๆ็ฎ:
@InProceedings{Rombach_2022_CVPR, author = {Rombach, Robin and Blattmann, Andreas and Lorenz, Dominik and Esser, Patrick and Ommer, Bj\"orn}, title = {High-Resolution Image Synthesis With Latent Diffusion Models}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)}, month = {June}, year = {2022}, pages = {10684-10695} }
ใขใใซใฎไฝฟ็จไพ
Stable Diffusion v2ใจๅใไฝฟใๆนใงใใ ใใใใใฎๆนๆณใใใใพใใใ๏ผใคใฎใใฟใผใณใๆไพใใพใใ
- Web UI
- Diffusers
Web UIใฎๅ ดๅ
ไปๅใใใฏxformersใใคใณในใใผใซใใใใจใใใใใใใพใใ ใใกใใฎๅๆฑ่ชฌๆๆธใซๅพใฃใฆไฝๆใใฆใใ ใใใ
Diffusersใฎๅ ดๅ
๐ค's Diffusers library ใไฝฟใฃใฆใใ ใใใ
ใพใใฏใไปฅไธใฎในใฏใชใใใๅฎ่กใใใฉใคใใฉใชใใใใฆใใ ใใใ
pip install --upgrade git+https://github.com/huggingface/diffusers.git transformers accelerate scipy
ๆฌกใฎในใฏใชใใใๅฎ่กใใ็ปๅใ็ๆใใฆใใ ใใใ
from diffusers import StableDiffusionPipeline, EulerAncestralDiscreteScheduler
import torch
model_id = "aipicasso/cool-japan-diffusion-2-1-2"
scheduler = EulerAncestralDiscreteScheduler.from_pretrained(model_id, subfolder="scheduler")
pipe = StableDiffusionPipeline.from_pretrained(model_id, scheduler=scheduler, torch_dtype=torch.float32)
pipe = pipe.to("cuda")
prompt = "anime, masterpiece, a portrait of a girl, good pupil, 4k, detailed"
negative_prompt="deformed, blurry, bad anatomy, bad pupil, disfigured, poorly drawn face, mutation, mutated, extra limb, ugly, poorly drawn hands, bad hands, fused fingers, messy drawing, broken legs censor, low quality, mutated hands and fingers, long body, mutation, poorly drawn, bad eyes, ui, error, missing fingers, fused fingers, one hand with more than 5 fingers, one hand with less than 5 fingers, one hand with more than 5 digit, one hand with less than 5 digit, extra digit, fewer digits, fused digit, missing digit, bad digit, liquid digit, long body, uncoordinated body, unnatural body, lowres, jpeg artifacts, 3d, cg, text, japanese kanji"
images = pipe(prompt,negative_prompt=negative_prompt, num_inference_steps=20).images
images[0].save("girl.png")
ๆณจๆ:
- xformers ใไฝฟใใจๆฉใใชใใใใใงใใ
- GPUใไฝฟใ้ใซGPUใฎใกใขใชใๅฐใชใไบบใฏ
pipe.enable_attention_slicing()ใไฝฟใฃใฆใใ ใใใ
ๆณๅฎใใใ็จ้
- ็ปๅ็ๆAIใซ้ขใใๅ ฑ้
- ๅ
ฌๅ
ฑๆพ้ใ ใใงใชใใๅถๅฉไผๆฅญใงใๅฏ่ฝ
- ็ปๅๅๆAIใซ้ขใใๆ ๅ ฑใใ็ฅใๆจฉๅฉใใฏๅตไฝๆฅญ็ใซๆชๅฝฑ้ฟใๅใผใใชใใจๅคๆญใใใใใงใใใพใใๅ ฑ้ใฎ่ช็ฑใชใฉใๅฐ้ใใพใใใ
- ๅ
ฌๅ
ฑๆพ้ใ ใใงใชใใๅถๅฉไผๆฅญใงใๅฏ่ฝ
- ใฏใผใซใธใฃใใณใฎ็ดนไป
- ไปๅฝใฎไบบใซใฏใผใซใธใฃใใณใจใฏใชใซใใ่ชฌๆใใใใจใ
- ไปๅฝใฎ็ๅญฆ็ใฏใฏใผใซใธใฃใใณใซๆนใใใฆๆฅๆฌใซๆฅใใใจใใใใใใใพใใใใใงใใฏใผใซใธใฃใใณใๆฅๆฌใงใฏใใฏใผใซใงใชใใใจใใใฆใใใใจใซใใฃใใใใใใใจใใจใฆใๅคใใจAlfred Incrementใฏๆใใฆใใใพใใไปๅฝใฎไบบใๆงใใ่ชๅฝใฎๆๅใใใฃใจ่ชใใซๆใฃใฆใใ ใใใ
- ไปๅฝใฎไบบใซใฏใผใซใธใฃใใณใจใฏใชใซใใ่ชฌๆใใใใจใ
- ็ ็ฉถ้็บ
- Discordไธใงใฎใขใใซใฎๅฉ็จ
- ใใญใณใใใจใณใธใใขใชใณใฐ
- ใใกใคใณใใฅใผใใณใฐ๏ผ่ฟฝๅ ๅญฆ็ฟใจใ๏ผ
- DreamBooth ใชใฉ
- ไปใฎใขใใซใจใฎใใผใธ
- Latent Diffusion Modelใจใฏใผใซใธใฃใใณใจใฎ็ธๆง
- ๆฌใขใใซใฎๆง่ฝใFIDใชใฉใง่ชฟในใใใจ
- ๆฌใขใใซใStable Diffusionไปฅๅคใฎใขใใซใจใฏ็ฌ็ซใงใใใใจใใใงใใฏใตใ ใใใใทใฅ้ขๆฐใชใฉใง่ชฟในใใใจ
- Discordไธใงใฎใขใใซใฎๅฉ็จ
- ๆ่ฒ
- ็พๅคง็ใๅฐ้ๅญฆๆ ก็ใฎๅๆฅญๅถไฝ
- ๅคงๅญฆ็ใฎๅๆฅญ่ซๆใ่ชฒ้กๅถไฝ
- ๅ ็ใ็ปๅ็ๆAIใฎ็พ็ถใไผใใใใจ
- ่ชๅทฑ่กจ็พ
- SNSไธใง่ชๅใฎๆๆ ใๆ่ใ่กจ็พใใใใจ
- Hugging Face ใฎ Community ใซใใใฆใใ็จ้
- ๆฅๆฌ่ชใ่ฑ่ชใง่ณชๅใใฆใใ ใใ
ๆณๅฎใใใชใ็จ้
- ็ฉไบใไบๅฎใจใใฆ่กจ็พใใใใใชใใจ
- ๅ็ๅใใใฆใใYouTubeใชใฉใฎใณใณใใณใใธใฎไฝฟ็จ
- ๅ็จใฎใตใผใในใจใใฆ็ดๆฅๆไพใใใใจ
- ๅ ็ใๅฐใใใใใใชใใจ
- ใใฎไปใๅตไฝๆฅญ็ใซๆชๅฝฑ้ฟใๅใผใใใจ
ไฝฟ็จใใฆใฏใใใชใ็จ้ใๆชๆใฎใใ็จ้
- ใใธใฟใซ่ดไฝ (Digital Forgery) ใฏๅ ฌ้ใใชใใงใใ ใใ๏ผ่ไฝๆจฉๆณใซ้ๅใใใใใ๏ผ
- ไปไบบใฎไฝๅใ็กๆญใงImage-to-Imageใใชใใงใใ ใใ๏ผ่ไฝๆจฉๆณใซ้ๅใใใใใ๏ผ
- ใใใใค็ฉใ้ ๅธใใชใใงใใ ใใ (ๅๆณ175ๆกใซ้ๅใใใใใ๏ผ
- ใใใใๆฅญ็ใฎใใใผใๅฎใใชใใใใชใใจ
- ไบๅฎใซๅบใฅใใชใใใจใไบๅฎใฎใใใซ่ชใใชใใใใซใใฆใใ ใใ๏ผๅจๅๆฅญๅๅฆจๅฎณ็ฝชใ้ฉ็จใใใใใใ๏ผ
- ใใงใคใฏใใฅใผใน
ใขใใซใฎ้็ใใใคใขใน
ใขใใซใฎ้็
- ใใใใใฃใฆใใชใ
ใใคใขใน
Stable Diffusionใจๅใใใคใขในใๆใใฃใฆใใพใใ ๆฐใใคใใฆใใ ใใใ
ๅญฆ็ฟ
ๅญฆ็ฟใใผใฟ
ๆฌกใฎใใผใฟใใขใใซใไธปใซไฝฟใฃใฆStable Diffusionใใใกใคใณใใฅใผใใณใฐใใฆใใพใใ
- VAEใซใคใใฆ
- DanbooruใDanbooru datasetใ้คใใๆฅๆฌใฎๅฝๅ ๆณใ้ตๅฎใใใใผใฟ: 65ไธ็จฎ้ก ๏ผใใผใฟๆกๅผตใซใใ็ก้ๆไฝๆ๏ผ
- U-Netใซใคใใฆ
- DanbooruใDanbooru datasetใ้คใใๆฅๆฌใฎๅฝๅ ๆณใ้ตๅฎใใใใผใฟ: 200ไธใใข
- ใใผใธใใใขใใซ: 3ใค
ๅญฆ็ฟใใญใปใน
Stable DiffusionใฎVAEใจU-Netใใใกใคใณใใฅใผใใณใฐใใพใใใ
- ใใผใใฆใงใข: A6000
- ใชใใใฃใใคใถใผ: AdamW
- Gradient Accumulations: 1
- ใใใใตใคใบ: 1
่ฉไพก็ตๆ
็ฐๅขใธใฎๅฝฑ้ฟ
ใปใจใใฉใใใพใใใ
- ใใผใใฆใงใขใฟใคใ: A6000
- ไฝฟ็จๆ้๏ผๅไฝใฏๆ้๏ผ: 200
- ใฏใฉใฆใไบๆฅญ่ : ใชใ
- ๅญฆ็ฟใใๅ ดๆ: ๆฅๆฌ
- ใซใผใใณๆๅบ้: ใใใชใซใชใ
ๅ่ๆ็ฎ
@InProceedings{Rombach_2022_CVPR,
author = {Rombach, Robin and Blattmann, Andreas and Lorenz, Dominik and Esser, Patrick and Ommer, Bj\"orn},
title = {High-Resolution Image Synthesis With Latent Diffusion Models},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2022},
pages = {10684-10695}
}
*ใใฎใขใใซใซใผใใฏ Stable Diffusion v2 ใซๅบใฅใใฆใAlfred Incrementใใใใพใใใ
- Downloads last month
- 52
