alfredplpl
/

flux.1-dev-modern-anime-fp8-diffusers

Diffusers

Safetensors

Model card Files Files and versions

xet

Community

alfredplpl commited on Aug 17, 2024

Commit

63052cc

verified ·

1 Parent(s): 3e4d64a

Update README.md

Browse files

Files changed (1) hide show

README.md +14 -4

README.md CHANGED Viewed

@@ -10,7 +10,9 @@ license_link: https://huggingface.co/black-forest-labs/FLUX.1-dev/blob/main/LICE
 ![eyecatch](eyecatch.jpg)
 FLUX.1 dev Modern Anime FP8 With Quanto is an anime model with 8-bit float by Quanto library.
-We can load this anime model < 15GB VRAM with RTX 4090, maybe.
 # Usage
 - diffusers
@@ -31,6 +33,8 @@ from optimum.quanto import QuantizedDiffusersModel, QuantizedTransformersModel
 from transformers import T5EncoderModel
 from huggingface_hub import snapshot_download
 snapshot_download(repo_id="alfredplpl/flux.1-dev-modern-anime-fp8",local_dir="./anime_fp8")
 class QuantizedT5EncoderModel(QuantizedTransformersModel):
@@ -45,10 +49,16 @@ pipe = FluxPipeline.from_pretrained("black-forest-labs/FLUX.1-dev",
                                     text_encoder_2=None,
                                     torch_dtype=torch.bfloat16)
-pipe.transformer=QuantizedFlux2DModel.from_pretrained("./anime_fp8/transformer")
-pipe.text_encoder_2=QuantizedT5EncoderModel.from_pretrained("./anime_fp8/text_encoder_2")
 pipe.vae=pipe.vae.to(torch.float32)
-pipe.enable_model_cpu_offload()
 prompt = "modern anime style, A close-up portrait of a young girl with green hair. Her hair is vibrant and shoulder-length, framing her face softly. She has large, expressive eyes that are slightly tilted upward, with a gentle and calm expression. Her facial features are delicate, with a small nose and soft lips. The background is simple, focusing attention on her face, with soft lighting that highlights her features. The overall style of the illustration is warm and inviting, with a soft color palette and a slightly dreamy atmosphere."
 image = pipe(

 ![eyecatch](eyecatch.jpg)
 FLUX.1 dev Modern Anime FP8 With Quanto is an anime model with 8-bit float by Quanto library.
+We can load this anime model < 15GB VRAM if enable_model_cpu_offload is True.
+otherwise, we can load this anime model < 20GB VRAM.
+We can run this model on RTX 4090 or NVIDIA L4.
 # Usage
 - diffusers
 from transformers import T5EncoderModel
 from huggingface_hub import snapshot_download
+enable_model_cpu_offload=True
 snapshot_download(repo_id="alfredplpl/flux.1-dev-modern-anime-fp8",local_dir="./anime_fp8")
 class QuantizedT5EncoderModel(QuantizedTransformersModel):
                                     text_encoder_2=None,
                                     torch_dtype=torch.bfloat16)
+pipe.transformer=QuantizedFlux2DModel.from_pretrained("./anime_fp8/transformer")._wrapped
+pipe.text_encoder_2=QuantizedT5EncoderModel.from_pretrained("./anime_fp8/text_encoder_2")._wrapped
 pipe.vae=pipe.vae.to(torch.float32)
+# Option
+if(enable_model_cpu_offload):
+    pipe.enable_model_cpu_offload()
+else:
+    pipe.text_encoder_2=pipe.text_encoder_2.to("cuda:1")
+    pipe.transformer=pipe.transformer.to("cuda:1")
+    pipe=pipe.to("cuda:1")
 prompt = "modern anime style, A close-up portrait of a young girl with green hair. Her hair is vibrant and shoulder-length, framing her face softly. She has large, expressive eyes that are slightly tilted upward, with a gentle and calm expression. Her facial features are delicate, with a small nose and soft lips. The background is simple, focusing attention on her face, with soft lighting that highlights her features. The overall style of the illustration is warm and inviting, with a soft color palette and a slightly dreamy atmosphere."
 image = pipe(