yzhdvbQ commited on
Commit
8901dd8
·
verified ·
1 Parent(s): 86f279e

Upload LoRA model

Browse files
README.md ADDED
@@ -0,0 +1,108 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: openrail++
3
+ base_model: runwayml/stable-diffusion-v1-5
4
+ tags:
5
+ - stable-diffusion
6
+ - stable-diffusion-diffusers
7
+ - text-to-image
8
+ - diffusers
9
+ - lora
10
+ - cartoon
11
+ - illustration
12
+ datasets:
13
+ - juliaturc/captioned-cartoons
14
+ ---
15
+
16
+ # cartoon-lora-aggressive-medium
17
+
18
+ Fine-tuned LoRA model for generating colorful cartoon-style illustrations.
19
+
20
+ ## Model Details
21
+
22
+ - **Base Model:** runwayml/stable-diffusion-v1-5
23
+ - **Dataset:** juliaturc/captioned-cartoons
24
+ - **Training Framework:** Diffusers + LoRA
25
+ - **Task:** Text-to-Image Generation (Cartoon Style)
26
+
27
+ ## Training Configuration
28
+
29
+ ```json
30
+ {
31
+ "learning_rate": 0.0001,
32
+ "max_train_steps": 1500,
33
+ "rank": 48,
34
+ "lr_scheduler": "cosine"
35
+ }
36
+ ```
37
+
38
+ ## Usage
39
+
40
+ ### Install dependencies
41
+
42
+ ```bash
43
+ pip install diffusers transformers torch
44
+ ```
45
+
46
+ ### Load and generate
47
+
48
+ ```python
49
+ import torch
50
+ from diffusers import StableDiffusionPipeline
51
+
52
+ # Load base model
53
+ model_id = "runwayml/stable-diffusion-v1-5"
54
+ pipe = StableDiffusionPipeline.from_pretrained(
55
+ model_id,
56
+ torch_dtype=torch.float16
57
+ )
58
+ pipe = pipe.to("cuda")
59
+
60
+ # Load LoRA weights
61
+ pipe.load_lora_weights("yzhdvbQ/cartoon-lora-aggressive-medium")
62
+
63
+ # Generate image
64
+ prompt = "Two people walking in a park with dogs, colorful cartoon style, woman in pink shirt, man in green vest, fall scenery"
65
+ image = pipe(prompt, num_inference_steps=30).images[0]
66
+ image.save("output.png")
67
+ ```
68
+
69
+ ## Example Prompts
70
+
71
+ 1. `Two people walking in a park with dogs, colorful cartoon style, woman in pink shirt, man in green vest, fall scenery`
72
+ 2. `A young woman sitting on a bench reading a book in a city park, wearing a yellow sweater and jeans, autumn trees, colorful cartoon illustration`
73
+ 3. `A happy family having a picnic in a meadow, parents and two children, red checkered blanket, colorful cartoon style`
74
+
75
+
76
+ ## Training Details
77
+
78
+ This model was fine-tuned on a cartoon illustration dataset with detailed captions describing:
79
+ - Characters and their appearance
80
+ - Clothing and accessories
81
+ - Actions and poses
82
+ - Environment and scenery
83
+ - Composition and style
84
+
85
+ The training used Low-Rank Adaptation (LoRA) to efficiently fine-tune Stable Diffusion while keeping the base model frozen.
86
+
87
+ ## Limitations
88
+
89
+ - Works best with detailed, descriptive prompts
90
+ - Optimized for cartoon/illustration style
91
+ - May not perform well on photorealistic requests
92
+ - Based on Stable Diffusion 1.5 architecture
93
+
94
+ ## Citation
95
+
96
+ ```bibtex
97
+ @misc{cartoon-lora-ai54,
98
+ author = {AI54 Course},
99
+ title = {Cartoon LoRA for Stable Diffusion},
100
+ year = {2026},
101
+ publisher = {HuggingFace},
102
+ url = {https://huggingface.co/yzhdvbQ/cartoon-lora-aggressive-medium}
103
+ }
104
+ ```
105
+
106
+ ## License
107
+
108
+ This model inherits the license from Stable Diffusion (OpenRAIL++)
checkpoint-1125/model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7c74599da6822ae840d2a3bbb275f9adb5a64c0ef8992f44a08664501ace1fe6
3
+ size 1757430584
checkpoint-1125/optimizer.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:43cd532895420f4c46705dccf97ae6a3e066fdec6b11d8b2ecca67e914f83a61
3
+ size 19720037
checkpoint-1125/pytorch_lora_weights.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a5b5140e57ba6f973eb51b7e4ad95ff5951ba9b2d7f4880838f7826d1c074b7a
3
+ size 38303072
checkpoint-1125/random_states_0.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f4a222d39384e2b6785c2dbaffc562a67d5e13edee36a192261094034db24442
3
+ size 14821
checkpoint-1125/scaler.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:700693d928978825d667fc68946ad86d86700bc0ea5eda3489a11638fcc8748f
3
+ size 1383
checkpoint-1125/scheduler.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:300f83cc8e26aa90f4633ffd51acf702e0cac49ff264a7642168f40eebc5da66
3
+ size 1401
checkpoint-1500/model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bcd0f7cbcf1c65c4126fa3444307aabb2b3aba03f7b3ce9bdf6e8e3dfa58330e
3
+ size 1757430584
checkpoint-1500/optimizer.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f8e3f854c1b6ed08011561ee7360096a1221d4dd450cd790335631d5eb6dcfeb
3
+ size 19720037
checkpoint-1500/pytorch_lora_weights.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4e1be82476aba093f2598ba91cbc6f1102f12f874e08fc57b446279900504761
3
+ size 38303072
checkpoint-1500/random_states_0.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1cd92147c6fc7756cd7f6b1edae9be2aa2f2360c5cd75a1543693a71fda7b0aa
3
+ size 14821
checkpoint-1500/scaler.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ca372268f4fa9335030c0cb7aedb6cdba75f457da50e7a4034abb1a2d0843689
3
+ size 1383
checkpoint-1500/scheduler.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:91bd43bc92cb54476a18846bc0624215bebb382070577d998ca3c3869138c573
3
+ size 1401
pytorch_lora_weights.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4e1be82476aba093f2598ba91cbc6f1102f12f874e08fc57b446279900504761
3
+ size 38303072