yzhdvbQ commited on
Commit
1fda195
·
verified ·
1 Parent(s): 88551db

Upload LoRA model

Browse files
README.md ADDED
@@ -0,0 +1,108 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: openrail++
3
+ base_model: runwayml/stable-diffusion-v1-5
4
+ tags:
5
+ - stable-diffusion
6
+ - stable-diffusion-diffusers
7
+ - text-to-image
8
+ - diffusers
9
+ - lora
10
+ - cartoon
11
+ - illustration
12
+ datasets:
13
+ - juliaturc/captioned-cartoons
14
+ ---
15
+
16
+ # cartoon-lora-aggressive-high
17
+
18
+ Fine-tuned LoRA model for generating colorful cartoon-style illustrations.
19
+
20
+ ## Model Details
21
+
22
+ - **Base Model:** runwayml/stable-diffusion-v1-5
23
+ - **Dataset:** juliaturc/captioned-cartoons
24
+ - **Training Framework:** Diffusers + LoRA
25
+ - **Task:** Text-to-Image Generation (Cartoon Style)
26
+
27
+ ## Training Configuration
28
+
29
+ ```json
30
+ {
31
+ "learning_rate": 0.00015,
32
+ "max_train_steps": 1500,
33
+ "rank": 48,
34
+ "lr_scheduler": "linear"
35
+ }
36
+ ```
37
+
38
+ ## Usage
39
+
40
+ ### Install dependencies
41
+
42
+ ```bash
43
+ pip install diffusers transformers torch
44
+ ```
45
+
46
+ ### Load and generate
47
+
48
+ ```python
49
+ import torch
50
+ from diffusers import StableDiffusionPipeline
51
+
52
+ # Load base model
53
+ model_id = "runwayml/stable-diffusion-v1-5"
54
+ pipe = StableDiffusionPipeline.from_pretrained(
55
+ model_id,
56
+ torch_dtype=torch.float16
57
+ )
58
+ pipe = pipe.to("cuda")
59
+
60
+ # Load LoRA weights
61
+ pipe.load_lora_weights("yzhdvbQ/cartoon-lora-aggressive-high")
62
+
63
+ # Generate image
64
+ prompt = "Two people walking in a park with dogs, colorful cartoon style, woman in pink shirt, man in green vest, fall scenery"
65
+ image = pipe(prompt, num_inference_steps=30).images[0]
66
+ image.save("output.png")
67
+ ```
68
+
69
+ ## Example Prompts
70
+
71
+ 1. `Two people walking in a park with dogs, colorful cartoon style, woman in pink shirt, man in green vest, fall scenery`
72
+ 2. `A young woman sitting on a bench reading a book in a city park, wearing a yellow sweater and jeans, autumn trees, colorful cartoon illustration`
73
+ 3. `A happy family having a picnic in a meadow, parents and two children, red checkered blanket, colorful cartoon style`
74
+
75
+
76
+ ## Training Details
77
+
78
+ This model was fine-tuned on a cartoon illustration dataset with detailed captions describing:
79
+ - Characters and their appearance
80
+ - Clothing and accessories
81
+ - Actions and poses
82
+ - Environment and scenery
83
+ - Composition and style
84
+
85
+ The training used Low-Rank Adaptation (LoRA) to efficiently fine-tune Stable Diffusion while keeping the base model frozen.
86
+
87
+ ## Limitations
88
+
89
+ - Works best with detailed, descriptive prompts
90
+ - Optimized for cartoon/illustration style
91
+ - May not perform well on photorealistic requests
92
+ - Based on Stable Diffusion 1.5 architecture
93
+
94
+ ## Citation
95
+
96
+ ```bibtex
97
+ @misc{cartoon-lora-ai54,
98
+ author = {AI54 Course},
99
+ title = {Cartoon LoRA for Stable Diffusion},
100
+ year = {2026},
101
+ publisher = {HuggingFace},
102
+ url = {https://huggingface.co/yzhdvbQ/cartoon-lora-aggressive-high}
103
+ }
104
+ ```
105
+
106
+ ## License
107
+
108
+ This model inherits the license from Stable Diffusion (OpenRAIL++)
checkpoint-1125/model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b4478e5427043da2c1a2a7b8cb92ac66be992ce5ab14a74a742582f74feb4714
3
+ size 1757430584
checkpoint-1125/optimizer.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:21a31742ed9a1ada4b437941bf4c9e24776eed95524fe36fa9975adbc4d58dd0
3
+ size 19720037
checkpoint-1125/pytorch_lora_weights.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:09e45d8fc02ca94fe5bfb987d57741d86955e5d630c71e09c4739f76649a8d7f
3
+ size 38303072
checkpoint-1125/random_states_0.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f4a222d39384e2b6785c2dbaffc562a67d5e13edee36a192261094034db24442
3
+ size 14821
checkpoint-1125/scaler.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:700693d928978825d667fc68946ad86d86700bc0ea5eda3489a11638fcc8748f
3
+ size 1383
checkpoint-1125/scheduler.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:31269057f4b74aa6b112f395c42bd06e7a43eb12b8b6c99528a3bd66651509d3
3
+ size 1401
checkpoint-1500/model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5dd6e3e1fc7580fe6d63a41c85f9f467f0c5ab58f81fc86b98f4ecfac8266d62
3
+ size 1757430584
checkpoint-1500/optimizer.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7cf11bb512e66f372496807a9bac1fec0cff054f21f15fc0881445e18353fcef
3
+ size 19720037
checkpoint-1500/pytorch_lora_weights.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5a7cc13964c7e07431295655f23083f38e27f3a881cd852618cfd865257a8d47
3
+ size 38303072
checkpoint-1500/random_states_0.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1cd92147c6fc7756cd7f6b1edae9be2aa2f2360c5cd75a1543693a71fda7b0aa
3
+ size 14821
checkpoint-1500/scaler.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ca372268f4fa9335030c0cb7aedb6cdba75f457da50e7a4034abb1a2d0843689
3
+ size 1383
checkpoint-1500/scheduler.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:25f1241211f9d3d1f7d833c292a7657174f09344cc8981769d8ed11b85de0f5f
3
+ size 1401
pytorch_lora_weights.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5a7cc13964c7e07431295655f23083f38e27f3a881cd852618cfd865257a8d47
3
+ size 38303072