---
tags:
- text-to-image
- lora
- diffusers
- template:diffusion-lora
- yarn
- art
widget:
- src: images/1.jpg
  text: >-
    [photo content], transformed into a crochet plush doll, with visible yarn stitches, button eyes, and cozy handmade charm.
  prompt: >
    [photo content], transformed into a crochet plush doll, with visible yarn stitches, button eyes, and cozy handmade charm.
  output:
    url: images/2.webp
base_model: black-forest-labs/FLUX.1-Kontext-dev
instance_prompt: >-
  [photo content], transformed into a crochet plush doll, with visible yarn
  stitches, button eyes, and cozy handmade charm.
license: other
license_name: flux-1-dev-non-commercial-license
license_link: LICENSE.md
pipeline_tag: image-to-image
---

![1.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/E6fxB7-W_Bp3Vek3qblUL.png)

# **Yarn-Photo-i2i [Image-to-Image]**

<Gallery />

Yarn-Photo-i2i is an adapter for black-forest-labs' FLUX.1-Kontext-dev, designed to convert images into yarn-stitched artwork while preserving the original characteristics of the subject. The model was trained on 28 images (14 start images paired with 14 end images). The synthetic result images (the result sets of the dataset) were generated with NanoBanana from Google and SeedDream 4, and the images were labeled with DeepCaption-VLA-7B. The adapter is triggered with the following prompt:

> [!note]
> [photo content], transformed into a crochet plush doll, with visible yarn stitches, button eyes, and cozy handmade charm.

---

## Sample Inference

| ex1 | ex2 |
|------|-------|
| ![Left Screenshot](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/3k76am1XiJJl4POgDI0cv.png) | ![Right Screenshot](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/5vl4iufT4NRWVNdJINtDL.png) |

---

## Parameter Settings

| Setting                  | Value                    |
| ------------------------ | ------------------------ |
| Module Type              | Adapter                     |
| Base Model               | FLUX.1 Kontext Dev - fp8 |
| Trigger Words            | [photo content], transformed into a crochet plush doll, with visible yarn stitches, button eyes, and cozy handmade charm. |
| Image Processing Repeats | 50                       |
| Epochs                   | 22                       |
| Save Every N Epochs      | 1                        |

    Labeling: DeepCaption-VLA-7B (natural language & English)

    Total Images Used for Training: 28 Images (14 Start, 14 End)

    Synthetic Result Node generated by NanoBanana from Google (Image Result Sets Dataset)

## Training Parameters

| Setting                     | Value     |
| --------------------------- | --------- |
| Seed                        | -         |
| Clip Skip                   | -         |
| Text Encoder LR             | 0.00001   |
| UNet LR                     | 0.00005   |
| LR Scheduler                | constant  |
| Optimizer                   | AdamW8bit |
| Network Dimension           | 64        |
| Network Alpha               | 32        |
| Gradient Accumulation Steps | -         |
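
Many LoRA implementations scale the learned low-rank update by `alpha / dimension`. Assuming this adapter follows that convention (the card does not state it), the Network Dimension and Network Alpha values above work out as in the sketch below. The function name `lora_scale` is hypothetical and used only for illustration.

```python
def lora_scale(network_alpha: float, network_dimension: int) -> float:
    """Return the scaling factor alpha / dimension used by common LoRA setups.

    Assumption: the trainer applies the usual alpha / rank scaling;
    this is not confirmed by the model card.
    """
    if network_dimension <= 0:
        raise ValueError("network_dimension must be positive")
    return network_alpha / network_dimension


# Values taken from the Training Parameters table.
scale = lora_scale(network_alpha=32, network_dimension=64)
print(scale)  # 0.5
```

Under that assumption, the low-rank update is applied at half strength relative to a setup where alpha equals the dimension.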

## Label Parameters

| Setting         | Value |
| --------------- | ----- |
| Shuffle Caption | -     |
| Keep N Tokens   | -     |

## Advanced Parameters

| Setting                   | Value |
| ------------------------- | ----- |
| Noise Offset              | 0.03  |
| Multires Noise Discount   | 0.1   |
| Multires Noise Iterations | 10    |
| Conv Dimension            | -     |
| Conv Alpha                | -     |
| Batch Size                | -     |
| Steps   | 2900  |
| Sampler | euler |

---

## Trigger words

You should use the following phrases together, as a single prompt, to trigger the image generation: `[photo content]`, `transformed into a crochet plush doll`, `with visible yarn stitches`, `button eyes`, `and cozy handmade charm.`
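
The trigger phrases can be assembled into one prompt string, as in the minimal sketch below. It assumes that `[photo content]` may optionally be replaced by a short description of the input photo. The card does not say this explicitly, so the default keeps the literal placeholder. The names `TRIGGER_SUFFIX` and `build_prompt` are hypothetical.

```python
# Fixed part of the trigger prompt, copied from the model card.
TRIGGER_SUFFIX = (
    "transformed into a crochet plush doll, with visible yarn stitches, "
    "button eyes, and cozy handmade charm."
)


def build_prompt(photo_content: str = "[photo content]") -> str:
    """Join a subject description with the fixed trigger phrases.

    Assumption: substituting a description for "[photo content]" is allowed;
    by default the literal placeholder from the card is kept.
    """
    subject = photo_content.strip().rstrip(",")
    if not subject:
        raise ValueError("photo_content must not be empty")
    return f"{subject}, {TRIGGER_SUFFIX}"


print(build_prompt())
print(build_prompt("a corgi sitting on a sofa"))
```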


## Download model


[Download](/prithivMLmods/Yarn-Photo-i2i/tree/main) the model files in the Files & versions tab.
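
Once the files are available, the adapter can be tried with the `diffusers` library. The following is a minimal sketch, not an official recipe: it assumes a recent `diffusers` release that ships `FluxKontextPipeline`, a CUDA GPU with enough memory, and that the repository holds a single adapter weight file that `load_lora_weights` can find on its own (otherwise pass `weight_name=`). The function name `yarnify` and the step count and guidance scale are illustrative choices, not values from this card.

```python
BASE_MODEL = "black-forest-labs/FLUX.1-Kontext-dev"
ADAPTER_REPO = "prithivMLmods/Yarn-Photo-i2i"
TRIGGER_PROMPT = (
    "[photo content], transformed into a crochet plush doll, "
    "with visible yarn stitches, button eyes, and cozy handmade charm."
)


def yarnify(input_path, output_path, prompt=TRIGGER_PROMPT,
            steps=28, guidance_scale=2.5):
    """Run one image-to-image pass with the adapter and save the result.

    Heavy imports live inside the function so that importing this module
    does not require torch or diffusers to be installed.
    """
    import torch
    from diffusers import FluxKontextPipeline
    from diffusers.utils import load_image

    # Load the base model, then attach the adapter weights on top of it.
    pipe = FluxKontextPipeline.from_pretrained(
        BASE_MODEL, torch_dtype=torch.bfloat16
    )
    pipe.load_lora_weights(ADAPTER_REPO)  # assumes one weight file in the repo
    pipe.to("cuda")

    source = load_image(input_path)  # accepts a local path or a URL
    result = pipe(
        image=source,
        prompt=prompt,
        num_inference_steps=steps,      # illustrative value
        guidance_scale=guidance_scale,  # illustrative value
    ).images[0]
    result.save(output_path)
    return output_path


# Example call (downloads the models on first use):
# yarnify("images/1.jpg", "yarn_output.png")
```

The base model is gated under the license linked in the metadata above, so downloading it may require accepting the license and authenticating with Hugging Face first.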