AbstractPhil commited on
Commit
5a5d4e0
·
verified ·
1 Parent(s): 1909c1e

geovocab v1: schnell_full_1_5e-5 — 1 epoch 50k synthetic-characters

Browse files
README.md ADDED
@@ -0,0 +1,81 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ library_name: sd15-flow-trainer
4
+ tags:
5
+ - geometric-deep-learning
6
+ - stable-diffusion
7
+ - ksimplex
8
+ - pentachoron
9
+ - flow-matching
10
+ - cross-attention-prior
11
+ base_model: sd-legacy/stable-diffusion-v1-5
12
+ pipeline_tag: text-to-image
13
+ ---
14
+
15
+ # KSimplex Geometric Attention Prior
16
+
17
+ Geometric cross-attention prior for SD1.5 using pentachoron (4-simplex) structures.
18
+
19
+ ## Architecture
20
+
21
+ | Component | Params |
22
+ |-----------|--------|
23
+ | SD1.5 UNet (frozen) | 859,520,964 |
24
+ | **Geo prior (trained)** | **4,845,725** |
25
+ | **Geo conditioner (trained)** | **1,613,847** |
26
+
27
+
28
+ ## Simplex Configuration
29
+
30
+ | Parameter | Value |
31
+ |-----------|-------|
32
+ | k (simplex dim) | 4 |
33
+ | Embedding dim | 32 |
34
+ | Feature dim | 768 |
35
+ | Stacked layers | 4 |
36
+ | Attention heads | 8 |
37
+ | Base deformation | 0.25 |
38
+ | Residual blend | learnable |
39
+ | Timestep conditioned | True |
40
+
41
+ ## GeoVocab Conditioning
42
+
43
+ | Parameter | Value |
44
+ |-----------|-------|
45
+ | Gate dim | 17 |
46
+ | Patch feat dim | 256 |
47
+ | Num patches | 64 |
48
+ | Cross-attention | enabled |
49
+ | Cross-attn heads | 8 |
50
+ | Blend mode | learnable |
51
+
52
+
53
+ ## Usage
54
+
55
+ ```python
56
+ from sd15_trainer_geo.pipeline import load_pipeline
57
+
58
+ pipe = load_pipeline(geo_repo_id="AbstractPhil/sd15-geovocab-lora-prototype")
59
+ ```
60
+
61
+ ## Training Info
62
+
63
+ - **dataset**: AbstractPhil/synthetic-characters (schnell_full_1_512)
64
+ - **subdir**: schnell_full_1_5e-5
65
+ - **samples**: 50000
66
+ - **epochs**: 1
67
+ - **steps**: 8333
68
+ - **shift**: 2.5
69
+ - **base_lr**: 5e-05
70
+ - **min_snr_gamma**: 5.0
71
+ - **cfg_dropout**: 0.1
72
+ - **batch_size**: 6
73
+ - **geo_loss_weight**: 0.01
74
+ - **geovocab_lr_mult**: 2.0
75
+ - **clip_vae**: AbstractPhil/geovae-proto/clip_vae/best_model.pt
76
+ - **patch_maker**: AbstractPhil/geovocab-patch-maker
77
+ - **loss_final**: 0.32467847257852556
78
+
79
+ ## License
80
+
81
+ MIT — [AbstractPhil](https://huggingface.co/AbstractPhil)
geo_conditioner.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5378fe04ae1d7fa45b417da8ecdc414f0dd61f483f851a51064a938df89b1e47
3
+ size 6457596
geo_prior.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f6077921ede4e807aa07698c82217e0ff1bbb9b9a518d939a6c3c533e6d4a518
3
+ size 19391076
geovocab_config.json ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "gate_dim": 17,
3
+ "patch_feat_dim": 256,
4
+ "num_patches": 64,
5
+ "clip_vae_dim": 768,
6
+ "clip_vae_bottleneck": 256,
7
+ "deform_hidden": 128,
8
+ "deform_num_layers": 4,
9
+ "cross_attn_enabled": true,
10
+ "cross_attn_heads": 8,
11
+ "cross_attn_dim": 768,
12
+ "geo_blend_mode": "learnable",
13
+ "geo_blend_init": 0.0
14
+ }
simplex_config.json ADDED
@@ -0,0 +1,16 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "k": 4,
3
+ "edim": 32,
4
+ "feat_dim": 768,
5
+ "num_layers": 4,
6
+ "base_deformation": 0.25,
7
+ "learnable_deformation": true,
8
+ "timestep_conditioned": true,
9
+ "num_heads": 8,
10
+ "dropout": 0.0,
11
+ "cm_loss_weight": 0.01,
12
+ "vol_consistency_weight": 0.005,
13
+ "residual_blend": "learnable",
14
+ "initial_blend": 0.0,
15
+ "_base_repo": "sd-legacy/stable-diffusion-v1-5"
16
+ }