File size: 9,012 Bytes
094c5fd
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
======================================================================
FLOW MATCHING + CONSTELLATION RELAY REGULATOR
  Dataset: CIFAR-10
  Base channels: 64
  Relay: True
  Flow matching: ODE (conditional)
  Sampler: Euler, 50 steps
  Device: cuda
======================================================================
  Train: 50,000 images
  Total params: 6,746,403
  Relay params: 76,384 (1.1%)
  Relay modules: 2

======================================================================
TRAINING β€” 50 epochs
======================================================================
E  1/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:11<00:00, 34.46b/s, loss=0.3728, lr=3.0e-04]
  E  1: loss=0.3695 lr=3.0e-04 (11s) β˜…
  β†’ Saved samples/epoch_001.png
E  2/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.22b/s, loss=0.2382, lr=3.0e-04]
  E  2: loss=0.2379 lr=3.0e-04 (11s) β˜…
E  3/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.74b/s, loss=0.2233, lr=3.0e-04]
  E  3: loss=0.2230 lr=3.0e-04 (11s) β˜…
E  4/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.82b/s, loss=0.2147, lr=3.0e-04]
  E  4: loss=0.2145 lr=3.0e-04 (11s) β˜…
E  5/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 37.00b/s, loss=0.2094, lr=2.9e-04]
  E  5: loss=0.2093 lr=2.9e-04 (11s) β˜…
  β†’ Saved samples/epoch_005.png
E  6/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.99b/s, loss=0.2050, lr=2.9e-04]
  E  6: loss=0.2049 lr=2.9e-04 (11s) β˜…
E  7/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.87b/s, loss=0.2010, lr=2.9e-04]
  E  7: loss=0.2009 lr=2.9e-04 (11s) β˜…
E  8/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.73b/s, loss=0.1984, lr=2.8e-04]
  E  8: loss=0.1983 lr=2.8e-04 (11s) β˜…
E  9/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.68b/s, loss=0.1966, lr=2.8e-04]
  E  9: loss=0.1967 lr=2.8e-04 (11s) β˜…
E 10/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.75b/s, loss=0.1950, lr=2.7e-04]
  E 10: loss=0.1951 lr=2.7e-04 (11s) β˜…
  β†’ Saved samples/epoch_010.png
  Relay diagnostics:
    mid_block1.relay: drift=0.0382 rad (2.2Β°) gate=0.0519
    mid_block2.relay: drift=0.0548 rad (3.1Β°) gate=0.0548
E 11/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.42b/s, loss=0.1947, lr=2.7e-04]
  E 11: loss=0.1946 lr=2.7e-04 (11s) β˜…
E 12/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.88b/s, loss=0.1923, lr=2.6e-04]
  E 12: loss=0.1923 lr=2.6e-04 (11s) β˜…
E 13/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.86b/s, loss=0.1910, lr=2.5e-04]
  E 13: loss=0.1909 lr=2.5e-04 (11s) β˜…
E 14/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.50b/s, loss=0.1907, lr=2.5e-04]
  E 14: loss=0.1907 lr=2.5e-04 (11s) β˜…
E 15/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.65b/s, loss=0.1901, lr=2.4e-04]
  E 15: loss=0.1901 lr=2.4e-04 (11s) β˜…
  β†’ Saved samples/epoch_015.png
E 16/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.56b/s, loss=0.1894, lr=2.3e-04]
  E 16: loss=0.1893 lr=2.3e-04 (11s) β˜…
E 17/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.26b/s, loss=0.1881, lr=2.2e-04]
  E 17: loss=0.1880 lr=2.2e-04 (11s) β˜…
E 18/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.85b/s, loss=0.1883, lr=2.1e-04]
  E 18: loss=0.1883 lr=2.1e-04 (11s)
E 19/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.58b/s, loss=0.1875, lr=2.1e-04]
  E 19: loss=0.1874 lr=2.1e-04 (11s) β˜…
E 20/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.81b/s, loss=0.1869, lr=2.0e-04]
  E 20: loss=0.1870 lr=2.0e-04 (11s) β˜…
  β†’ Saved samples/epoch_020.png
  Relay diagnostics:
    mid_block1.relay: drift=0.0703 rad (4.0Β°) gate=0.0561
    mid_block2.relay: drift=0.0938 rad (5.4Β°) gate=0.0618
E 21/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.78b/s, loss=0.1853, lr=1.9e-04]
  E 21: loss=0.1853 lr=1.9e-04 (11s) β˜…
E 22/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.75b/s, loss=0.1864, lr=1.8e-04]
  E 22: loss=0.1864 lr=1.8e-04 (11s)
E 23/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.73b/s, loss=0.1851, lr=1.7e-04]
  E 23: loss=0.1851 lr=1.7e-04 (11s) β˜…
E 24/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.72b/s, loss=0.1849, lr=1.6e-04]
  E 24: loss=0.1849 lr=1.6e-04 (11s) β˜…
E 25/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.78b/s, loss=0.1850, lr=1.5e-04]
  E 25: loss=0.1849 lr=1.5e-04 (11s) β˜…
  β†’ Saved samples/epoch_025.png
E 26/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.78b/s, loss=0.1851, lr=1.4e-04]
  E 26: loss=0.1848 lr=1.4e-04 (11s) β˜…
E 27/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.85b/s, loss=0.1835, lr=1.3e-04]
  E 27: loss=0.1833 lr=1.3e-04 (11s) β˜…
E 28/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.86b/s, loss=0.1840, lr=1.2e-04]
  E 28: loss=0.1839 lr=1.2e-04 (11s)
E 29/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.88b/s, loss=0.1837, lr=1.1e-04]
  E 29: loss=0.1837 lr=1.1e-04 (11s)
E 30/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.47b/s, loss=0.1823, lr=1.0e-04]
  E 30: loss=0.1822 lr=1.0e-04 (11s) β˜…
  β†’ Saved samples/epoch_030.png
  Relay diagnostics:
    mid_block1.relay: drift=0.0918 rad (5.3Β°) gate=0.0586
    mid_block2.relay: drift=0.1132 rad (6.5Β°) gate=0.0649
E 31/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.72b/s, loss=0.1823, lr=9.6e-05]
  E 31: loss=0.1823 lr=9.5e-05 (11s)
E 32/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.64b/s, loss=0.1823, lr=8.7e-05]
  E 32: loss=0.1823 lr=8.7e-05 (11s)
E 33/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.85b/s, loss=0.1816, lr=7.9e-05]
  E 33: loss=0.1816 lr=7.8e-05 (11s) β˜…
E 34/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.73b/s, loss=0.1809, lr=7.1e-05]
  E 34: loss=0.1809 lr=7.0e-05 (11s) β˜…
E 35/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.80b/s, loss=0.1810, lr=6.3e-05]
  E 35: loss=0.1810 lr=6.3e-05 (11s)
  β†’ Saved samples/epoch_035.png
E 36/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.77b/s, loss=0.1819, lr=5.5e-05]
  E 36: loss=0.1819 lr=5.5e-05 (11s)
E 37/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.79b/s, loss=0.1812, lr=4.8e-05]
  E 37: loss=0.1813 lr=4.8e-05 (11s)
E 38/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.86b/s, loss=0.1808, lr=4.2e-05]
  E 38: loss=0.1808 lr=4.2e-05 (11s) β˜…
E 39/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.73b/s, loss=0.1815, lr=3.5e-05]
  E 39: loss=0.1814 lr=3.5e-05 (11s)
E 40/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.71b/s, loss=0.1800, lr=3.0e-05]
  E 40: loss=0.1800 lr=3.0e-05 (11s) β˜…
  β†’ Saved samples/epoch_040.png
  Relay diagnostics:
    mid_block1.relay: drift=0.0964 rad (5.5Β°) gate=0.0593
    mid_block2.relay: drift=0.1163 rad (6.7Β°) gate=0.0657
E 41/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.53b/s, loss=0.1803, lr=2.4e-05]
  E 41: loss=0.1803 lr=2.4e-05 (11s)
E 42/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.30b/s, loss=0.1801, lr=2.0e-05]
  E 42: loss=0.1801 lr=1.9e-05 (11s)
E 43/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.80b/s, loss=0.1800, lr=1.5e-05]
  E 43: loss=0.1799 lr=1.5e-05 (11s) β˜…
E 44/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.84b/s, loss=0.1801, lr=1.2e-05]
  E 44: loss=0.1799 lr=1.1e-05 (11s)
E 45/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.48b/s, loss=0.1800, lr=8.4e-06]
  E 45: loss=0.1799 lr=8.3e-06 (11s) β˜…
  β†’ Saved samples/epoch_045.png
E 46/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.62b/s, loss=0.1803, lr=5.8e-06]
  E 46: loss=0.1805 lr=5.7e-06 (11s)
E 47/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.73b/s, loss=0.1803, lr=3.7e-06]
  E 47: loss=0.1803 lr=3.6e-06 (11s)
E 48/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.84b/s, loss=0.1791, lr=2.2e-06]
  E 48: loss=0.1793 lr=2.2e-06 (11s) β˜…
E 49/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.45b/s, loss=0.1796, lr=1.3e-06]
  E 49: loss=0.1796 lr=1.3e-06 (11s)
E 50/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.74b/s, loss=0.1797, lr=1.0e-06]
  E 50: loss=0.1797 lr=1.0e-06 (11s)
  β†’ Saved samples/epoch_050.png
  Relay diagnostics:
    mid_block1.relay: drift=0.0968 rad (5.5Β°) gate=0.0594
    mid_block2.relay: drift=0.1164 rad (6.7Β°) gate=0.0658

======================================================================
DONE β€” Best loss: 0.1793
  Params: 6,746,403 (relay: 76,384)
  Samples in: samples/
======================================================================