======================================================================
FLOW MATCHING + CONSTELLATION RELAY REGULATOR
Dataset: CIFAR-10
Base channels: 64
Relay: True
Flow matching: ODE (conditional)
Sampler: Euler, 50 steps
Device: cuda
======================================================================
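The header names a 50-step Euler sampler for the flow-matching ODE. A minimal sketch of fixed-step Euler integration follows. The function `euler_sample`, the toy velocity field, and the time convention (t=0 is noise, t=1 is data) are all assumptions for illustration; the log does not show the actual sampler code or the trained network.

```python
from typing import Callable, List

Vector = List[float]


def euler_sample(velocity: Callable[[Vector, float], Vector],
                 x0: Vector, num_steps: int = 50) -> Vector:
    """Integrate dx/dt = velocity(x, t) from t=0 to t=1 with fixed-step Euler."""
    x = list(x0)
    dt = 1.0 / num_steps
    for i in range(num_steps):
        t = i * dt
        v = velocity(x, t)
        # One explicit Euler update: x <- x + dt * v(x, t)
        x = [xi + dt * vi for xi, vi in zip(x, v)]
    return x


# Hypothetical stand-in for the trained network: a constant velocity that
# carries the noise sample to a fixed target along a straight line.
noise = [0.5, -1.0, 2.0]
target = [1.0, 0.0, -1.0]


def toy_velocity(x: Vector, t: float) -> Vector:
    return [b - a for a, b in zip(noise, target)]


sample = euler_sample(toy_velocity, noise, num_steps=50)
```

With a constant velocity the Euler steps land on the target up to floating-point error. A learned velocity field is generally not constant, so the step count then affects sample quality.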
Train: 50,000 images
Total params: 6,746,403
Relay params: 76,384 (1.1%)
Relay modules: 2
======================================================================
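The counts above can be cross-checked with a little arithmetic. The batch size of 128 is an assumption: the log does not print it, but it is consistent with 390 batches per epoch over 50,000 images if the last partial batch is dropped.

```python
total_params = 6_746_403
relay_params = 76_384
train_images = 50_000
batches_per_epoch = 390   # from the progress bars in the log
batch_size = 128          # assumption: not printed in the log

# Share of parameters that belong to the relay modules, in percent.
relay_share = 100.0 * relay_params / total_params

# Number of full batches per epoch when the last partial batch is dropped.
full_batches = train_images // batch_size

print(f"relay share: {relay_share:.1f}%")
print(f"full batches per epoch: {full_batches}")
```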
TRAINING - 50 epochs
======================================================================
E 1/50: 100%|██████████| 390/390 [00:11<00:00, 34.46b/s, loss=0.3728, lr=3.0e-04]
E 1: loss=0.3695 lr=3.0e-04 (11s) *
-> Saved samples/epoch_001.png
E 2/50: 100%|██████████| 390/390 [00:10<00:00, 36.22b/s, loss=0.2382, lr=3.0e-04]
E 2: loss=0.2379 lr=3.0e-04 (11s) *
E 3/50: 100%|██████████| 390/390 [00:10<00:00, 36.74b/s, loss=0.2233, lr=3.0e-04]
E 3: loss=0.2230 lr=3.0e-04 (11s) *
E 4/50: 100%|██████████| 390/390 [00:10<00:00, 36.82b/s, loss=0.2147, lr=3.0e-04]
E 4: loss=0.2145 lr=3.0e-04 (11s) *
E 5/50: 100%|██████████| 390/390 [00:10<00:00, 37.00b/s, loss=0.2094, lr=2.9e-04]
E 5: loss=0.2093 lr=2.9e-04 (11s) *
-> Saved samples/epoch_005.png
E 6/50: 100%|██████████| 390/390 [00:10<00:00, 36.99b/s, loss=0.2050, lr=2.9e-04]
E 6: loss=0.2049 lr=2.9e-04 (11s) *
E 7/50: 100%|██████████| 390/390 [00:10<00:00, 36.87b/s, loss=0.2010, lr=2.9e-04]
E 7: loss=0.2009 lr=2.9e-04 (11s) *
E 8/50: 100%|██████████| 390/390 [00:10<00:00, 36.73b/s, loss=0.1984, lr=2.8e-04]
E 8: loss=0.1983 lr=2.8e-04 (11s) *
E 9/50: 100%|██████████| 390/390 [00:10<00:00, 36.68b/s, loss=0.1966, lr=2.8e-04]
E 9: loss=0.1967 lr=2.8e-04 (11s) *
E 10/50: 100%|██████████| 390/390 [00:10<00:00, 36.75b/s, loss=0.1950, lr=2.7e-04]
E 10: loss=0.1951 lr=2.7e-04 (11s) *
-> Saved samples/epoch_010.png
Relay diagnostics:
mid_block1.relay: drift=0.0382 rad (2.2°) gate=0.0519
mid_block2.relay: drift=0.0548 rad (3.1°) gate=0.0548
E 11/50: 100%|██████████| 390/390 [00:10<00:00, 36.42b/s, loss=0.1947, lr=2.7e-04]
E 11: loss=0.1946 lr=2.7e-04 (11s) *
E 12/50: 100%|██████████| 390/390 [00:10<00:00, 36.88b/s, loss=0.1923, lr=2.6e-04]
E 12: loss=0.1923 lr=2.6e-04 (11s) *
E 13/50: 100%|██████████| 390/390 [00:10<00:00, 36.86b/s, loss=0.1910, lr=2.5e-04]
E 13: loss=0.1909 lr=2.5e-04 (11s) *
E 14/50: 100%|██████████| 390/390 [00:10<00:00, 36.50b/s, loss=0.1907, lr=2.5e-04]
E 14: loss=0.1907 lr=2.5e-04 (11s) *
E 15/50: 100%|██████████| 390/390 [00:10<00:00, 36.65b/s, loss=0.1901, lr=2.4e-04]
E 15: loss=0.1901 lr=2.4e-04 (11s) *
-> Saved samples/epoch_015.png
E 16/50: 100%|██████████| 390/390 [00:10<00:00, 36.56b/s, loss=0.1894, lr=2.3e-04]
E 16: loss=0.1893 lr=2.3e-04 (11s) *
E 17/50: 100%|██████████| 390/390 [00:10<00:00, 36.26b/s, loss=0.1881, lr=2.2e-04]
E 17: loss=0.1880 lr=2.2e-04 (11s) *
E 18/50: 100%|██████████| 390/390 [00:10<00:00, 36.85b/s, loss=0.1883, lr=2.1e-04]
E 18: loss=0.1883 lr=2.1e-04 (11s)
E 19/50: 100%|██████████| 390/390 [00:10<00:00, 36.58b/s, loss=0.1875, lr=2.1e-04]
E 19: loss=0.1874 lr=2.1e-04 (11s) *
E 20/50: 100%|██████████| 390/390 [00:10<00:00, 36.81b/s, loss=0.1869, lr=2.0e-04]
E 20: loss=0.1870 lr=2.0e-04 (11s) *
-> Saved samples/epoch_020.png
Relay diagnostics:
mid_block1.relay: drift=0.0703 rad (4.0°) gate=0.0561
mid_block2.relay: drift=0.0938 rad (5.4°) gate=0.0618
E 21/50: 100%|██████████| 390/390 [00:10<00:00, 36.78b/s, loss=0.1853, lr=1.9e-04]
E 21: loss=0.1853 lr=1.9e-04 (11s) *
E 22/50: 100%|██████████| 390/390 [00:10<00:00, 36.75b/s, loss=0.1864, lr=1.8e-04]
E 22: loss=0.1864 lr=1.8e-04 (11s)
E 23/50: 100%|██████████| 390/390 [00:10<00:00, 36.73b/s, loss=0.1851, lr=1.7e-04]
E 23: loss=0.1851 lr=1.7e-04 (11s) *
E 24/50: 100%|██████████| 390/390 [00:10<00:00, 36.72b/s, loss=0.1849, lr=1.6e-04]
E 24: loss=0.1849 lr=1.6e-04 (11s) *
E 25/50: 100%|██████████| 390/390 [00:10<00:00, 36.78b/s, loss=0.1850, lr=1.5e-04]
E 25: loss=0.1849 lr=1.5e-04 (11s) *
-> Saved samples/epoch_025.png
E 26/50: 100%|██████████| 390/390 [00:10<00:00, 36.78b/s, loss=0.1851, lr=1.4e-04]
E 26: loss=0.1848 lr=1.4e-04 (11s) *
E 27/50: 100%|██████████| 390/390 [00:10<00:00, 36.85b/s, loss=0.1835, lr=1.3e-04]
E 27: loss=0.1833 lr=1.3e-04 (11s) *
E 28/50: 100%|██████████| 390/390 [00:10<00:00, 36.86b/s, loss=0.1840, lr=1.2e-04]
E 28: loss=0.1839 lr=1.2e-04 (11s)
E 29/50: 100%|██████████| 390/390 [00:10<00:00, 36.88b/s, loss=0.1837, lr=1.1e-04]
E 29: loss=0.1837 lr=1.1e-04 (11s)
E 30/50: 100%|██████████| 390/390 [00:10<00:00, 36.47b/s, loss=0.1823, lr=1.0e-04]
E 30: loss=0.1822 lr=1.0e-04 (11s) *
-> Saved samples/epoch_030.png
Relay diagnostics:
mid_block1.relay: drift=0.0918 rad (5.3°) gate=0.0586
mid_block2.relay: drift=0.1132 rad (6.5°) gate=0.0649
E 31/50: 100%|██████████| 390/390 [00:10<00:00, 36.72b/s, loss=0.1823, lr=9.6e-05]
E 31: loss=0.1823 lr=9.5e-05 (11s)
E 32/50: 100%|██████████| 390/390 [00:10<00:00, 36.64b/s, loss=0.1823, lr=8.7e-05]
E 32: loss=0.1823 lr=8.7e-05 (11s)
E 33/50: 100%|██████████| 390/390 [00:10<00:00, 36.85b/s, loss=0.1816, lr=7.9e-05]
E 33: loss=0.1816 lr=7.8e-05 (11s) *
E 34/50: 100%|██████████| 390/390 [00:10<00:00, 36.73b/s, loss=0.1809, lr=7.1e-05]
E 34: loss=0.1809 lr=7.0e-05 (11s) *
E 35/50: 100%|██████████| 390/390 [00:10<00:00, 36.80b/s, loss=0.1810, lr=6.3e-05]
E 35: loss=0.1810 lr=6.3e-05 (11s)
-> Saved samples/epoch_035.png
E 36/50: 100%|██████████| 390/390 [00:10<00:00, 36.77b/s, loss=0.1819, lr=5.5e-05]
E 36: loss=0.1819 lr=5.5e-05 (11s)
E 37/50: 100%|██████████| 390/390 [00:10<00:00, 36.79b/s, loss=0.1812, lr=4.8e-05]
E 37: loss=0.1813 lr=4.8e-05 (11s)
E 38/50: 100%|██████████| 390/390 [00:10<00:00, 36.86b/s, loss=0.1808, lr=4.2e-05]
E 38: loss=0.1808 lr=4.2e-05 (11s) *
E 39/50: 100%|██████████| 390/390 [00:10<00:00, 36.73b/s, loss=0.1815, lr=3.5e-05]
E 39: loss=0.1814 lr=3.5e-05 (11s)
E 40/50: 100%|██████████| 390/390 [00:10<00:00, 36.71b/s, loss=0.1800, lr=3.0e-05]
E 40: loss=0.1800 lr=3.0e-05 (11s) *
-> Saved samples/epoch_040.png
Relay diagnostics:
mid_block1.relay: drift=0.0964 rad (5.5°) gate=0.0593
mid_block2.relay: drift=0.1163 rad (6.7°) gate=0.0657
E 41/50: 100%|██████████| 390/390 [00:10<00:00, 36.53b/s, loss=0.1803, lr=2.4e-05]
E 41: loss=0.1803 lr=2.4e-05 (11s)
E 42/50: 100%|██████████| 390/390 [00:10<00:00, 36.30b/s, loss=0.1801, lr=2.0e-05]
E 42: loss=0.1801 lr=1.9e-05 (11s)
E 43/50: 100%|██████████| 390/390 [00:10<00:00, 36.80b/s, loss=0.1800, lr=1.5e-05]
E 43: loss=0.1799 lr=1.5e-05 (11s) *
E 44/50: 100%|██████████| 390/390 [00:10<00:00, 36.84b/s, loss=0.1801, lr=1.2e-05]
E 44: loss=0.1799 lr=1.1e-05 (11s)
E 45/50: 100%|██████████| 390/390 [00:10<00:00, 36.48b/s, loss=0.1800, lr=8.4e-06]
E 45: loss=0.1799 lr=8.3e-06 (11s) *
-> Saved samples/epoch_045.png
E 46/50: 100%|██████████| 390/390 [00:10<00:00, 36.62b/s, loss=0.1803, lr=5.8e-06]
E 46: loss=0.1805 lr=5.7e-06 (11s)
E 47/50: 100%|██████████| 390/390 [00:10<00:00, 36.73b/s, loss=0.1803, lr=3.7e-06]
E 47: loss=0.1803 lr=3.6e-06 (11s)
E 48/50: 100%|██████████| 390/390 [00:10<00:00, 36.84b/s, loss=0.1791, lr=2.2e-06]
E 48: loss=0.1793 lr=2.2e-06 (11s) *
E 49/50: 100%|██████████| 390/390 [00:10<00:00, 36.45b/s, loss=0.1796, lr=1.3e-06]
E 49: loss=0.1796 lr=1.3e-06 (11s)
E 50/50: 100%|██████████| 390/390 [00:10<00:00, 36.74b/s, loss=0.1797, lr=1.0e-06]
E 50: loss=0.1797 lr=1.0e-06 (11s)
-> Saved samples/epoch_050.png
Relay diagnostics:
mid_block1.relay: drift=0.0968 rad (5.5°) gate=0.0594
mid_block2.relay: drift=0.1164 rad (6.7°) gate=0.0658
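The diagnostics report drift in radians with the degree equivalent in parentheses. The conversion for the final values can be reproduced as below. How the drift itself is measured is not shown in the log, so this sketch covers only the unit conversion.

```python
import math

# Final drift values in radians, copied from the epoch-50 diagnostics.
final_drift_rad = {
    "mid_block1.relay": 0.0968,
    "mid_block2.relay": 0.1164,
}

# Convert to degrees and round to one decimal place, as the log does.
drift_deg = {name: round(math.degrees(rad), 1)
             for name, rad in final_drift_rad.items()}
```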
======================================================================
DONE - Best loss: 0.1793
Params: 6,746,403 (relay: 76,384)
Samples in: samples/
======================================================================
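The logged learning rates fall from 3.0e-04 to 1.0e-06 over the 50 epochs, which is consistent with per-step cosine annealing. This is an inference from the numbers, not something the log states. The sketch below assumes that schedule; `cosine_lr` and its parameter names are hypothetical. It reproduces the rates printed on the per-epoch summary lines at several epochs.

```python
import math


def cosine_lr(step: int, total_steps: int,
              lr_max: float = 3.0e-4, lr_min: float = 1.0e-6) -> float:
    """Cosine annealing from lr_max at step 0 to lr_min at total_steps."""
    progress = step / total_steps
    return lr_min + 0.5 * (lr_max - lr_min) * (1.0 + math.cos(math.pi * progress))


steps_per_epoch = 390   # from the progress bars
epochs = 50
total_steps = steps_per_epoch * epochs

# Learning rate at the end of selected epochs, formatted like the log.
lr_at_epoch_end = {
    e: f"{cosine_lr(e * steps_per_epoch, total_steps):.1e}"
    for e in (10, 20, 25, 30, 40, 50)
}
```

Because the rate changes every step under this assumption, the progress-bar value and the summary value for the same epoch can differ in the last digit, as they do for epoch 31 (9.6e-05 versus 9.5e-05).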