geolip-diffusion-proto / cifar10_proto_diffuser_output.txt
AbstractPhil's picture
Create cifar10_proto_diffuser_output.txt
094c5fd verified
======================================================================
FLOW MATCHING + CONSTELLATION RELAY REGULATOR
Dataset: CIFAR-10
Base channels: 64
Relay: True
Flow matching: ODE (conditional)
Sampler: Euler, 50 steps
Device: cuda
======================================================================
Train: 50,000 images
Total params: 6,746,403
Relay params: 76,384 (1.1%)
Relay modules: 2
======================================================================
TRAINING β€” 50 epochs
======================================================================
E 1/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:11<00:00, 34.46b/s, loss=0.3728, lr=3.0e-04]
E 1: loss=0.3695 lr=3.0e-04 (11s) β˜…
β†’ Saved samples/epoch_001.png
E 2/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.22b/s, loss=0.2382, lr=3.0e-04]
E 2: loss=0.2379 lr=3.0e-04 (11s) β˜…
E 3/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.74b/s, loss=0.2233, lr=3.0e-04]
E 3: loss=0.2230 lr=3.0e-04 (11s) β˜…
E 4/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.82b/s, loss=0.2147, lr=3.0e-04]
E 4: loss=0.2145 lr=3.0e-04 (11s) β˜…
E 5/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 37.00b/s, loss=0.2094, lr=2.9e-04]
E 5: loss=0.2093 lr=2.9e-04 (11s) β˜…
β†’ Saved samples/epoch_005.png
E 6/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.99b/s, loss=0.2050, lr=2.9e-04]
E 6: loss=0.2049 lr=2.9e-04 (11s) β˜…
E 7/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.87b/s, loss=0.2010, lr=2.9e-04]
E 7: loss=0.2009 lr=2.9e-04 (11s) β˜…
E 8/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.73b/s, loss=0.1984, lr=2.8e-04]
E 8: loss=0.1983 lr=2.8e-04 (11s) β˜…
E 9/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.68b/s, loss=0.1966, lr=2.8e-04]
E 9: loss=0.1967 lr=2.8e-04 (11s) β˜…
E 10/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.75b/s, loss=0.1950, lr=2.7e-04]
E 10: loss=0.1951 lr=2.7e-04 (11s) β˜…
β†’ Saved samples/epoch_010.png
Relay diagnostics:
mid_block1.relay: drift=0.0382 rad (2.2Β°) gate=0.0519
mid_block2.relay: drift=0.0548 rad (3.1Β°) gate=0.0548
E 11/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.42b/s, loss=0.1947, lr=2.7e-04]
E 11: loss=0.1946 lr=2.7e-04 (11s) β˜…
E 12/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.88b/s, loss=0.1923, lr=2.6e-04]
E 12: loss=0.1923 lr=2.6e-04 (11s) β˜…
E 13/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.86b/s, loss=0.1910, lr=2.5e-04]
E 13: loss=0.1909 lr=2.5e-04 (11s) β˜…
E 14/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.50b/s, loss=0.1907, lr=2.5e-04]
E 14: loss=0.1907 lr=2.5e-04 (11s) β˜…
E 15/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.65b/s, loss=0.1901, lr=2.4e-04]
E 15: loss=0.1901 lr=2.4e-04 (11s) β˜…
β†’ Saved samples/epoch_015.png
E 16/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.56b/s, loss=0.1894, lr=2.3e-04]
E 16: loss=0.1893 lr=2.3e-04 (11s) β˜…
E 17/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.26b/s, loss=0.1881, lr=2.2e-04]
E 17: loss=0.1880 lr=2.2e-04 (11s) β˜…
E 18/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.85b/s, loss=0.1883, lr=2.1e-04]
E 18: loss=0.1883 lr=2.1e-04 (11s)
E 19/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.58b/s, loss=0.1875, lr=2.1e-04]
E 19: loss=0.1874 lr=2.1e-04 (11s) β˜…
E 20/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.81b/s, loss=0.1869, lr=2.0e-04]
E 20: loss=0.1870 lr=2.0e-04 (11s) β˜…
β†’ Saved samples/epoch_020.png
Relay diagnostics:
mid_block1.relay: drift=0.0703 rad (4.0Β°) gate=0.0561
mid_block2.relay: drift=0.0938 rad (5.4Β°) gate=0.0618
E 21/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.78b/s, loss=0.1853, lr=1.9e-04]
E 21: loss=0.1853 lr=1.9e-04 (11s) β˜…
E 22/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.75b/s, loss=0.1864, lr=1.8e-04]
E 22: loss=0.1864 lr=1.8e-04 (11s)
E 23/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.73b/s, loss=0.1851, lr=1.7e-04]
E 23: loss=0.1851 lr=1.7e-04 (11s) β˜…
E 24/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.72b/s, loss=0.1849, lr=1.6e-04]
E 24: loss=0.1849 lr=1.6e-04 (11s) β˜…
E 25/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.78b/s, loss=0.1850, lr=1.5e-04]
E 25: loss=0.1849 lr=1.5e-04 (11s) β˜…
β†’ Saved samples/epoch_025.png
E 26/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.78b/s, loss=0.1851, lr=1.4e-04]
E 26: loss=0.1848 lr=1.4e-04 (11s) β˜…
E 27/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.85b/s, loss=0.1835, lr=1.3e-04]
E 27: loss=0.1833 lr=1.3e-04 (11s) β˜…
E 28/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.86b/s, loss=0.1840, lr=1.2e-04]
E 28: loss=0.1839 lr=1.2e-04 (11s)
E 29/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.88b/s, loss=0.1837, lr=1.1e-04]
E 29: loss=0.1837 lr=1.1e-04 (11s)
E 30/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.47b/s, loss=0.1823, lr=1.0e-04]
E 30: loss=0.1822 lr=1.0e-04 (11s) β˜…
β†’ Saved samples/epoch_030.png
Relay diagnostics:
mid_block1.relay: drift=0.0918 rad (5.3Β°) gate=0.0586
mid_block2.relay: drift=0.1132 rad (6.5Β°) gate=0.0649
E 31/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.72b/s, loss=0.1823, lr=9.6e-05]
E 31: loss=0.1823 lr=9.5e-05 (11s)
E 32/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.64b/s, loss=0.1823, lr=8.7e-05]
E 32: loss=0.1823 lr=8.7e-05 (11s)
E 33/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.85b/s, loss=0.1816, lr=7.9e-05]
E 33: loss=0.1816 lr=7.8e-05 (11s) β˜…
E 34/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.73b/s, loss=0.1809, lr=7.1e-05]
E 34: loss=0.1809 lr=7.0e-05 (11s) β˜…
E 35/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.80b/s, loss=0.1810, lr=6.3e-05]
E 35: loss=0.1810 lr=6.3e-05 (11s)
β†’ Saved samples/epoch_035.png
E 36/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.77b/s, loss=0.1819, lr=5.5e-05]
E 36: loss=0.1819 lr=5.5e-05 (11s)
E 37/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.79b/s, loss=0.1812, lr=4.8e-05]
E 37: loss=0.1813 lr=4.8e-05 (11s)
E 38/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.86b/s, loss=0.1808, lr=4.2e-05]
E 38: loss=0.1808 lr=4.2e-05 (11s) β˜…
E 39/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.73b/s, loss=0.1815, lr=3.5e-05]
E 39: loss=0.1814 lr=3.5e-05 (11s)
E 40/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.71b/s, loss=0.1800, lr=3.0e-05]
E 40: loss=0.1800 lr=3.0e-05 (11s) β˜…
β†’ Saved samples/epoch_040.png
Relay diagnostics:
mid_block1.relay: drift=0.0964 rad (5.5Β°) gate=0.0593
mid_block2.relay: drift=0.1163 rad (6.7Β°) gate=0.0657
E 41/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.53b/s, loss=0.1803, lr=2.4e-05]
E 41: loss=0.1803 lr=2.4e-05 (11s)
E 42/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.30b/s, loss=0.1801, lr=2.0e-05]
E 42: loss=0.1801 lr=1.9e-05 (11s)
E 43/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.80b/s, loss=0.1800, lr=1.5e-05]
E 43: loss=0.1799 lr=1.5e-05 (11s) β˜…
E 44/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.84b/s, loss=0.1801, lr=1.2e-05]
E 44: loss=0.1799 lr=1.1e-05 (11s)
E 45/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.48b/s, loss=0.1800, lr=8.4e-06]
E 45: loss=0.1799 lr=8.3e-06 (11s) β˜…
β†’ Saved samples/epoch_045.png
E 46/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.62b/s, loss=0.1803, lr=5.8e-06]
E 46: loss=0.1805 lr=5.7e-06 (11s)
E 47/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.73b/s, loss=0.1803, lr=3.7e-06]
E 47: loss=0.1803 lr=3.6e-06 (11s)
E 48/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.84b/s, loss=0.1791, lr=2.2e-06]
E 48: loss=0.1793 lr=2.2e-06 (11s) β˜…
E 49/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.45b/s, loss=0.1796, lr=1.3e-06]
E 49: loss=0.1796 lr=1.3e-06 (11s)
E 50/50: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 390/390 [00:10<00:00, 36.74b/s, loss=0.1797, lr=1.0e-06]
E 50: loss=0.1797 lr=1.0e-06 (11s)
β†’ Saved samples/epoch_050.png
Relay diagnostics:
mid_block1.relay: drift=0.0968 rad (5.5Β°) gate=0.0594
mid_block2.relay: drift=0.1164 rad (6.7Β°) gate=0.0658
======================================================================
DONE β€” Best loss: 0.1793
Params: 6,746,403 (relay: 76,384)
Samples in: samples/
======================================================================