Add trained checkpoint: 3B tokens, loss=3.16, MFU=31.5% 5442313 verified prometheus04 committed on May 28
GPU-session fixes (RNG cpu, shard filter, cu124, 3090 config) 511257f verified prometheus04 committed on May 23