internal-states-peek / 2l-2gpus /000-module.3.self_attention.scale_mask_softmax
293 kB
stas's picture
small model
5a89037