moe-hash-xsum-noload / best_model
912 MB
ishro
Epoch 4: train_loss=3.3821, val_loss=3.5047
3932fcc verified