moe-has-xsum / best_model
912 MB
ishro's picture
Epoch 3: train_loss=3.6393, val_loss=3.6889
3e1faf9 verified