en-ms-transformer/src/training.py

Commit History

v2: 2M training, dropout 0.1, full-corpus tokenizer — chrF 48.93 (was 45.62)
d7fa769
verified

AstralPotato committed on

Upload en-ms Transformer (6+2 Tied, 16K BPE, chrF 45.62)
e7f17a4
verified

AstralPotato committed on