Training setup
Num train steps 10000
Max seq len 256
Batch size 512
Total data points seen 5.1 mil
Total tokens seen 450 mil
Checkpoint step 10000
Learning rate 3e-4
Metric Val Test
BLEU 32.2 30.1
chrf++ 50.1 48.5
Downloads last month
9
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support