deberta-v3-large_spell_10k_3_p3 / train_results.json
stuartmesham's picture
Upload with huggingface_hub
ae9749e
raw
history blame contribute delete
196 Bytes
{
"epoch": 5.0,
"train_loss": 0.25945684233708166,
"train_runtime": 598.3689,
"train_samples": 34304,
"train_samples_per_second": 859.938,
"train_steps_per_second": 6.718
}