deberta-v3-large_spell_5k_1_p3 / train_results.json
stuartmesham
Upload with huggingface_hub
a4d9a73
196 Bytes
{
  "epoch": 4.0,
  "train_loss": 0.2863717150332323,
  "train_runtime": 491.6983,
  "train_samples": 34304,
  "train_samples_per_second": 1046.495,
  "train_steps_per_second": 8.176
}
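
The throughput fields above do not follow directly from `epoch`, `train_samples`, and `train_runtime`. The short Python sketch below works through the arithmetic to show what the reported rates imply. The variable names are illustrative and not part of the file. The reading that the rates were computed from a planned step count larger than what 4 epochs would need (for example, if training stopped early) is an assumption and is not confirmed by the file.

```python
import json

# The metrics exactly as reported in train_results.json above.
raw = """
{
  "epoch": 4.0,
  "train_loss": 0.2863717150332323,
  "train_runtime": 491.6983,
  "train_samples": 34304,
  "train_samples_per_second": 1046.495,
  "train_steps_per_second": 8.176
}
"""
metrics = json.loads(raw)

runtime = metrics["train_runtime"]            # seconds
samples_per_s = metrics["train_samples_per_second"]
steps_per_s = metrics["train_steps_per_second"]

# Quantities implied by the reported rates.
implied_steps = steps_per_s * runtime         # total optimizer steps
implied_samples = samples_per_s * runtime     # total samples processed
implied_batch = samples_per_s / steps_per_s   # samples per step
implied_passes = implied_samples / metrics["train_samples"]

# Throughput if exactly `epoch` passes over `train_samples` had run
# in `train_runtime` seconds.
naive_samples_per_s = metrics["epoch"] * metrics["train_samples"] / runtime

print(f"implied steps:           {implied_steps:.0f}")
print(f"implied batch size:      {implied_batch:.0f}")
print(f"implied passes over set: {implied_passes:.2f}")
print(f"throughput from epoch:   {naive_samples_per_s:.1f} samples/s")
```

The reported rates imply roughly 4020 steps at about 128 samples per step, or about 15 passes over the 34304 training samples. Four epochs in the same runtime would give only about 279 samples per second. It is therefore safer to treat `train_samples_per_second` and `train_steps_per_second` here as upper bounds until the training configuration is checked.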