bert-beatrix-2048 / training_state.json
AbstractPhil's picture
Upload 12 files
14cdead verified
raw
history blame
293 Bytes
{
"step": 248000,
"epoch": 13,
"vocab_size": 30574,
"model_vocab_size": 30592,
"config": {
"optimizer_type": "adamw",
"lr": 2e-05,
"weight_decay": 0.01,
"warmup_steps": 8000,
"scheduler_type": "cosine",
"scheduler_params": {
"eta_min": 1e-07
}
}
}