xonic48 commited on
Commit
5093722
·
verified ·
1 Parent(s): ed8f7eb

End of training

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -34,11 +34,11 @@ More information needed
34
 
35
  The following hyperparameters were used during training:
36
  - learning_rate: 0.0005
37
- - train_batch_size: 64
38
- - eval_batch_size: 64
39
  - seed: 42
40
  - gradient_accumulation_steps: 8
41
- - total_train_batch_size: 512
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: cosine
44
  - lr_scheduler_warmup_steps: 1000
 
34
 
35
  The following hyperparameters were used during training:
36
  - learning_rate: 0.0005
37
+ - train_batch_size: 32
38
+ - eval_batch_size: 32
39
  - seed: 42
40
  - gradient_accumulation_steps: 8
41
+ - total_train_batch_size: 256
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: cosine
44
  - lr_scheduler_warmup_steps: 1000