Sean Halpin commited on
Commit ·
2646413
1
Parent(s): bbb4872
update model card README.md
Browse files
README.md
CHANGED
|
@@ -38,7 +38,7 @@ on the `CIFAR10` dataset.
|
|
| 38 |
The following hyperparameters were used during training:
|
| 39 |
- learning_rate: 1e-05
|
| 40 |
- train_batch_size: 256
|
| 41 |
-
- eval_batch_size:
|
| 42 |
- gradient_accumulation_steps: 1
|
| 43 |
- optimizer: AdamW with betas=(0.95, 0.999), weight_decay=1e-06 and epsilon=1e-08
|
| 44 |
- lr_scheduler: cosine
|
|
|
|
| 38 |
The following hyperparameters were used during training:
|
| 39 |
- learning_rate: 1e-05
|
| 40 |
- train_batch_size: 256
|
| 41 |
+
- eval_batch_size: 16
|
| 42 |
- gradient_accumulation_steps: 1
|
| 43 |
- optimizer: AdamW with betas=(0.95, 0.999), weight_decay=1e-06 and epsilon=1e-08
|
| 44 |
- lr_scheduler: cosine
|