software-si commited on
Commit
89d2729
·
verified ·
1 Parent(s): bf8d904

Add new CrossEncoder model

Browse files
Files changed (1) hide show
  1. README.md +10 -11
README.md CHANGED
@@ -147,8 +147,8 @@ You can finetune this model on your own dataset.
147
  #### Non-Default Hyperparameters
148
 
149
  - `eval_strategy`: steps
150
- - `per_device_train_batch_size`: 32
151
- - `per_device_eval_batch_size`: 32
152
  - `learning_rate`: 1e-05
153
  - `num_train_epochs`: 1
154
  - `warmup_steps`: 10283
@@ -162,8 +162,8 @@ You can finetune this model on your own dataset.
162
  - `do_predict`: False
163
  - `eval_strategy`: steps
164
  - `prediction_loss_only`: True
165
- - `per_device_train_batch_size`: 32
166
- - `per_device_eval_batch_size`: 32
167
  - `per_gpu_train_batch_size`: None
168
  - `per_gpu_eval_batch_size`: None
169
  - `gradient_accumulation_steps`: 1
@@ -280,14 +280,13 @@ You can finetune this model on your own dataset.
280
  </details>
281
 
282
  ### Training Logs
283
- | Epoch | Step | Training Loss | Validation Loss |
284
- |:------:|:----:|:-------------:|:---------------:|
285
- | 0.1556 | 500 | 0.2842 | 0.1468 |
286
- | 0.3111 | 1000 | 0.1083 | 0.0741 |
287
- | 0.1556 | 500 | 0.0652 | 0.0457 |
288
- | 0.3111 | 1000 | 0.0303 | 0.0189 |
289
- | 0.4667 | 1500 | 0.0157 | 0.0357 |
290
 
 
291
 
292
  ### Framework Versions
293
  - Python: 3.12.11
 
147
  #### Non-Default Hyperparameters
148
 
149
  - `eval_strategy`: steps
150
+ - `per_device_train_batch_size`: 64
151
+ - `per_device_eval_batch_size`: 64
152
  - `learning_rate`: 1e-05
153
  - `num_train_epochs`: 1
154
  - `warmup_steps`: 10283
 
162
  - `do_predict`: False
163
  - `eval_strategy`: steps
164
  - `prediction_loss_only`: True
165
+ - `per_device_train_batch_size`: 64
166
+ - `per_device_eval_batch_size`: 64
167
  - `per_gpu_train_batch_size`: None
168
  - `per_gpu_eval_batch_size`: None
169
  - `gradient_accumulation_steps`: 1
 
280
  </details>
281
 
282
  ### Training Logs
283
+ | Epoch | Step | Training Loss | Validation Loss |
284
+ |:----------:|:--------:|:-------------:|:---------------:|
285
+ | 0.3111 | 500 | 0.0082 | 0.0072 |
286
+ | **0.6223** | **1000** | **0.0043** | **0.0027** |
287
+ | 0.9334 | 1500 | 0.0041 | 0.0388 |
 
 
288
 
289
+ * The bold row denotes the saved checkpoint.
290
 
291
  ### Framework Versions
292
  - Python: 3.12.11