Add new CrossEncoder model
Browse files
README.md
CHANGED
|
@@ -147,8 +147,8 @@ You can finetune this model on your own dataset.
|
|
| 147 |
#### Non-Default Hyperparameters
|
| 148 |
|
| 149 |
- `eval_strategy`: steps
|
| 150 |
-
- `per_device_train_batch_size`:
|
| 151 |
-
- `per_device_eval_batch_size`:
|
| 152 |
- `learning_rate`: 1e-05
|
| 153 |
- `num_train_epochs`: 1
|
| 154 |
- `warmup_steps`: 10283
|
|
@@ -162,8 +162,8 @@ You can finetune this model on your own dataset.
|
|
| 162 |
- `do_predict`: False
|
| 163 |
- `eval_strategy`: steps
|
| 164 |
- `prediction_loss_only`: True
|
| 165 |
-
- `per_device_train_batch_size`:
|
| 166 |
-
- `per_device_eval_batch_size`:
|
| 167 |
- `per_gpu_train_batch_size`: None
|
| 168 |
- `per_gpu_eval_batch_size`: None
|
| 169 |
- `gradient_accumulation_steps`: 1
|
|
@@ -280,14 +280,13 @@ You can finetune this model on your own dataset.
|
|
| 280 |
</details>
|
| 281 |
|
| 282 |
### Training Logs
|
| 283 |
-
| Epoch
|
| 284 |
-
|
| 285 |
-
| 0.
|
| 286 |
-
| 0.
|
| 287 |
-
| 0.
|
| 288 |
-
| 0.3111 | 1000 | 0.0303 | 0.0189 |
|
| 289 |
-
| 0.4667 | 1500 | 0.0157 | 0.0357 |
|
| 290 |
|
|
|
|
| 291 |
|
| 292 |
### Framework Versions
|
| 293 |
- Python: 3.12.11
|
|
|
|
| 147 |
#### Non-Default Hyperparameters
|
| 148 |
|
| 149 |
- `eval_strategy`: steps
|
| 150 |
+
- `per_device_train_batch_size`: 64
|
| 151 |
+
- `per_device_eval_batch_size`: 64
|
| 152 |
- `learning_rate`: 1e-05
|
| 153 |
- `num_train_epochs`: 1
|
| 154 |
- `warmup_steps`: 10283
|
|
|
|
| 162 |
- `do_predict`: False
|
| 163 |
- `eval_strategy`: steps
|
| 164 |
- `prediction_loss_only`: True
|
| 165 |
+
- `per_device_train_batch_size`: 64
|
| 166 |
+
- `per_device_eval_batch_size`: 64
|
| 167 |
- `per_gpu_train_batch_size`: None
|
| 168 |
- `per_gpu_eval_batch_size`: None
|
| 169 |
- `gradient_accumulation_steps`: 1
|
|
|
|
| 280 |
</details>
|
| 281 |
|
| 282 |
### Training Logs
|
| 283 |
+
| Epoch | Step | Training Loss | Validation Loss |
|
| 284 |
+
|:----------:|:--------:|:-------------:|:---------------:|
|
| 285 |
+
| 0.3111 | 500 | 0.0082 | 0.0072 |
|
| 286 |
+
| **0.6223** | **1000** | **0.0043** | **0.0027** |
|
| 287 |
+
| 0.9334 | 1500 | 0.0041 | 0.0388 |
|
|
|
|
|
|
|
| 288 |
|
| 289 |
+
* The bold row denotes the saved checkpoint.
|
| 290 |
|
| 291 |
### Framework Versions
|
| 292 |
- Python: 3.12.11
|