Model save
Browse files- README.md +48 -2
- best/model.safetensors +1 -1
- best/training_args.bin +1 -1
- model.safetensors +1 -1
README.md
CHANGED
|
@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 13 |
|
| 14 |
This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
|
| 15 |
It achieves the following results on the evaluation set:
|
| 16 |
-
- Loss:
|
| 17 |
|
| 18 |
## Model description
|
| 19 |
|
|
@@ -39,7 +39,7 @@ The following hyperparameters were used during training:
|
|
| 39 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
| 40 |
- lr_scheduler_type: linear
|
| 41 |
- lr_scheduler_warmup_steps: 30
|
| 42 |
-
- num_epochs:
|
| 43 |
- mixed_precision_training: Native AMP
|
| 44 |
|
| 45 |
### Training results
|
|
@@ -396,6 +396,52 @@ The following hyperparameters were used during training:
|
|
| 396 |
| 2.3827 | 39.7005 | 356352 | 4.5531 |
|
| 397 |
| 2.3827 | 39.8146 | 357376 | 4.5598 |
|
| 398 |
| 2.3827 | 39.9287 | 358400 | 4.5574 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 399 |
|
| 400 |
|
| 401 |
### Framework versions
|
|
|
|
| 13 |
|
| 14 |
This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
|
| 15 |
It achieves the following results on the evaluation set:
|
| 16 |
+
- Loss: 5.8601
|
| 17 |
|
| 18 |
## Model description
|
| 19 |
|
|
|
|
| 39 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
| 40 |
- lr_scheduler_type: linear
|
| 41 |
- lr_scheduler_warmup_steps: 30
|
| 42 |
+
- num_epochs: 60
|
| 43 |
- mixed_precision_training: Native AMP
|
| 44 |
|
| 45 |
### Training results
|
|
|
|
| 396 |
| 2.3827 | 39.7005 | 356352 | 4.5531 |
|
| 397 |
| 2.3827 | 39.8146 | 357376 | 4.5598 |
|
| 398 |
| 2.3827 | 39.9287 | 358400 | 4.5574 |
|
| 399 |
+
| 3.9984 | 40.0428 | 359424 | 5.7228 |
|
| 400 |
+
| 3.9984 | 40.1569 | 360448 | 5.6349 |
|
| 401 |
+
| 3.9984 | 40.2709 | 361472 | 5.6372 |
|
| 402 |
+
| 3.9984 | 40.3850 | 362496 | 5.5746 |
|
| 403 |
+
| 3.9984 | 40.4991 | 363520 | 5.5795 |
|
| 404 |
+
| 3.9984 | 40.6132 | 364544 | 5.5344 |
|
| 405 |
+
| 3.9984 | 40.7273 | 365568 | 5.5140 |
|
| 406 |
+
| 3.9984 | 40.8414 | 366592 | 5.4978 |
|
| 407 |
+
| 3.9984 | 40.9554 | 367616 | 5.4630 |
|
| 408 |
+
| 3.6244 | 41.0695 | 368640 | 5.4623 |
|
| 409 |
+
| 3.6244 | 41.1836 | 369664 | 5.4943 |
|
| 410 |
+
| 3.6244 | 41.2977 | 370688 | 5.4605 |
|
| 411 |
+
| 3.6244 | 41.4118 | 371712 | 5.5054 |
|
| 412 |
+
| 3.6244 | 41.5258 | 372736 | 5.4709 |
|
| 413 |
+
| 3.6244 | 41.6399 | 373760 | 5.5010 |
|
| 414 |
+
| 3.6244 | 41.7540 | 374784 | 5.5261 |
|
| 415 |
+
| 3.6244 | 41.8681 | 375808 | 5.5546 |
|
| 416 |
+
| 3.6244 | 41.9822 | 376832 | 5.5594 |
|
| 417 |
+
| 3.416 | 42.0963 | 377856 | 5.5247 |
|
| 418 |
+
| 3.416 | 42.2103 | 378880 | 5.5814 |
|
| 419 |
+
| 3.416 | 42.3244 | 379904 | 5.6016 |
|
| 420 |
+
| 3.416 | 42.4385 | 380928 | 5.5535 |
|
| 421 |
+
| 3.416 | 42.5526 | 381952 | 5.5606 |
|
| 422 |
+
| 3.416 | 42.6667 | 382976 | 5.5824 |
|
| 423 |
+
| 3.416 | 42.7807 | 384000 | 5.6214 |
|
| 424 |
+
| 3.416 | 42.8948 | 385024 | 5.6168 |
|
| 425 |
+
| 3.2543 | 43.0089 | 386048 | 5.6560 |
|
| 426 |
+
| 3.2543 | 43.1230 | 387072 | 5.6215 |
|
| 427 |
+
| 3.2543 | 43.2371 | 388096 | 5.7091 |
|
| 428 |
+
| 3.2543 | 43.3512 | 389120 | 5.7246 |
|
| 429 |
+
| 3.2543 | 43.4652 | 390144 | 5.6848 |
|
| 430 |
+
| 3.2543 | 43.5793 | 391168 | 5.7467 |
|
| 431 |
+
| 3.2543 | 43.6934 | 392192 | 5.7055 |
|
| 432 |
+
| 3.2543 | 43.8075 | 393216 | 5.7323 |
|
| 433 |
+
| 3.2543 | 43.9216 | 394240 | 5.7253 |
|
| 434 |
+
| 3.1132 | 44.0357 | 395264 | 5.7830 |
|
| 435 |
+
| 3.1132 | 44.1497 | 396288 | 5.7302 |
|
| 436 |
+
| 3.1132 | 44.2638 | 397312 | 5.7815 |
|
| 437 |
+
| 3.1132 | 44.3779 | 398336 | 5.7778 |
|
| 438 |
+
| 3.1132 | 44.4920 | 399360 | 5.8049 |
|
| 439 |
+
| 3.1132 | 44.6061 | 400384 | 5.7594 |
|
| 440 |
+
| 3.1132 | 44.7201 | 401408 | 5.7803 |
|
| 441 |
+
| 3.1132 | 44.8342 | 402432 | 5.8086 |
|
| 442 |
+
| 3.1132 | 44.9483 | 403456 | 5.8097 |
|
| 443 |
+
| 2.9936 | 45.0624 | 404480 | 5.8311 |
|
| 444 |
+
| 2.9936 | 45.1765 | 405504 | 5.8601 |
|
| 445 |
|
| 446 |
|
| 447 |
### Framework versions
|
best/model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 211234576
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d080a3096c85aa09745a609f9d6e6543a51c2d05260c0ec9a82fd455e47f0ccb
|
| 3 |
size 211234576
|
best/training_args.bin
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 5112
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2660a352155598071c791074e13fe1708575a37e1d19526e713ca16bd0e8ee36
|
| 3 |
size 5112
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 211234576
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d080a3096c85aa09745a609f9d6e6543a51c2d05260c0ec9a82fd455e47f0ccb
|
| 3 |
size 211234576
|