Fine-tuned mpnet-base for patent-claim binary classification MPNetv2
Browse files- README.md +33 -34
- model.safetensors +1 -1
README.md
CHANGED
|
@@ -1,7 +1,6 @@
|
|
| 1 |
---
|
| 2 |
library_name: transformers
|
| 3 |
-
|
| 4 |
-
base_model: sentence-transformers/all-mpnet-base-v2
|
| 5 |
tags:
|
| 6 |
- generated_from_trainer
|
| 7 |
metrics:
|
|
@@ -19,13 +18,13 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 19 |
|
| 20 |
# g-patentsbertav2-e2e
|
| 21 |
|
| 22 |
-
This model is a fine-tuned version of [
|
| 23 |
It achieves the following results on the evaluation set:
|
| 24 |
-
- Loss: 0.
|
| 25 |
-
- Accuracy: 0.
|
| 26 |
-
- Precision: 0.
|
| 27 |
-
- Recall: 0.
|
| 28 |
-
- F1: 0.
|
| 29 |
|
| 30 |
## Model description
|
| 31 |
|
|
@@ -58,32 +57,32 @@ The following hyperparameters were used during training:
|
|
| 58 |
|
| 59 |
| Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1 |
|
| 60 |
|:-------------:|:------:|:-----:|:---------------:|:--------:|:---------:|:------:|:------:|
|
| 61 |
-
| 0.
|
| 62 |
-
| 0.
|
| 63 |
-
| 0.
|
| 64 |
-
| 0.
|
| 65 |
-
| 0.
|
| 66 |
-
| 0.
|
| 67 |
-
| 0.
|
| 68 |
-
| 0.
|
| 69 |
-
| 0.
|
| 70 |
-
| 0.
|
| 71 |
-
| 0.
|
| 72 |
-
| 0.
|
| 73 |
-
| 0.
|
| 74 |
-
| 0.
|
| 75 |
-
| 0.
|
| 76 |
-
| 0.
|
| 77 |
-
| 0.
|
| 78 |
-
| 0.
|
| 79 |
-
| 0.
|
| 80 |
-
| 0.
|
| 81 |
-
| 0.
|
| 82 |
-
| 0.
|
| 83 |
-
| 0.
|
| 84 |
-
| 0.
|
| 85 |
-
| 0.
|
| 86 |
-
| 0.
|
| 87 |
|
| 88 |
|
| 89 |
### Framework versions
|
|
|
|
| 1 |
---
|
| 2 |
library_name: transformers
|
| 3 |
+
base_model: AAUBS/PatentSBERTa_V2
|
|
|
|
| 4 |
tags:
|
| 5 |
- generated_from_trainer
|
| 6 |
metrics:
|
|
|
|
| 18 |
|
| 19 |
# g-patentsbertav2-e2e
|
| 20 |
|
| 21 |
+
This model is a fine-tuned version of [AAUBS/PatentSBERTa_V2](https://huggingface.co/AAUBS/PatentSBERTa_V2) on the None dataset.
|
| 22 |
It achieves the following results on the evaluation set:
|
| 23 |
+
- Loss: 0.4385
|
| 24 |
+
- Accuracy: 0.8350
|
| 25 |
+
- Precision: 0.2942
|
| 26 |
+
- Recall: 0.8453
|
| 27 |
+
- F1: 0.4365
|
| 28 |
|
| 29 |
## Model description
|
| 30 |
|
|
|
|
| 57 |
|
| 58 |
| Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1 |
|
| 59 |
|:-------------:|:------:|:-----:|:---------------:|:--------:|:---------:|:------:|:------:|
|
| 60 |
+
| 0.4397 | 0.1866 | 2000 | 0.4432 | 0.7928 | 0.7523 | 0.8625 | 0.8036 |
|
| 61 |
+
| 0.4168 | 0.3731 | 4000 | 0.4196 | 0.8100 | 0.8480 | 0.7473 | 0.7945 |
|
| 62 |
+
| 0.4098 | 0.5597 | 6000 | 0.4036 | 0.8172 | 0.7941 | 0.8478 | 0.8201 |
|
| 63 |
+
| 0.4003 | 0.7463 | 8000 | 0.4040 | 0.8204 | 0.8322 | 0.7949 | 0.8131 |
|
| 64 |
+
| 0.3876 | 0.9328 | 10000 | 0.3855 | 0.8256 | 0.7981 | 0.8636 | 0.8296 |
|
| 65 |
+
| 0.3634 | 1.1194 | 12000 | 0.4008 | 0.8283 | 0.8099 | 0.8501 | 0.8295 |
|
| 66 |
+
| 0.3531 | 1.3060 | 14000 | 0.3921 | 0.8334 | 0.8050 | 0.8722 | 0.8373 |
|
| 67 |
+
| 0.3448 | 1.4925 | 16000 | 0.3845 | 0.8329 | 0.8110 | 0.8607 | 0.8351 |
|
| 68 |
+
| 0.3741 | 1.6791 | 18000 | 0.3875 | 0.8379 | 0.8525 | 0.8104 | 0.8309 |
|
| 69 |
+
| 0.3558 | 1.8657 | 20000 | 0.3771 | 0.8416 | 0.8416 | 0.8348 | 0.8382 |
|
| 70 |
+
| 0.3137 | 2.0522 | 22000 | 0.3830 | 0.8415 | 0.8304 | 0.8512 | 0.8407 |
|
| 71 |
+
| 0.2852 | 2.2388 | 24000 | 0.4188 | 0.8359 | 0.8102 | 0.8699 | 0.8390 |
|
| 72 |
+
| 0.2933 | 2.4254 | 26000 | 0.4280 | 0.8426 | 0.8333 | 0.8496 | 0.8414 |
|
| 73 |
+
| 0.3250 | 2.6119 | 28000 | 0.3846 | 0.8383 | 0.8162 | 0.8659 | 0.8403 |
|
| 74 |
+
| 0.3087 | 2.7985 | 30000 | 0.3822 | 0.8418 | 0.8313 | 0.8508 | 0.8409 |
|
| 75 |
+
| 0.2794 | 2.9851 | 32000 | 0.3997 | 0.8398 | 0.8267 | 0.8528 | 0.8395 |
|
| 76 |
+
| 0.2409 | 3.1716 | 34000 | 0.4402 | 0.8354 | 0.8198 | 0.8523 | 0.8358 |
|
| 77 |
+
| 0.2451 | 3.3582 | 36000 | 0.4359 | 0.8408 | 0.8340 | 0.8440 | 0.8390 |
|
| 78 |
+
| 0.2557 | 3.5448 | 38000 | 0.4241 | 0.8383 | 0.8382 | 0.8314 | 0.8348 |
|
| 79 |
+
| 0.2409 | 3.7313 | 40000 | 0.4388 | 0.8420 | 0.8482 | 0.8264 | 0.8372 |
|
| 80 |
+
| 0.2493 | 3.9179 | 42000 | 0.4319 | 0.8468 | 0.8357 | 0.8566 | 0.8460 |
|
| 81 |
+
| 0.2013 | 4.1045 | 44000 | 0.4871 | 0.8409 | 0.8329 | 0.8460 | 0.8394 |
|
| 82 |
+
| 0.2074 | 4.2910 | 46000 | 0.4942 | 0.8401 | 0.8373 | 0.8375 | 0.8374 |
|
| 83 |
+
| 0.2205 | 4.4776 | 48000 | 0.5179 | 0.8421 | 0.8388 | 0.8402 | 0.8395 |
|
| 84 |
+
| 0.1912 | 4.6642 | 50000 | 0.4978 | 0.8385 | 0.8274 | 0.8483 | 0.8377 |
|
| 85 |
+
| 0.1933 | 4.8507 | 52000 | 0.5141 | 0.8391 | 0.8238 | 0.8557 | 0.8395 |
|
| 86 |
|
| 87 |
|
| 88 |
### Framework versions
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 437975176
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b11826874b4b52f5cabf5c1de6ee199c1f86a2565c87b842f07af599dc269524
|
| 3 |
size 437975176
|