subcate_cs
Browse files- README.md +14 -19
- model.safetensors +1 -1
- runs/Jul10_06-41-27_49a3fa77351e/events.out.tfevents.1720593693.49a3fa77351e.25532.7 +3 -0
- runs/Jul10_06-42-36_49a3fa77351e/events.out.tfevents.1720593762.49a3fa77351e.25532.8 +3 -0
- runs/Jul10_07-04-16_49a3fa77351e/events.out.tfevents.1720595071.49a3fa77351e.25532.9 +3 -0
- runs/Jul10_07-04-16_49a3fa77351e/events.out.tfevents.1720595643.49a3fa77351e.25532.10 +3 -0
- runs/Jul10_07-14-40_49a3fa77351e/events.out.tfevents.1720595686.49a3fa77351e.25532.11 +3 -0
- runs/Jul10_07-14-40_49a3fa77351e/events.out.tfevents.1720596255.49a3fa77351e.25532.12 +3 -0
- runs/Jul10_07-24-58_49a3fa77351e/events.out.tfevents.1720596305.49a3fa77351e.25532.13 +3 -0
- runs/Jul10_07-25-24_49a3fa77351e/events.out.tfevents.1720596330.49a3fa77351e.25532.14 +3 -0
- runs/Jul10_07-25-46_49a3fa77351e/events.out.tfevents.1720596353.49a3fa77351e.25532.15 +3 -0
- runs/Jul10_07-25-46_49a3fa77351e/events.out.tfevents.1720596909.49a3fa77351e.25532.16 +3 -0
- training_args.bin +1 -1
README.md
CHANGED
|
@@ -20,11 +20,11 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 20 |
|
| 21 |
This model is a fine-tuned version of [microsoft/deberta-v3-small](https://huggingface.co/microsoft/deberta-v3-small) on the None dataset.
|
| 22 |
It achieves the following results on the evaluation set:
|
| 23 |
-
- Loss: 0.
|
| 24 |
-
- Accuracy: 0.
|
| 25 |
-
- F1: 0.
|
| 26 |
-
- Precision: 0.
|
| 27 |
-
- Recall: 0.
|
| 28 |
|
| 29 |
## Model description
|
| 30 |
|
|
@@ -43,28 +43,23 @@ More information needed
|
|
| 43 |
### Training hyperparameters
|
| 44 |
|
| 45 |
The following hyperparameters were used during training:
|
| 46 |
-
- learning_rate:
|
| 47 |
-
- train_batch_size:
|
| 48 |
-
- eval_batch_size:
|
| 49 |
- seed: 42
|
| 50 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
| 51 |
- lr_scheduler_type: linear
|
| 52 |
-
- num_epochs:
|
| 53 |
|
| 54 |
### Training results
|
| 55 |
|
| 56 |
| Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Precision | Recall |
|
| 57 |
|:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
|
| 58 |
-
| No log | 1.0 |
|
| 59 |
-
|
|
| 60 |
-
|
|
| 61 |
-
|
|
| 62 |
-
|
|
| 63 |
-
| 0.0213 | 6.0 | 2340 | 0.0479 | 0.6656 | 0.7041 | 0.7365 | 0.6743 |
|
| 64 |
-
| 0.0169 | 7.0 | 2730 | 0.0491 | 0.6701 | 0.7060 | 0.7316 | 0.6820 |
|
| 65 |
-
| 0.0146 | 8.0 | 3120 | 0.0504 | 0.6759 | 0.7055 | 0.7263 | 0.6859 |
|
| 66 |
-
| 0.012 | 9.0 | 3510 | 0.0512 | 0.6797 | 0.7070 | 0.7281 | 0.6871 |
|
| 67 |
-
| 0.012 | 10.0 | 3900 | 0.0513 | 0.6829 | 0.7101 | 0.7310 | 0.6903 |
|
| 68 |
|
| 69 |
|
| 70 |
### Framework versions
|
|
|
|
| 20 |
|
| 21 |
This model is a fine-tuned version of [microsoft/deberta-v3-small](https://huggingface.co/microsoft/deberta-v3-small) on the None dataset.
|
| 22 |
It achieves the following results on the evaluation set:
|
| 23 |
+
- Loss: 0.0762
|
| 24 |
+
- Accuracy: 0.6868
|
| 25 |
+
- F1: 0.7105
|
| 26 |
+
- Precision: 0.7235
|
| 27 |
+
- Recall: 0.6980
|
| 28 |
|
| 29 |
## Model description
|
| 30 |
|
|
|
|
| 43 |
### Training hyperparameters
|
| 44 |
|
| 45 |
The following hyperparameters were used during training:
|
| 46 |
+
- learning_rate: 2e-05
|
| 47 |
+
- train_batch_size: 64
|
| 48 |
+
- eval_batch_size: 256
|
| 49 |
- seed: 42
|
| 50 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
| 51 |
- lr_scheduler_type: linear
|
| 52 |
+
- num_epochs: 5
|
| 53 |
|
| 54 |
### Training results
|
| 55 |
|
| 56 |
| Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Precision | Recall |
|
| 57 |
|:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
|
| 58 |
+
| No log | 1.0 | 98 | 0.0743 | 0.6887 | 0.7074 | 0.7103 | 0.7044 |
|
| 59 |
+
| No log | 2.0 | 196 | 0.0753 | 0.6926 | 0.7141 | 0.7262 | 0.7025 |
|
| 60 |
+
| No log | 3.0 | 294 | 0.0751 | 0.6829 | 0.7078 | 0.7219 | 0.6942 |
|
| 61 |
+
| No log | 4.0 | 392 | 0.0774 | 0.6797 | 0.7009 | 0.7117 | 0.6903 |
|
| 62 |
+
| No log | 5.0 | 490 | 0.0762 | 0.6868 | 0.7105 | 0.7235 | 0.6980 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 63 |
|
| 64 |
|
| 65 |
### Framework versions
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 567715456
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:997a0d591a4e09faa4d11509e9c01c2762f504d82e4c21ca8f7f1fa156c5a82e
|
| 3 |
size 567715456
|
runs/Jul10_06-41-27_49a3fa77351e/events.out.tfevents.1720593693.49a3fa77351e.25532.7
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f69207aa1c60674723af15c41d491c0bb67271d0c8618845b9aa1914ab3ca24a
|
| 3 |
+
size 7740
|
runs/Jul10_06-42-36_49a3fa77351e/events.out.tfevents.1720593762.49a3fa77351e.25532.8
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1f6bcfb12081f0ea931a96718483c959e6e929859a5723fdc48afe3f4ae6a1f8
|
| 3 |
+
size 13447
|
runs/Jul10_07-04-16_49a3fa77351e/events.out.tfevents.1720595071.49a3fa77351e.25532.9
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:141d2eaa2a03bf934d1f09409afe43d8c276a8417093e32f2e7a7822f4d85156
|
| 3 |
+
size 10444
|
runs/Jul10_07-04-16_49a3fa77351e/events.out.tfevents.1720595643.49a3fa77351e.25532.10
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:933aafbf529320e65c2875d8e38557bd3c6cbd4004a25d0ec64a8650404c5db8
|
| 3 |
+
size 560
|
runs/Jul10_07-14-40_49a3fa77351e/events.out.tfevents.1720595686.49a3fa77351e.25532.11
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:4d4668aa82ee725904bf223141bfae31aa7e8d4bebe89544afe052c189344173
|
| 3 |
+
size 10445
|
runs/Jul10_07-14-40_49a3fa77351e/events.out.tfevents.1720596255.49a3fa77351e.25532.12
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:bc3003c2977bac7fb82ef66a729823f8bc0fee263bec057897ec0adbb31af0b0
|
| 3 |
+
size 560
|
runs/Jul10_07-24-58_49a3fa77351e/events.out.tfevents.1720596305.49a3fa77351e.25532.13
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:c02a3e39a2ce633fb92044eef63f1dfec7effc84c35b0c95dc384990f2f497c7
|
| 3 |
+
size 7741
|
runs/Jul10_07-25-24_49a3fa77351e/events.out.tfevents.1720596330.49a3fa77351e.25532.14
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b1a5479647027e1f2eae73feb583c46fcb7c0cdd39014b913ea66559d7d46162
|
| 3 |
+
size 7741
|
runs/Jul10_07-25-46_49a3fa77351e/events.out.tfevents.1720596353.49a3fa77351e.25532.15
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:bbfdf9f0b8b097026d7dcddacc52bba2498c4f0b9cf7d325c3cd191f3bdeb814
|
| 3 |
+
size 10445
|
runs/Jul10_07-25-46_49a3fa77351e/events.out.tfevents.1720596909.49a3fa77351e.25532.16
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:bf9a3ae2a3d620aa0ad711365a85a6876e1ddb4ecbaca2a5e67f2822f84d3857
|
| 3 |
+
size 1032
|
training_args.bin
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 5112
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f4d11056e6153f1882de5760c23b4047c461a187135987c4d38e563a09bb3f8b
|
| 3 |
size 5112
|