saiteki-kai commited on
Commit
c9cbd05
·
verified ·
1 Parent(s): 73e39d3

Model save

Browse files
Files changed (2) hide show
  1. README.md +81 -0
  2. model.safetensors +1 -1
README.md ADDED
@@ -0,0 +1,81 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ license: mit
4
+ base_model: microsoft/deberta-v3-large
5
+ tags:
6
+ - generated_from_trainer
7
+ metrics:
8
+ - accuracy
9
+ model-index:
10
+ - name: QA-DeBERTa-v3-large-diff-binary-2
11
+ results: []
12
+ ---
13
+
14
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
+ should probably proofread and complete it, then remove this comment. -->
16
+
17
+ # QA-DeBERTa-v3-large-diff-binary-2
18
+
19
+ This model is a fine-tuned version of [microsoft/deberta-v3-large](https://huggingface.co/microsoft/deberta-v3-large) on an unknown dataset.
20
+ It achieves the following results on the evaluation set:
21
+ - Loss: 0.3192
22
+ - Accuracy: 0.8627
23
+ - Unsafe Precision: 0.8876
24
+ - Unsafe Recall: 0.8625
25
+ - Unsafe F1: 0.8748
26
+ - Unsafe Fpr: 0.1370
27
+ - Unsafe Aucpr: 0.9550
28
+ - Safe Precision: 0.8334
29
+ - Safe Recall: 0.8630
30
+ - Safe F1: 0.8479
31
+ - Safe Fpr: 0.1375
32
+ - Safe Aucpr: 0.9220
33
+
34
+ ## Model description
35
+
36
+ More information needed
37
+
38
+ ## Intended uses & limitations
39
+
40
+ More information needed
41
+
42
+ ## Training and evaluation data
43
+
44
+ More information needed
45
+
46
+ ## Training procedure
47
+
48
+ ### Training hyperparameters
49
+
50
+ The following hyperparameters were used during training:
51
+ - learning_rate: 6e-06
52
+ - train_batch_size: 64
53
+ - eval_batch_size: 64
54
+ - seed: 42
55
+ - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
56
+ - lr_scheduler_type: linear
57
+ - lr_scheduler_warmup_steps: 1000
58
+ - num_epochs: 10
59
+
60
+ ### Training results
61
+
62
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy | Unsafe Precision | Unsafe Recall | Unsafe F1 | Unsafe Fpr | Unsafe Aucpr | Safe Precision | Safe Recall | Safe F1 | Safe Fpr | Safe Aucpr |
63
+ |:-------------:|:------:|:-----:|:---------------:|:--------:|:----------------:|:-------------:|:---------:|:----------:|:------------:|:--------------:|:-----------:|:-------:|:--------:|:----------:|
64
+ | 0.2998 | 0.2501 | 2114 | 0.3677 | 0.8446 | 0.9027 | 0.8078 | 0.8526 | 0.1093 | 0.9436 | 0.7870 | 0.8907 | 0.8356 | 0.1922 | 0.8961 |
65
+ | 0.3262 | 0.5001 | 4228 | 0.3278 | 0.8561 | 0.8786 | 0.8602 | 0.8693 | 0.1491 | 0.9495 | 0.8291 | 0.8509 | 0.8399 | 0.1398 | 0.9087 |
66
+ | 0.3019 | 0.7502 | 6342 | 0.3236 | 0.8588 | 0.8972 | 0.8429 | 0.8692 | 0.1211 | 0.9527 | 0.8168 | 0.8789 | 0.8467 | 0.1571 | 0.9155 |
67
+ | 0.3479 | 1.0002 | 8456 | 0.3215 | 0.8599 | 0.8690 | 0.8811 | 0.8750 | 0.1666 | 0.9531 | 0.8482 | 0.8334 | 0.8407 | 0.1189 | 0.9175 |
68
+ | 0.302 | 1.2503 | 10570 | 0.3221 | 0.8611 | 0.8839 | 0.8639 | 0.8738 | 0.1423 | 0.9536 | 0.8340 | 0.8577 | 0.8457 | 0.1361 | 0.9176 |
69
+ | 0.2663 | 1.5004 | 12684 | 0.3409 | 0.8609 | 0.8682 | 0.8842 | 0.8761 | 0.1684 | 0.9538 | 0.8512 | 0.8316 | 0.8413 | 0.1158 | 0.9184 |
70
+ | 0.2841 | 1.7504 | 14798 | 0.3223 | 0.8622 | 0.8772 | 0.8748 | 0.8760 | 0.1537 | 0.9551 | 0.8435 | 0.8463 | 0.8449 | 0.1252 | 0.9202 |
71
+ | 0.3074 | 2.0005 | 16912 | 0.3244 | 0.8632 | 0.8995 | 0.8490 | 0.8735 | 0.1190 | 0.9553 | 0.8230 | 0.8810 | 0.8510 | 0.1510 | 0.9182 |
72
+ | 0.3052 | 2.2505 | 19026 | 0.3200 | 0.8618 | 0.8833 | 0.8660 | 0.8746 | 0.1435 | 0.9546 | 0.8359 | 0.8565 | 0.8461 | 0.1340 | 0.9221 |
73
+ | 0.268 | 2.5006 | 21140 | 0.3192 | 0.8627 | 0.8876 | 0.8625 | 0.8748 | 0.1370 | 0.9550 | 0.8334 | 0.8630 | 0.8479 | 0.1375 | 0.9220 |
74
+
75
+
76
+ ### Framework versions
77
+
78
+ - Transformers 4.57.3
79
+ - Pytorch 2.7.1+cu118
80
+ - Datasets 4.4.1
81
+ - Tokenizers 0.22.1
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:00b21ca755c7b8d3c51eaf9dde2074c854ff10c18e30de9565b6eb77a90943ae
3
  size 1757094344
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e8c35244fa091ae864433966fb3a6de358d4a415b0bf3ee63cf6a31311ecf858
3
  size 1757094344