saiteki-kai
/

QA-DeBERTa-v3-base-binary

+---
+library_name: transformers
+license: mit
+base_model: microsoft/deberta-v3-base
+tags:
+- generated_from_trainer
+metrics:
+- accuracy
+model-index:
+- name: QA-DeBERTa-v3-base-binary
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# QA-DeBERTa-v3-base-binary
+This model is a fine-tuned version of [microsoft/deberta-v3-base](https://huggingface.co/microsoft/deberta-v3-base) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.3289
+- Accuracy: 0.8592
+- Unsafe Precision: 0.8973
+- Unsafe Recall: 0.8435
+- Unsafe F1: 0.8696
+- Unsafe Fpr: 0.1211
+- Unsafe Aucpr: 0.9534
+- Safe Precision: 0.8174
+- Safe Recall: 0.8789
+- Safe F1: 0.8471
+- Safe Fpr: 0.1565
+- Safe Aucpr: 0.9158
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 6e-06
+- train_batch_size: 64
+- eval_batch_size: 64
+- seed: 42
+- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+- lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 1000
+- num_epochs: 10
+### Training results
+| Training Loss | Epoch  | Step  | Validation Loss | Accuracy | Unsafe Precision | Unsafe Recall | Unsafe F1 | Unsafe Fpr | Unsafe Aucpr | Safe Precision | Safe Recall | Safe F1 | Safe Fpr | Safe Aucpr |
+|:-------------:|:------:|:-----:|:---------------:|:--------:|:----------------:|:-------------:|:---------:|:----------:|:------------:|:--------------:|:-----------:|:-------:|:--------:|:----------:|
+| 0.3224        | 0.2501 | 2114  | 0.3766          | 0.8332   | 0.8908           | 0.7980        | 0.8419    | 0.1227     | 0.9351       | 0.7759         | 0.8773      | 0.8235  | 0.2020   | 0.8803     |
+| 0.3601        | 0.5001 | 4228  | 0.3495          | 0.8441   | 0.8565           | 0.8647        | 0.8606    | 0.1818     | 0.9433       | 0.8282         | 0.8182      | 0.8232  | 0.1353   | 0.8954     |
+| 0.3229        | 0.7502 | 6342  | 0.3409          | 0.8503   | 0.8891           | 0.8351        | 0.8612    | 0.1307     | 0.9464       | 0.8078         | 0.8693      | 0.8374  | 0.1649   | 0.9012     |
+| 0.3597        | 1.0002 | 8456  | 0.3344          | 0.8520   | 0.8695           | 0.8637        | 0.8666    | 0.1627     | 0.9482       | 0.8304         | 0.8373      | 0.8338  | 0.1363   | 0.9060     |
+| 0.3295        | 1.2503 | 10570 | 0.3357          | 0.8541   | 0.8741           | 0.8619        | 0.8680    | 0.1557     | 0.9491       | 0.8298         | 0.8443      | 0.8370  | 0.1381   | 0.9074     |
+| 0.2894        | 1.5004 | 12684 | 0.3492          | 0.8552   | 0.8710           | 0.8683        | 0.8697    | 0.1613     | 0.9500       | 0.8354         | 0.8387      | 0.8371  | 0.1317   | 0.9096     |
+| 0.2902        | 1.7504 | 14798 | 0.3321          | 0.8564   | 0.8678           | 0.8753        | 0.8715    | 0.1673     | 0.9510       | 0.8418         | 0.8327      | 0.8372  | 0.1247   | 0.9123     |
+| 0.3441        | 2.0005 | 16912 | 0.3246          | 0.8597   | 0.8804           | 0.8654        | 0.8728    | 0.1474     | 0.9524       | 0.8347         | 0.8526      | 0.8435  | 0.1346   | 0.9145     |
+| 0.3268        | 2.2505 | 19026 | 0.3310          | 0.8580   | 0.8664           | 0.8805        | 0.8734    | 0.1704     | 0.9516       | 0.8470         | 0.8296      | 0.8382  | 0.1195   | 0.9135     |
+| 0.2752        | 2.5006 | 21140 | 0.3318          | 0.8580   | 0.8647           | 0.8830        | 0.8737    | 0.1733     | 0.9521       | 0.8492         | 0.8267      | 0.8378  | 0.1170   | 0.9149     |
+| 0.2938        | 2.7507 | 23254 | 0.3236          | 0.8600   | 0.8759           | 0.8720        | 0.8739    | 0.1550     | 0.9529       | 0.8403         | 0.8450      | 0.8426  | 0.1280   | 0.9158     |
+| 0.2993        | 3.0007 | 25368 | 0.3229          | 0.8605   | 0.8826           | 0.8642        | 0.8733    | 0.1443     | 0.9532       | 0.8340         | 0.8557      | 0.8447  | 0.1358   | 0.9166     |
+| 0.2973        | 3.2508 | 27482 | 0.3283          | 0.8604   | 0.8831           | 0.8634        | 0.8731    | 0.1433     | 0.9529       | 0.8333         | 0.8567      | 0.8448  | 0.1366   | 0.9155     |
+| 0.2741        | 3.5008 | 29596 | 0.3288          | 0.8600   | 0.8832           | 0.8625        | 0.8727    | 0.1432     | 0.9531       | 0.8325         | 0.8568      | 0.8445  | 0.1375   | 0.9154     |
+| 0.3123        | 3.7509 | 31710 | 0.3289          | 0.8592   | 0.8973           | 0.8435        | 0.8696    | 0.1211     | 0.9534       | 0.8174         | 0.8789      | 0.8471  | 0.1565   | 0.9158     |
+### Framework versions
+- Transformers 4.57.3
+- Pytorch 2.7.1+cu118
+- Datasets 4.4.1
+- Tokenizers 0.22.1

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:188d8e9bd9cc217346e86b455ff7abe4422f99a6f60fd4fef38c4a847aec61d2
 size 737719272

 version https://git-lfs.github.com/spec/v1
+oid sha256:33e3435bfd6a59124f1eb0dacfafa60ea77eb8c3eb21be759281f8610eae269c
 size 737719272