|
|
--- |
|
|
library_name: transformers |
|
|
license: mit |
|
|
base_model: microsoft/deberta-v3-large |
|
|
tags: |
|
|
- single_label_classification |
|
|
- question-answering |
|
|
- text-classification |
|
|
- generated_from_trainer |
|
|
datasets: |
|
|
- beavertails |
|
|
metrics: |
|
|
- accuracy |
|
|
model-index: |
|
|
- name: QA-DeBERTa-v3-large-diff-binary-2 |
|
|
results: |
|
|
- task: |
|
|
name: Text Classification |
|
|
type: text-classification |
|
|
dataset: |
|
|
name: saiteki-kai/Beavertails-it |
|
|
type: beavertails |
|
|
metrics: |
|
|
- name: Accuracy |
|
|
type: accuracy |
|
|
value: 0.8608643577203313 |
|
|
--- |
|
|
|
|
|
<!-- This model card has been generated automatically according to the information the Trainer had access to. You |
|
|
should probably proofread and complete it, then remove this comment. --> |
|
|
|
|
|
# QA-DeBERTa-v3-large-diff-binary-2 |
|
|
|
|
|
This model is a fine-tuned version of [microsoft/deberta-v3-large](https://huggingface.co/microsoft/deberta-v3-large) on the saiteki-kai/Beavertails-it dataset. |
|
|
It achieves the following results on the evaluation set: |
|
|
- Loss: 0.3409 |
|
|
- Accuracy: 0.8609 |
|
|
- Unsafe Precision: 0.8682 |
|
|
- Unsafe Recall: 0.8842 |
|
|
- Unsafe F1: 0.8761 |
|
|
- Unsafe Fpr: 0.1684 |
|
|
- Unsafe Aucpr: 0.9538 |
|
|
- Safe Precision: 0.8512 |
|
|
- Safe Recall: 0.8316 |
|
|
- Safe F1: 0.8413 |
|
|
- Safe Fpr: 0.1158 |
|
|
- Safe Aucpr: 0.9184 |
|
|
|
|
|
## Model description |
|
|
|
|
|
More information needed |
|
|
|
|
|
## Intended uses & limitations |
|
|
|
|
|
More information needed |
|
|
|
|
|
## Training and evaluation data |
|
|
|
|
|
More information needed |
|
|
|
|
|
## Training procedure |
|
|
|
|
|
### Training hyperparameters |
|
|
|
|
|
The following hyperparameters were used during training: |
|
|
- learning_rate: 6e-06 |
|
|
- train_batch_size: 64 |
|
|
- eval_batch_size: 64 |
|
|
- seed: 42 |
|
|
- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments |
|
|
- lr_scheduler_type: linear |
|
|
- lr_scheduler_warmup_steps: 1000 |
|
|
- num_epochs: 10 |
|
|
|
|
|
### Training results |
|
|
|
|
|
| Training Loss | Epoch | Step | Validation Loss | Accuracy | Unsafe Precision | Unsafe Recall | Unsafe F1 | Unsafe Fpr | Unsafe Aucpr | Safe Precision | Safe Recall | Safe F1 | Safe Fpr | Safe Aucpr | |
|
|
|:-------------:|:------:|:-----:|:---------------:|:--------:|:----------------:|:-------------:|:---------:|:----------:|:------------:|:--------------:|:-----------:|:-------:|:--------:|:----------:| |
|
|
| 0.2998 | 0.2501 | 2114 | 0.3677 | 0.8446 | 0.9027 | 0.8078 | 0.8526 | 0.1093 | 0.9436 | 0.7870 | 0.8907 | 0.8356 | 0.1922 | 0.8961 | |
|
|
| 0.3262 | 0.5001 | 4228 | 0.3278 | 0.8561 | 0.8786 | 0.8602 | 0.8693 | 0.1491 | 0.9495 | 0.8291 | 0.8509 | 0.8399 | 0.1398 | 0.9087 | |
|
|
| 0.3019 | 0.7502 | 6342 | 0.3236 | 0.8588 | 0.8972 | 0.8429 | 0.8692 | 0.1211 | 0.9527 | 0.8168 | 0.8789 | 0.8467 | 0.1571 | 0.9155 | |
|
|
| 0.3479 | 1.0002 | 8456 | 0.3215 | 0.8599 | 0.8690 | 0.8811 | 0.8750 | 0.1666 | 0.9531 | 0.8482 | 0.8334 | 0.8407 | 0.1189 | 0.9175 | |
|
|
| 0.302 | 1.2503 | 10570 | 0.3221 | 0.8611 | 0.8839 | 0.8639 | 0.8738 | 0.1423 | 0.9536 | 0.8340 | 0.8577 | 0.8457 | 0.1361 | 0.9176 | |
|
|
| 0.2663 | 1.5004 | 12684 | 0.3409 | 0.8609 | 0.8682 | 0.8842 | 0.8761 | 0.1684 | 0.9538 | 0.8512 | 0.8316 | 0.8413 | 0.1158 | 0.9184 | |
|
|
| 0.2841 | 1.7504 | 14798 | 0.3223 | 0.8622 | 0.8772 | 0.8748 | 0.8760 | 0.1537 | 0.9551 | 0.8435 | 0.8463 | 0.8449 | 0.1252 | 0.9202 | |
|
|
| 0.3074 | 2.0005 | 16912 | 0.3244 | 0.8632 | 0.8995 | 0.8490 | 0.8735 | 0.1190 | 0.9553 | 0.8230 | 0.8810 | 0.8510 | 0.1510 | 0.9182 | |
|
|
| 0.3052 | 2.2505 | 19026 | 0.3200 | 0.8618 | 0.8833 | 0.8660 | 0.8746 | 0.1435 | 0.9546 | 0.8359 | 0.8565 | 0.8461 | 0.1340 | 0.9221 | |
|
|
| 0.268 | 2.5006 | 21140 | 0.3192 | 0.8627 | 0.8876 | 0.8625 | 0.8748 | 0.1370 | 0.9550 | 0.8334 | 0.8630 | 0.8479 | 0.1375 | 0.9220 | |
|
|
|
|
|
|
|
|
### Framework versions |
|
|
|
|
|
- Transformers 4.57.3 |
|
|
- Pytorch 2.7.1+cu118 |
|
|
- Datasets 4.4.1 |
|
|
- Tokenizers 0.22.1 |
|
|
|