SentimentT2 / README.md

LogischeIP

End of training

eefb672 verified about 2 years ago

1.92 kB

license: apache-2.0
base_model: distilbert-base-uncased
tags:
  - generated_from_trainer
metrics:
  - accuracy
  - f1
model-index:
  - name: SentimentT2
    results: []

SentimentT2

This model is a fine-tuned version of distilbert-base-uncased on the None dataset. It achieves the following results on the evaluation set:

Loss: 0.3310
Accuracy: 0.8557
F1: 0.8589
Auc Roc: 0.9323
Log Loss: 0.3310

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 1e-05
train_batch_size: 16
eval_batch_size: 20
seed: 42
gradient_accumulation_steps: 2
total_train_batch_size: 32
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: cosine
lr_scheduler_warmup_steps: 500
num_epochs: 4
mixed_precision_training: Native AMP

Training results

Training Loss	Epoch	Step	Validation Loss	Accuracy	F1	Auc Roc	Log Loss
0.6957	1.0	101	0.6817	0.6766	0.7085	0.7441	0.6817
0.6144	2.0	203	0.4599	0.8234	0.8360	0.9071	0.4599
0.4133	3.0	304	0.3519	0.8570	0.8578	0.9275	0.3519
0.3316	3.98	404	0.3310	0.8557	0.8589	0.9323	0.3310

Framework versions

Transformers 4.35.2
Pytorch 2.1.0+cu121
Datasets 2.16.1
Tokenizers 0.15.0