---
library_name: transformers
license: mit
base_model: microsoft/mdeberta-v3-base
tags:
  - generated_from_trainer
metrics:
  - accuracy
  - precision
  - recall
  - f1
model-index:
  - name: mdeberta-v3-base-name-classifier-v2
    results: []
---

# mdeberta-v3-base-name-classifier-v2

This model is a fine-tuned version of [microsoft/mdeberta-v3-base](https://huggingface.co/microsoft/mdeberta-v3-base) on an unknown dataset. It achieves the following results on the evaluation set:

- Loss: 0.0269
- Accuracy: 0.9927
- Precision: 0.9966
- Recall: 0.9906
- F1: 0.9936
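The card does not document the label set or the classification task beyond the model name, so the snippet below is only a minimal inference sketch: it assumes the repository id `ele-sage/mdeberta-v3-base-name-classifier-v2` and a standard sequence-classification head, with the actual labels coming from the model's `config.json`.

```python
# Minimal inference sketch (assumption: standard text-classification head;
# the real label names come from the model's config.json, not from this card).
from transformers import pipeline

classifier = pipeline(
    "text-classification",
    model="ele-sage/mdeberta-v3-base-name-classifier-v2",
)

# Example inputs are illustrative only.
print(classifier(["John Smith", "quarterly revenue report"]))
```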

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training (mirrored in the configuration sketch after the list):

- learning_rate: 1e-05
- train_batch_size: 128
- eval_batch_size: 128
- seed: 42
- optimizer: ADAMW_TORCH_FUSED (fused AdamW in PyTorch) with betas=(0.9, 0.999) and epsilon=1e-08; no additional optimizer arguments
- lr_scheduler_type: linear
- lr_scheduler_warmup_steps: 2000
- num_epochs: 1
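As a rough sketch, the listed hyperparameters map onto `TrainingArguments` as shown below; the output directory and anything not listed above are placeholders, not values taken from the actual training run.

```python
# Sketch only: mirrors the hyperparameters listed above; output_dir and any
# unlisted settings (logging, evaluation cadence, data handling) are placeholders.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="mdeberta-v3-base-name-classifier-v2",
    learning_rate=1e-5,
    per_device_train_batch_size=128,
    per_device_eval_batch_size=128,
    seed=42,
    optim="adamw_torch_fused",   # fused AdamW, betas=(0.9, 0.999), eps=1e-08
    lr_scheduler_type="linear",
    warmup_steps=2000,
    num_train_epochs=1,
)
```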

### Training results

| Training Loss | Epoch  | Step  | Validation Loss | Accuracy | Precision | Recall | F1     |
|:-------------:|:------:|:-----:|:---------------:|:--------:|:---------:|:------:|:------:|
| 0.0495        | 0.0411 | 2000  | 0.0448          | 0.9881   | 0.9963    | 0.9826 | 0.9894 |
| 0.0433        | 0.0821 | 4000  | 0.0427          | 0.9906   | 0.9955    | 0.9879 | 0.9917 |
| 0.0385        | 0.1232 | 6000  | 0.0397          | 0.9900   | 0.9973    | 0.9851 | 0.9912 |
| 0.0315        | 0.1642 | 8000  | 0.0328          | 0.9913   | 0.9961    | 0.9885 | 0.9923 |
| 0.0326        | 0.2053 | 10000 | 0.0317          | 0.9914   | 0.9944    | 0.9905 | 0.9924 |
| 0.0331        | 0.2464 | 12000 | 0.0317          | 0.9917   | 0.9965    | 0.9888 | 0.9926 |
| 0.0383        | 0.2874 | 14000 | 0.0302          | 0.9919   | 0.9947    | 0.9909 | 0.9928 |
| 0.0299        | 0.3285 | 16000 | 0.0296          | 0.9919   | 0.9953    | 0.9904 | 0.9929 |
| 0.0299        | 0.3696 | 18000 | 0.0294          | 0.9921   | 0.9959    | 0.9901 | 0.9930 |
| 0.0322        | 0.4106 | 20000 | 0.0297          | 0.9921   | 0.9970    | 0.9890 | 0.9930 |
| 0.0323        | 0.4517 | 22000 | 0.0288          | 0.9922   | 0.9959    | 0.9903 | 0.9931 |
| 0.0309        | 0.4927 | 24000 | 0.0292          | 0.9921   | 0.9972    | 0.9889 | 0.9930 |
| 0.0325        | 0.5338 | 26000 | 0.0284          | 0.9921   | 0.9949    | 0.9912 | 0.9930 |
| 0.0292        | 0.5749 | 28000 | 0.0279          | 0.9923   | 0.9961    | 0.9903 | 0.9932 |
| 0.0265        | 0.6159 | 30000 | 0.0289          | 0.9923   | 0.9953    | 0.9911 | 0.9932 |
| 0.0291        | 0.6570 | 32000 | 0.0279          | 0.9924   | 0.9963    | 0.9903 | 0.9933 |
| 0.0307        | 0.6981 | 34000 | 0.0276          | 0.9926   | 0.9965    | 0.9904 | 0.9934 |
| 0.0286        | 0.7391 | 36000 | 0.0273          | 0.9926   | 0.9967    | 0.9903 | 0.9935 |
| 0.0287        | 0.7802 | 38000 | 0.0269          | 0.9927   | 0.9970    | 0.9901 | 0.9935 |
| 0.0284        | 0.8212 | 40000 | 0.0278          | 0.9925   | 0.9958    | 0.9910 | 0.9934 |
| 0.0258        | 0.8623 | 42000 | 0.0274          | 0.9927   | 0.9965    | 0.9905 | 0.9935 |
| 0.0245        | 0.9034 | 44000 | 0.0273          | 0.9927   | 0.9967    | 0.9904 | 0.9935 |
| 0.0324        | 0.9444 | 46000 | 0.0269          | 0.9927   | 0.9964    | 0.9907 | 0.9935 |
| 0.0311        | 0.9855 | 48000 | 0.0269          | 0.9927   | 0.9966    | 0.9906 | 0.9936 |
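The per-step accuracy, precision, recall, and F1 values above could be produced by a `compute_metrics` callback along the following lines. This is an illustrative assumption: binary labels and the scikit-learn dependency are not confirmed by the card.

```python
# Illustrative sketch of a compute_metrics callback for the Trainer; the binary
# averaging and use of scikit-learn are assumptions, not documented in this card.
import numpy as np
from sklearn.metrics import accuracy_score, precision_recall_fscore_support

def compute_metrics(eval_pred):
    logits, labels = eval_pred
    preds = np.argmax(logits, axis=-1)
    precision, recall, f1, _ = precision_recall_fscore_support(
        labels, preds, average="binary"
    )
    return {
        "accuracy": accuracy_score(labels, preds),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }
```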

### Framework versions

- Transformers 4.57.1
- Pytorch 2.9.0+cu128
- Datasets 4.4.1
- Tokenizers 0.22.1